- In the Document Definition Editor, right-click the Document Section name and select Create Field.
- Create a Text field.
- On the General tab, select the Can have region option.
- In the Name field, specify a name for the field (for example, PreambleSegment).
Important! Field names must not contain spaces or non-English characters, and must not start with a number.
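The naming rule above can be checked before you type names into the editor. The following is a minimal sketch, not part of the product: `is_valid_field_name` is a hypothetical helper, and it assumes that "English characters" means ASCII letters and that digits and underscores are acceptable after the first character. The product's actual rule may be stricter or looser.

```python
import re

# Assumption: allowed characters are ASCII letters, digits, and underscores,
# and the first character must not be a digit. This pattern is illustrative only.
_FIELD_NAME = re.compile(r"[A-Za-z_][A-Za-z0-9_]*")


def is_valid_field_name(name: str) -> bool:
    """Return True if the name has no spaces, no non-ASCII characters,
    and does not start with a digit."""
    return _FIELD_NAME.fullmatch(name) is not None


if __name__ == "__main__":
    for candidate in ["PreambleSegment", "Preamble Segment", "1Segment"]:
        print(candidate, is_valid_field_name(candidate))
```

Under these assumptions, `PreambleSegment` passes, while `Preamble Segment` (contains a space) and `1Segment` (starts with a number) are rejected.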

If segmentation is used, a separate text field should be created for each segment.
- Create a non-repeating field in a repeating group.
- Select the Text segment option in the field properties.
- Select the Allow multiple regions option if some of the segments begin and end on different pages.
Important! You can have only one segmentation model for each document section.
To create a segmentation model:
- In the Document Definition Editor, right-click the Document Section name.
- Select Properties…
- In the dialog box that opens, click the NLP tab and then click Create…
- In the Name field, specify a name for your segmentation model (for example, SegmentationModel).
- In the Model type field, choose Segmentation.
- In the Language list, select the required language.

- Click Next…
- In the dialog box that opens, specify all the fields into which the segments will be extracted.
- Click OK.
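The constraints described so far (one segmentation model per document section, and a separate text field for each segment) can be written down as a small planning check. This is only an illustrative sketch for reviewing a planned configuration on paper: `NlpModel` and `check_section_models` are hypothetical names and do not correspond to any product API.

```python
from dataclasses import dataclass, field
from typing import List


@dataclass
class NlpModel:
    """Hypothetical description of one NLP model planned for a document section."""
    name: str
    model_type: str  # "Segmentation" or "Extraction"
    result_fields: List[str] = field(default_factory=list)


def check_section_models(models: List[NlpModel]) -> List[str]:
    """Return a list of human-readable problems found in the planned models."""
    problems = []
    segmentation = [m for m in models if m.model_type == "Segmentation"]
    # Only one segmentation model is allowed per document section.
    if len(segmentation) > 1:
        names = ", ".join(m.name for m in segmentation)
        problems.append("more than one segmentation model: " + names)
    for m in segmentation:
        # Each segment needs its own text field, so names must be present and unique.
        if not m.result_fields:
            problems.append(m.name + ": no segment fields specified")
        elif len(set(m.result_fields)) != len(m.result_fields):
            problems.append(m.name + ": a field is listed for more than one segment")
    return problems


if __name__ == "__main__":
    plan = [NlpModel("SegmentationModel", "Segmentation", ["PreambleSegment"])]
    print(check_section_models(plan))
```

An empty result means the plan does not violate either of the two rules; it says nothing about whether the model itself will perform well.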
The Allow training option lets you train your NLP model during document processing. The model is trained whenever you train field extraction using a field extraction training batch. Training results can be either disabled or deleted. To disable training results, right-click the training batch and select Disabled on the shortcut menu. To delete training results, right-click the training batch and select Delete on the shortcut menu.
- In the Document Definition Editor, open the document section properties and click the NLP tab.
- Click Create…
- Specify a Name for your NLP model (for example, EntitiesExtraction).
- For the data source, select either a section (if no segmentation is used) or a segment (if you have chosen to use segmentation).
- In the Model type field, choose Extraction.
- In the Language list, select the required language.
- Click Next…
- Choose the result fields that will be extracted from the selected document section or segment.
- Click Document Definition > Save to save your Document Definition.
- Click Document Definition > Close to close the Document Definition editor.
- Click Document Definition > Publish to publish your Document Definition.
The Allow training option lets you train your NLP model during document processing. The model is trained whenever you train field extraction using a field extraction training batch. Training results can be either disabled or deleted. To disable training results, right-click the training batch and select Disabled on the shortcut menu. To delete training results, right-click the training batch and select Delete on the shortcut menu.
