- Click File and select New Batch.
- In the dialog box that opens, select the Document Definition that you created earlier, then select the section for which you have configured fields and click OK.
- In the Look up Variant for Training Batch window, select the variant to be used for training.
- Select the newly created batch and either select the NLP batch option or click Field extraction training > NLP batch.

- Open the batch that you have created by double-clicking it.
- Click File > Load Images….
- In the dialog box that opens, click Image Processing Settings…, select the One document per file option, and click OK.
- Choose the documents to be used for training the NLP model.
- After all the documents have been loaded, select them and click Recognition > Match Document Definition. Alternatively, right-click the selection and click Match Document Definition. Then choose the appropriate Document Definition.
- All the fields described by the Document Definition should be marked up in the training documents.
- It is recommended to have between 100 and 500 documents in each training batch. This number of documents will enable the program to select the best parameters for your NLP model without slowing down the training process.
- Double-click a document to open it.
- Select a field for which information from the document should be extracted. Then either choose the value of the field on the document or draw a rectangle around it. Repeat this step for each field.
- Go to the next document by clicking the
button. Repeat the above steps for all the remaining documents. - Save the changes.
