Configuring auto-learning
To configure auto-learning, complete the following steps:
- Create a Document Definition.
- In the section properties of the Document Definition, select Allow field location training.
- Create the necessary fields in the section. Select Can have region in the properties of each field.
- Save and publish the Document Definition.
- In the batch type properties dialog box, click the Workflow tab and enable the Training stage.
Additional steps required to configure variants
To enable the system to use variants in auto-learning, complete the following steps:
- Add section variants using one of the following three methods:
  - Create variants manually. To do this, click the Data Sets tab in the section properties and then click the View… button. Then click the Add… button to add variants.
  - Load variants from a database. To do this, click the Data Sets tab in the section properties and then click the Set Up… button. From the drop-down list, select Database as the data source.
  - Create variants using a script. To do this, click the Data Sets tab in the section properties and then click the Set Up… button. From the drop-down list, select Script as the data source.
- Save and publish the Document Definition.
- Train a classifier on the newly created variants:
  - Switch to Open Classifier Training Batches mode and load document images into a new batch.
  - Assign a reference class to each document, using variants as separate classes:
    - Click Set Class… → Add… → Add…
    - Select Specify variant.
    - Select a variant from the list.
  - Train the classifier by clicking Project → Classification Training → Train.
Field extraction training can also be done by the administrator if a project has to be trained before the operators start working on it.
The auto-learning procedure
- Documents whose field locations have been verified and corrected by the operators are placed into a training batch.
- The documents are matched against the current version of the trained FlexiLayout. If all the fields are found correctly, there is no need to retrain the FlexiLayout.
  Note: The current FlexiLayout may find the fields correctly even though the operator had to correct them. This happens when the documents were processed using an old or untrained version of the FlexiLayout and, while they were awaiting verification, the system trained the FlexiLayout on some other documents. As a result, the given documents are now processed correctly. In this case, the documents are kept in the training batch with For testing status. They will be used for regression tests to prevent future versions of the FlexiLayout from degrading.
- If the trained FlexiLayout is applied and some of the field regions do not match, the documents will be used to train a new version of the FlexiLayout. They will be assigned For training status.
- The result of training is a new version of the FlexiLayout. To compare the new version with the previous version, both are applied to the documents in the training batch that have For training or For testing status. The system checks how well the detected field regions match the layout that has been confirmed by the user. The FlexiLayout that yields the best match is used in further document processing, and the inferior version is deleted.
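The procedure above can be illustrated with a minimal Python sketch. It is not the product's implementation or API: the `Document` class, the layout functions, and the exact-match scoring are all assumptions made for illustration, and the real system may measure the match between detected and confirmed regions differently. The sketch only models the logic of assigning For testing and For training statuses and of keeping the better of two FlexiLayout versions.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List, Tuple

# Hypothetical types: a region is (left, top, right, bottom); a "layout" is
# any function that returns the field regions it detects on a document.
Region = Tuple[int, int, int, int]
Layout = Callable[["Document"], Dict[str, Region]]


@dataclass
class Document:
    name: str
    confirmed: Dict[str, Region]  # field regions confirmed by the operator
    status: str = ""


def assign_status(layout: Layout, doc: Document) -> str:
    """Mark a verified document according to how the current layout handles it."""
    if layout(doc) == doc.confirmed:
        doc.status = "For testing"   # already correct: keep for regression tests
    else:
        doc.status = "For training"  # mismatch: use to train a new version
    return doc.status


def score(layout: Layout, docs: List[Document]) -> float:
    """Fraction of confirmed regions reproduced exactly (assumed metric)."""
    hits = total = 0
    for doc in docs:
        detected = layout(doc)
        for field_name, region in doc.confirmed.items():
            total += 1
            hits += detected.get(field_name) == region
    return hits / total if total else 1.0


def select_layout(current: Layout, candidate: Layout,
                  docs: List[Document]) -> Layout:
    """Keep whichever version matches the batch better.

    Keeping the current version on a tie is an assumption of this sketch.
    """
    if score(candidate, docs) > score(current, docs):
        return candidate
    return current


if __name__ == "__main__":
    doc_a = Document("a", {"Total": (10, 10, 50, 20), "Date": (10, 30, 50, 40)})
    doc_b = Document("b", {"Total": (12, 10, 52, 20), "Date": (10, 30, 50, 40)})

    def old_layout(doc: Document) -> Dict[str, Region]:
        # Always reports the same regions, so it misses the shifted Total field.
        return {"Total": (10, 10, 50, 20), "Date": (10, 30, 50, 40)}

    def new_layout(doc: Document) -> Dict[str, Region]:
        # Stand-in for a retrained version that reproduces confirmed regions.
        return dict(doc.confirmed)

    batch = [doc_a, doc_b]
    for doc in batch:
        print(doc.name, assign_status(old_layout, doc))
    best = select_layout(old_layout, new_layout, batch)
    print("kept:", best.__name__)
```

Scoring both versions on the same batch, including the For testing documents, is what makes the comparison act as a regression test: a new version that fixes one document but breaks previously correct ones does not automatically win.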
