Extraction Rules and Activites for Semi-Structured Documents
Upload and Pre-recognition of Images
Select documents, upload images, and configure pre-recognition settings for Extraction Rules activities.To begin creating an Extraction Rules activity, first select the documents that will be used to set up the activity. You can also use the skill document set (selected by default) or select a particular document set for a specific activity using the drop-down list to the left of the Toolbar. For more information about document sets, see Documents. To add documents to a selected document set, click Upload.Document processing begins with a full-text recognition of the document images, the results of which are then used to carry out object searches. Documents are automatically pre-recognized using skill settings.As part of pre-recognition, the program uses the images to look for objects to be used to search for Extraction Rules activity elements. You will be required to carry out an analysis of these objects when creating your activity elements and setting up such characteristics as minimum and maximum number of words/characters, maximum recognition error percentage, etc. Once recognition is completed, regions for detected objects are displayed on the image. You can specify the types of objects to be displayed using the Show Image Objects button on the toolbar. The following objects can be highlighted:
Recognized Words
Recognized Lines
Separators
Barcodes
Raw Objects
You can select and deselect objects of all types by choosing Show All Objects and Hide All Objects respectively.Advanced Designer automatically detects and corrects the orientation of a page. If required, you can change the page orientation manually by clicking the rotate icon in the toolbar and selecting one of the following options from the drop-down list: Rotate Left, Rotate Right, or Rotate 180º.