Setup Overview
To create and set up a Classify By Text and Image activity, follow these steps:- Create a Classify By Text and Image activity in the document processing flow.
- Upload images, create classes, assign expected classes to documents.
- Train the activity and analyze training results.
- Modify properties if classification results need to be improved.
Creating and Setting Up Using the Activities Tab
Create a Classify By Text and Image activity in the workflow. When it is created, a field to record the classification results will be created in the skill structure. The value of this field will be used to classify documents. This field will be displayed in the skill field structure, however, it will be marked as hidden and will not be editable.Note: A Classify By Text and Image activity does not return a confidence value for a class, it only returns its name.To navigate to the Activity Editor, click Activity Editor or double-click the activity block.
Setting Up Using the Activity Editor
Step 1: Upload Documents
Upload documents that will be used to set the activity up by clicking Upload in the toolbar and selecting an upload method: a. Upload Documents… Use the dialog box that will open to select the appropriate documents. The selected documents will be displayed in the No Class list. b. Upload Folder Like Classes… Use the dialog box that will open to select a folder that contains subfolders with images. Each subfolder should contain images of a single class. Uploading documents this way will automatically create classes that correspond to subfolders, with documents in those respective subfolders classified to be of that class. As such, you will not need to manually create classes in the Activity Editor.Step 2: Create Classes
Create classes that correspond to the different types of documents being processed by clicking either Create Class in the toolbar or Create in the Assign class pane. If your documents were uploaded using Upload folder like classes, make sure that all required classes have been created.Step 3: Classify Documents
Classify your documents using one of the following methods:- Select all documents of a single class in the list and click an appropriate class name in the Assign class pane.
- If an appropriate class has not been created yet, select all appropriate documents in the list and create a class by clicking either Create Class in the toolbar or Create in the Assign class pane.
- Select all documents of a single class and drag them to the list that corresponds to that class.
Additional Options
If required, you can change the orientation of document pages using the Rotate drop-down menu on the toolbar. You can select one of the following options: Rotate All Pages Left, Rotate All Pages Right, or Rotate All Pages 180º. To switch view modes, use the following buttons in the toolbar:- List view. Displays documents as a list
- Thumbnail view. Displays documents as thumbnails
Training a Classifier and Viewing Classification Results
Once documents have been classified, train your activity using the Train Activity button. After training has finished, statistics regarding the classification results will be displayed on the Results tab. Analyzing these statistics helps identify problem classes and evaluate the general quality of the classifier.General Statistics
The top pane displays general statistics for all documents and classes of the activity. These statistics help evaluate the general quality of your classifier:- accuracy. The percentage of documents the expected class of which matched the class assigned by the program.
- F-Measure. Use to evaluate classification precision and completeness.
- Recall. The ratio of documents correctly classified as a specific class to all documents of that class.
- Precision. The ratio of documents correctly classified as a specific class to all documents classified as that class (both correctly and incorrectly).
