- Select Classification Training → View Statistics in the main menu, or
- Click the
Statistics button on the toolbar.
- F-measure, Recall,andPrecision – The higher these values, the more precise the classification results. (For more details about how the F-measure is calculated, see Glossary, Classifier F-measure).
- The number of pages with reference classes
- Page classification results:
- True Positive – The number of pages to which the reference class was assigned.
- False Positive – The number of pages to which a class other than the reference class was assigned.
- False Negative – The number of pages with a reference class to which no class was assigned.
- True Negative – The number of pages with no reference classes to which no class was assigned.
- Confusion Matrix. The confusion matrix is a visual representation of the documents that are most often confused by a classifier. The values in the matrix cells represent the ratios of reference classes to result classes. Green cells show the number of pages to which a class was correctly assigned. Red cells show the number of pages with confused classes — classes that have been incorrectly assigned to pages with a reference class by the classifier.
Tools for working with the Confusion Matrix
Tools for working with the Confusion Matrix
- The Confused only option disables the visibility for classes, where the result classes for all pages corresponded to their reference class.
- The Pages and Percent buttons let the user switch between number and percentage data regarding the number of pages with correctly identified and confused classes (the percentage is calculated using the ratio of pages with a correctly assigned class relative to all pages with the same reference class).
- The matrix scale can be managed as follows:
- displays the matrix using a fixed scale;
- displays the whole matrix;
- zoom in;
- zoom out.
Show the Confusion Matrix
Show the Confusion Matrix

- Statistics by Class. A table containing statistics for pages for which the result class did not match the reference class. Lets the user identify the classes that cause the most errors for a given classifier. You can sort by number of confused pages, as well as by the ratio of confused pages to the total number of pages of this reference class.
- Confusing Classes. This tab contains a list of all classes that have been incorrectly assigned by a classifier. Using this data, you can determine which classes are most often confused with each other.
- Summary statistics for major classification parameters: F-measure, Recall, Precision and classification results broken down by page.
- Major classification parameters broken down by class.
- Confusing classes – the number and percentage of pages for each confusing class.
- All classes – the number and percentage of pages for each class.
- Added/removed documents with the For Training assigned;
- The For Training state has been assigned to or removed from a document;
- Classes have been added, deleted, or merged;
- A different reference class has been assigned to a document;
- A classification profile and/or the precision-recall priority have been modified.
