To create an OCR skill, follow these steps:
  1. In the Skill Catalog, click the Create button in the toolbar.
  2. Select the OCR Skill skill type. The Create OCR Skill dialog box opens.
  3. On the General tab, enter a name and description for the new skill.
  4. On the Languages tab:
    a. In the Allowed Languages section, select one or several document recognition languages. During processing, the document language is chosen automatically from the languages selected here. Note that the number of selected languages may affect recognition speed and quality.
    b. If the document contains handwritten text, enable the Handwritten option in the Text Appearance section.
  5. On the Image Enhancements tab, Crop Image and Correct Page Orientation are enabled by default. You can turn these features off if your documents do not require them.
    Crop Image crops the image to the edges of the original document. Correct Page Orientation automatically rotates the image to restore its original orientation.
  6. On the Barcodes tab, enable the barcode types that may appear on your documents. The number of selected barcode types can affect recognition speed.
    Important! If you process several document files with an OCR skill as part of a single transaction, all files are merged into one document. As a result, the number of output files equals the number of selected export formats.
  7. On the Export tab, select one or several document export formats. For more information, see Available Export Formats.
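
The note about merged files can be illustrated with a short sketch. This is not part of the product or its API: the function name `expected_output_count` and its parameters are hypothetical, and the code only models the rule stated above (the files in one transaction are merged, and one output file is produced per selected export format).

```python
def expected_output_count(input_files, export_formats):
    """Model the rule: all files in a transaction are merged into one document,
    which is then exported once per selected format.

    Both parameters are hypothetical names used only for this illustration.
    """
    if not input_files or not export_formats:
        return 0
    # Count each selected format once, however many input files there are.
    return len(set(export_formats))


# Three input files and two export formats yield two output files.
count = expected_output_count(["a.pdf", "b.pdf", "c.pdf"], ["JSON", "PDF"])
print(count)
```

Under this model, adding more input files to the transaction does not change the number of output files; only the number of selected export formats does.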

Available Export Formats

  • JSON (default format):
    • Text only (default option). The exported JSON file will contain only the recognized text; the document layout will not be preserved. If you select Text only, you cannot export to DOCX, XLSX, or PPTX.
    • Preserve document structure. The exported JSON file will contain recognized text and the document layout will be preserved as well.
  • XML:
    • Text only. The exported XML file will contain only the recognized text; the document layout will not be preserved. If you select Text only, you cannot export to DOCX, XLSX, or PPTX.
    • Preserve document structure. The exported XML file will contain recognized text and the document layout will be preserved.
  • ALTOXML:
    • Text only. The exported ALTO XML file will contain only the recognized text; the document layout will not be preserved. If you select Text only, you cannot export to DOCX, XLSX, or PPTX.
    • Preserve document structure. The exported ALTO XML file will contain recognized text and the document layout will be preserved.
Important! JSON, XML, and ALTOXML always share the same export option (Text only or Preserve document structure). If you change the option for one of these formats, the change is applied to the other two as well.
  • PDF:
    • PDF/A-3a (the default PDF export format)
    • PDF/A-3b
    • Image-only. A non-editable PDF that conforms to the PDF/A-3b standard.
Tip: For each PDF export option, choose between “smaller file size” (default option) and “maximum quality”. Smaller file size is achieved by using Mixed Raster Content (MRC) compression, which determines optimal compression rates separately for the text, the pictures, and the background.
  • TXT
  • DOCX (Microsoft Word):
    • Editable. The exported DOCX file preserves the original format and text flow but at the same time allows for easy editing. The output document may differ from the original image.
    • Exact. The exported DOCX file maintains the formatting of the original document. This may limit the changes that can be made to the text and formatting of the output document.
  • XLSX (Microsoft Excel)
  • TIFF
  • JPEG:
    • Maximum quality. The exported file contains a JPEG with a compression level of 95%.
    • Reduced size. The exported file contains a JPEG with a compression level of 75%.
  • PPTX (Microsoft PowerPoint)
  • HTML
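
The two constraints described in this list (JSON, XML, and ALTOXML share one export option, and Text only rules out DOCX, XLSX, and PPTX) can be sketched as a small check. This is an illustration only and assumes nothing about how the product stores its settings: the names `find_export_conflicts`, `STRUCTURED_FORMATS`, and `LAYOUT_DEPENDENT_FORMATS` are hypothetical.

```python
# Hypothetical constants for illustration; they mirror the format list above.
STRUCTURED_FORMATS = {"JSON", "XML", "ALTOXML"}
LAYOUT_DEPENDENT_FORMATS = {"DOCX", "XLSX", "PPTX"}
EXPORT_OPTIONS = {"Text only", "Preserve document structure"}


def find_export_conflicts(formats, structure_option):
    """Return the selected formats that conflict with the shared export option.

    `structure_option` is the single option applied to JSON, XML, and ALTOXML,
    since those three formats cannot have different options.
    """
    if structure_option not in EXPORT_OPTIONS:
        raise ValueError(f"Unknown export option: {structure_option!r}")
    selected = set(formats)
    # Text only on a structured format excludes DOCX, XLSX, and PPTX.
    if structure_option == "Text only" and selected & STRUCTURED_FORMATS:
        return sorted(selected & LAYOUT_DEPENDENT_FORMATS)
    return []


conflicts = find_export_conflicts(["JSON", "DOCX", "PDF"], "Text only")
print(conflicts)
```

Modeling the option as a single argument rather than one per format reflects the rule that the three structured formats cannot differ; switching it to Preserve document structure clears the conflict.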