Skip to main content
The following table describes various available output XML file types. The Skill type column lists the skill types that can generate a particular type of XML file.
File typeExport optionDescriptionSkill typeFile nameXML schema
XMLText onlyThis mode is suitable for extracting all text from the input image, including small text areas of low quality. The document appearance and structure are ignored, pictures and tables are not detected. It is designed for the situations when you need to retrieve the data from the image for some further processing on your side, such as extracting data from bills, receipts or invoices. Selecting this mode makes export to DOCX and XLSX impossible.OCR<Source_file_name>.xml **XML schema
XMLText onlyThis mode is suitable for extracting all text from the input image, including small text areas of low quality. The document appearance and structure are ignored, pictures and tables are not detected. It is designed for the situations when you need to retrieve the data from the image for some further processing on your side, such as extracting data from bills, receipts or invoices. Selecting this mode makes export to DOCX and XLSX impossible.Process<Applied_skill_name>.xml *XML schema
XMLPreserve document structureThis mode is focused on retaining the original document structure and appearance, including font styles, pictures, background color, etc., and is more focused on documents like agreements, contracts, specifications.OCR<Source_file_name>.xml **XML schema
XMLPreserve document structureThis mode is focused on retaining the original document structure and appearance, including font styles, pictures, background color, etc., and is more focused on documents like agreements, contracts, specifications.Process<Applied_skill_name>.xml *XML schema

File Naming Conventions

*<Applied_skill_name> will be one of the following:
  • The name of the skill if a transaction was created for a Document or Classification skill.
  • If a transaction was created for a Process skill:
    • the name of the last Document skill applied to the document
    • the name of the last Classification skill applied to the document if no Document skills were applied
    • “Unknown” if no Document and Classification skills were applied, while at least one of them exists in the Process skill flow.
** <Source_file_name> will be the name of the first file used to assemble the document, e.g. “IMG_12234”.