Json | Extracted data, such as field values, field structure, and rule check errors. |
Pdf | PDF document with or without a text layer. |
Text | Extracted text data in TXT format. |
Docx | Extracted text and image data in DOCX format. |
Xlsx | Extracted text and image data in XLSX format. |
OcrJson | Full-text recognition data in JSON format. |
FieldsJson | Simplified JSON file. Contains only field values and rule check errors. |
FieldPicture | Extracted image fields in JPG format. |
Xml | Extracted full-text recognition data. |
Tiff | Extracted image data in TIFF format. |
Jpeg | Extracted image data in JPG format. |
Csv | Extracted data values of repeating or non-repeating fields. |
Html | Extracted full-text recognition data in HTML format. |
Pptx | Extracted text and image data in PPTX format. |
Alto | Extracted full-text recognition data in XML format that corresponds to ALTO standard, schema version 4.2. |