Skip to main content
In ABBYY Vantage, processing and OCR results are available in JSON and XML formats: Vantage will generate a separate JSON or XML file or a set of such files for each processed document — depending on the skill applied to the document and the Output activity settings if a Process skill is used. The following table describes the types of output files available in Vantage. All of these file types are part of the ResultFileType enumerator in the Vantage API.
File typeDescription
JsonExtracted data, such as field values, field structure, and rule check errors.
PdfPDF document with or without a text layer.
TextExtracted text data in TXT format.
DocxExtracted text and image data in DOCX format.
XlsxExtracted text and image data in XLSX format.
OcrJsonFull-text recognition data in JSON format.
FieldsJsonSimplified JSON file. Contains only field values and rule check errors.
FieldPictureExtracted image fields in JPG format.
XmlExtracted full-text recognition data.
TiffExtracted image data in TIFF format.
JpegExtracted image data in JPG format.
CsvExtracted data values of repeating or non-repeating fields.
HtmlExtracted full-text recognition data in HTML format.
PptxExtracted text and image data in PPTX format.
AltoExtracted full-text recognition data in XML format that corresponds to ALTO standard, schema version 4.2.
When working with the Vantage API, you can get information about output files of the processed transaction using the GET https://your-server/api/publicapi/v1/transactions/transaction-id request. The file type is in the type property of each object in the resultFile array. For more information about file types, see Output activity and Image (FieldPicture).