Each element of the pages array describes a single document page and contains its own arrays for the texts, tables, images, barcodes, checkmarks, and separators detected on that page. The properties of each of these elements store the coordinates of the corresponding region on the image. Text elements also store a confidence value that indicates how likely it is that the element was recognized correctly.
Note: If the document is exported to JSON but not to any of the image formats, the coordinates will be specified relative to the original image. If the document is exported to JSON and at the same time to one of the image formats, the coordinates will be specified relative to the preprocessed image that is exported.
All text elements are placed into either the texts array or the tables array.
| Property | Data type | Description |
| --- | --- | --- |
| width | integer | The width of the page in pixels. |
| height | integer | The height of the page in pixels. |
| rotated | string enum | Rotation of the page relative to the original image. Possible values: none, clockwise, counterclockwise, upside-down. |
| texts | object array | Array of text blocks. |
| tables | object array | Array of blocks containing tables. |
| pictures | picture object array | Array of image blocks. |
| barcodes | barcode object array | Array of barcode blocks. |
| separators | object array | Array of separator blocks. |
| checkmarks | object array | Array of checkmark blocks. |
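
As an illustration of how the page properties listed above might be consumed, here is a minimal Python sketch that summarizes each page of an exported document. It assumes the exported JSON exposes the pages array under a top-level "pages" key and that block arrays may be absent when nothing of that type was detected. The function name summarize_pages and the sample document are hypothetical and are not part of the export format.

```python
import json

# Block arrays that a page element may contain, per the property table.
BLOCK_KEYS = ("texts", "tables", "pictures", "barcodes", "separators", "checkmarks")


def summarize_pages(document):
    """Return one summary dict per page: size, rotation, and block counts.

    Assumption: the pages array sits under a top-level "pages" key.
    """
    summaries = []
    for index, page in enumerate(document.get("pages", [])):
        summary = {
            "index": index,
            "width": page.get("width"),
            "height": page.get("height"),
            # Assumption: treat a missing "rotated" value as "none".
            "rotated": page.get("rotated", "none"),
        }
        for key in BLOCK_KEYS:
            # A missing array is counted as zero blocks of that type.
            summary[key] = len(page.get(key, []))
        summaries.append(summary)
    return summaries


# Hypothetical sample: a single page with two text blocks and one table.
sample = json.loads("""
{
  "pages": [
    {
      "width": 2480,
      "height": 3508,
      "rotated": "none",
      "texts": [{}, {}],
      "tables": [{}],
      "barcodes": []
    }
  ]
}
""")

page_summaries = summarize_pages(sample)
```

Counting blocks per type is only one possible use. The same loop could be extended to read the coordinates stored in each block once their property names are known.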

Confidence

All objects that describe recognized text or images have a confidence property, which indicates the likelihood that the text was recognized correctly. Confidence is first calculated for individual characters; the confidence of a higher-level element is then derived from the confidence values of the elements it contains. The property uses a dedicated data type, confidence, which is derived from the number data type. Allowed values range from 0 to 100. A value of -1 indicates an element that does not contain any text data.
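
The value range described above could be handled as in the following minimal Python sketch. It treats -1 as "no text data", rejects values outside 0 to 100, and flags low values for manual review. The function name classify_confidence and the review threshold of 80 are assumptions made for illustration; the export format does not define a threshold.

```python
NO_TEXT = -1  # sentinel for an element that contains no text data


def classify_confidence(value, review_threshold=80):
    """Classify a confidence value as "no-text", "ok", or "review".

    Assumption: review_threshold is a hypothetical cut-off chosen by the
    caller; it is not part of the export format.
    """
    if value == NO_TEXT:
        return "no-text"
    if not 0 <= value <= 100:
        raise ValueError(f"confidence out of range: {value!r}")
    return "ok" if value >= review_threshold else "review"


# Hypothetical confidence values read from three elements.
labels = [classify_confidence(v) for v in (95, 40, -1)]
```

Checking for -1 before the range test matters, because -1 is a valid sentinel rather than an out-of-range value.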