Result structure at a glance
Every result follows the same hierarchy. To access the extracted values, read Documents[0].ExtractedData.RootObject.Fields.
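The path above can be walked defensively in code. The following is a minimal Python sketch, assuming the result JSON has already been parsed into a dict. The `get_fields` helper and the sample payload are hypothetical; only the Documents, ExtractedData, RootObject, and Fields key names come from the hierarchy described above.

```python
from typing import Any


def get_fields(result: dict[str, Any], doc_index: int = 0) -> list[dict[str, Any]]:
    """Return Documents[doc_index].ExtractedData.RootObject.Fields, or [] if any level is missing."""
    documents = result.get("Documents") or []
    if doc_index >= len(documents):
        return []
    root = (documents[doc_index].get("ExtractedData") or {}).get("RootObject") or {}
    return root.get("Fields") or []


# Hypothetical, heavily trimmed payload for illustration only.
sample_result = {
    "Documents": [
        {"ExtractedData": {"RootObject": {"Fields": [{"Name": "InvoiceNumber"}]}}}
    ]
}

field_names = [f["Name"] for f in get_fields(sample_result)]
print(field_names)
```

Returning an empty list instead of raising keeps downstream loops simple when a document produced no extracted data.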
Reading extracted fields
The properties below describe each field in a simplified example from an invoice processed with the ABBYY Invoice skill:

| Property | What it means |
|---|---|
| Name | The field name defined by the skill (e.g., “InvoiceNumber”, “VendorName”) |
| Value | The extracted value after normalization: dates become ISO format and numbers are standardized |
| Annotation.Confidence | An integer from 0 to 100 indicating how certain Vantage is about the extraction |
| Annotation.RawValue | The original text as recognized by OCR, before any normalization |
| Annotation.Source | Whether the value came from the document’s Text layer or was read from the Image |
| NeedVerification | true if the field was flagged for human review (e.g., low confidence or failed a validation rule) |
| isVerified | true if a human operator has already confirmed the value in Manual Review |
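
To make these properties concrete, here is a minimal Python sketch that reads them from a single field. The `total_amount` dict is a hypothetical reconstruction: its key names follow the table above and its values follow the TotalAmount example discussed in this section, but the exact nesting is an assumption rather than confirmed Vantage output. In particular, placing NeedVerification and isVerified directly on the field, and the "Image" source value, are guesses made for illustration.

```python
# Hypothetical field, shaped after the property table; not verbatim Vantage output.
total_amount = {
    "Name": "TotalAmount",
    "Value": "1,250.00",
    "Annotation": {"Confidence": 62, "RawValue": "1.250,00", "Source": "Image"},
    "NeedVerification": True,
    "isVerified": False,
}


def summarize(field: dict) -> str:
    """Build a one-line summary of a field's value, raw text, and review status."""
    annotation = field.get("Annotation", {})
    pending = field.get("NeedVerification") and not field.get("isVerified")
    status = "needs review" if pending else "ok"
    return (
        f"{field['Name']}: {field.get('Value')!r} "
        f"(raw {annotation.get('RawValue')!r}, "
        f"confidence {annotation.get('Confidence')}) -> {status}"
    )


summary = summarize(total_amount)
print(summary)
```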
Notice how, in the invoice example, TotalAmount has Confidence: 62 and NeedVerification: true. The raw value 1.250,00 (European format) was normalized to 1,250.00, but the low confidence suggests Vantage wasn’t fully certain about the OCR result. This is exactly the kind of field a human reviewer would check.

Table fields (line items)
Skills that extract tables, such as invoice line items, represent them as fields containing repeating groups. Each row appears as an ExtractedObject within the field’s List array, with its own sub-fields.
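
A minimal Python sketch of walking such a table field follows. The `line_items` payload is hypothetical: the List key and the idea of one object per row come from the description above, while the per-row "Fields" key and the Description, Quantity, and Amount sub-field names are assumptions made for illustration.

```python
# Hypothetical LineItems field: one object per row inside "List" (assumed shape).
line_items = {
    "Name": "LineItems",
    "List": [
        {"Fields": [{"Name": "Description", "Value": "Widget"},
                    {"Name": "Quantity", "Value": 2},
                    {"Name": "Amount", "Value": 19.90}]},
        {"Fields": [{"Name": "Description", "Value": "Shipping"},
                    {"Name": "Quantity", "Value": 1},
                    {"Name": "Amount", "Value": 5.00}]},
    ],
}


def rows_as_dicts(table_field: dict) -> list[dict]:
    """Flatten each row's sub-fields into a {sub-field name: value} dict."""
    return [
        {sub["Name"]: sub.get("Value") for sub in row.get("Fields", [])}
        for row in table_field.get("List", [])
    ]


rows = rows_as_dicts(line_items)
for row in rows:
    print(row)
```

Flattening rows into plain dicts drops the per-cell annotations, so keep the original objects if you also need confidence scores for individual cells.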
Confidence scores in practice
Confidence scores reflect how certain Vantage is about an extracted value. Several factors affect confidence:

- Document quality: blurry scans, low-resolution images, or handwritten text produce lower scores
- Layout complexity: unusual table layouts or overlapping fields can reduce accuracy
- Training data match: values that closely match patterns the skill was trained on score higher

A typical way to act on the score:

- High confidence (90–100): accept the value automatically
- Medium confidence (70–89): accept with caution, or route to review for critical fields
- Low confidence (below 70): route to Manual Review for human verification

Fields with NeedVerification: true have already been flagged by Vantage based on confidence thresholds and validation rules configured in the skill.
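
The confidence bands above translate into a small routing function. This is a sketch under stated assumptions, not Vantage's own logic: the `route_field` name and the returned labels are hypothetical, the cut-offs (90 and 70) are the ones listed above, and the field shape is assumed to match the properties described earlier in this section.

```python
def route_field(field: dict, critical: bool = False) -> str:
    """Return "accept" or "review" for a field, using the 90/70 confidence bands."""
    # A value a human has already confirmed is accepted as-is (assumed policy).
    if field.get("isVerified"):
        return "accept"
    # Fields flagged by the skill's own rules always go to review.
    if field.get("NeedVerification"):
        return "review"
    confidence = field.get("Annotation", {}).get("Confidence", 0)
    if confidence >= 90:
        return "accept"
    if confidence >= 70:
        # Medium band: accept with caution, but send critical fields to review.
        return "review" if critical else "accept"
    return "review"


decisions = [
    route_field({"Annotation": {"Confidence": 95}}),
    route_field({"Annotation": {"Confidence": 80}}, critical=True),
    route_field({"Annotation": {"Confidence": 62}, "NeedVerification": True}),
]
print(decisions)
```

Checking NeedVerification before the score means a validation-rule failure sends a field to review even when its confidence is high. A missing score is treated as 0, so fields without an annotation are also reviewed.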
Common field names by skill
Each skill defines its own set of field names. Here are the most common fields for frequently used skills:

| Skill | Key fields |
|---|---|
| Invoice | InvoiceNumber, InvoiceDate, VendorName, TotalAmount, TaxAmount, LineItems |
| Receipt | ReceiptNumber, Date, VendorName, Total, Tax, PaymentMethod, LineItems |
| Purchase Order | PONumber, OrderDate, BuyerName, SupplierName, TotalAmount, LineItems |
| ID Document | FirstName, LastName, DateOfBirth, DocumentNumber, ExpirationDate, Nationality |

Each skill may include additional fields beyond these. Browse the Skill Catalog to see the full field list for any skill.
Next steps
Full JSON schema
Complete reference for all objects and properties in the JSON output.
XML output
Alternative output format with the same extracted data in XML structure.
Manual Review
Configure human-in-the-loop verification for extracted fields.
What to Learn Next
Choose your path through the Vantage documentation based on your goals.
