After you create a Document skill, follow these steps to train and publish it.Documentation Index
Fetch the complete documentation index at: https://docs.abbyy.com/llms.txt
Use this file to discover all available pages before exploring further.
Upload training and test documents
Navigate to the Documents tab in the Skill Designer and click Upload documents — in the center of the designer, on the toolbar, or in the Actions pane. Each file must contain a single document image.
Label fields in your training documents
Navigate to the Editor tab — either by clicking the tab name, or by selecting one or more documents from the list and clicking Label Fields and Create Business Rules in the Actions pane. Label every field you want to extract. You can also add validation rules and skill parameters here.
Train the skill and review accuracy
Click Train in the Actions pane. When training finishes, the Train button shows Completed. To stop training, click Cancel under the Train button. Review extraction accuracy and correct any errors.
Process structured documents
ABBYY Vantage offers a machine learning mode for processing structured documents — documents where field locations are the same on every instance. Examples include questionnaires, application forms, and tax return forms. This mode handles forms that have multiple variants, such as IRS Form 1040 for different years, where the set and location of fields differ slightly between variants. Each variant is a separate structured document, and you must upload a blank form for each.Enable fixed-form documents
Create a new Document skill and turn on the Fixed-form documents toggle.

Upload a blank form for each variant
Navigate to the Blank Form tab and click Upload Blank Form — in the center of the designer, on the toolbar, or in the Actions pane. If you don’t have a blank form, upload a completed form and mark it as a blank form.One skill can handle up to 10 variants of one form (for example, IRS Form 1040 for different years).
Eliminate field background (if needed)
In the field settings, enable Eliminate field background for fields whose background may affect recognition.
Test with completed documents
Click the Test Set tab and upload completed test documents. Confirm that all fields are labeled correctly on each document. If any field locations don’t match an uploaded blank form, add a blank form for that variant.
Review test results
In the Actions pane, test your skill. When the operation completes, review the results. If you are not satisfied, adjust the labeling and train again.

Switch between structured and semi-structured
If you later decide your documents are better treated as semi-structured:- Open Document skill settings.
- Turn off the Fixed-form documents toggle. All labeled fields are preserved.
- Retrain the skill.
Work with tables and repeating groups
When processing structured documents, Vantage can handle tables and repeating groups if:- The maximum number of table rows or group instances is known in advance.
- The boundaries of the table or group are fixed.
Only tables with text values are supported. If your table has columns with checkboxes or barcodes, use a repeating group instead.
Configure recognition languages
When processing a document, Vantage selects a processing language from the list of languages enabled on the skill. By default, new skills have English, French, German, and Spanish enabled. To modify the list:- Open Document skill settings.
- Select the languages you need. The list is sorted alphabetically, with the currently selected languages pinned to the top. At least one language must be selected.
- Click Save to keep your changes, or Cancel to discard them.
The number of selected languages may affect document processing speed. Restrict the list to the languages you actually expect in your documents.
Configure Online learning
Online learning collects processed documents into a training set and continues training the skill using those documents. It is available for Document skills and Classification skills. Document skills support two Online learning modes:| Mode | Behavior |
|---|---|
| Collect and learn | Default. Documents are collected and the skill is retrained automatically. |
| Collect only | Documents are collected but the skill is not retrained. Use this mode to review documents added to the training set before retraining manually. |

Related topics
Enable Online learning
Continue improving a Document skill on production documents after publishing.
Labeling documents
Guidelines for labeling structured and semi-structured documents during training.
Create a skill
Prerequisite — create a new skill in the Skill Catalog before opening it in the Skill Designer.
Process structured documents in Advanced Designer
Use Advanced Designer when structured-document processing needs to combine with other Vantage technologies.
Supported recognition languages
Full list of OCR languages supported across Vantage skills.
