Vantage can normalize extracted data to ensure uniform representation. The following data types can be normalized: To normalize data extracted from a field, specify its data type:Documentation Index
Fetch the complete documentation index at: https://docs.abbyy.com/llms.txt
Use this file to discover all available pages before exploring further.
For normalization to work, set the data-type-specific properties for each field so that Vantage extracts everything that needs to be normalized. Click Advanced in the Field options dialog to access these properties. See Properties by data type.

Normalize dates
When normalizing dates, Vantage converts extracted dates into ISO 8601 format:YYYY-MM-DDfor datesHH:MM:SSfor time
Examples
| Extracted data | Normalized data |
|---|---|
| 15.06.2023 | 2023-06-15 |
| 2023/06/15 22:17 | 2023-06-15 22:17:00 |
| 06-15-2023 | 2023-06-15 |
| 02/11/2022 | 2022-02-11 or 2022-11-02 |
| Saturday, December 3rd, 2022 | 2022-12-03 |
| The second of May 2022 | 2022-05-02 |
If both Day-Month-Year and Month-Day-Year formats are enabled, Vantage may not be able to normalize the date unambiguously. In that case, you can choose between the two candidate dates.
- The date is incomplete — for example,
4:39 am(time values are only normalized when extracted together with a date). - Adverbs of time are used instead of exact dates — for example,
last month,a few days ago. - Extra words or characters appear next to the date or time — for example,
2016/06/15 22. - Uncommon date representations are used — for example,
14 Jumada Al-Awwal 1445.
Normalize numbers
Vantage can normalize numbers using Western or Indian digit grouping:- Western — Groups digits by threes from right to left, using commas to separate thousands, millions, and so on.
- Indian — Groups the first three digits from the right, then by twos for tens of thousands, lakhs, tens of lakhs, crores, and so on.
.) to separate integer and fractional parts. For accepted separators, see Data types.
Examples
| Extracted data | Normalized data |
|---|---|
| 12,345,678 | 12345678 |
| -12,345.678 | -12345.678 |
| 12.0000 | 12 |
| 1.000 | 1000 or 1 |
| 12,345.678 % | 12345.678 |
| 1,23,45,67,890 (Indian numbering system) | 1234567890 |
| twenty-first | 21 |
- Extra words or characters appear next to the number — for example,
EURO12,345.678or5 kilos. - There is an irregular number of digits between the fractional and integer parts, or between the decimal and thousands parts — for example,
123,456,7890. The fractional part must contain 3 or fewer digits. If123,456,789is extracted, the normalized value is123456789; if123,456,78is extracted, the normalized value is123456.78. - Irregular number representations are used.
Normalize money amounts
A money amount contains a number value and a currency symbol, with the symbol before or after the amount. When normalizing, Vantage outputs the currency symbol first, followed by the amount normalized as a number. Currency is identified by symbol or name —€, EURO, and euros all map to the euro. The normalized value uses the exact symbol or name found in the extracted text.
Examples
| Extracted data | Normalized data |
|---|---|
| 12,345.678 EURO | EURO 12345.678 |
| 12,345.678 ¥ | ¥ 12345.678 |
| 13,87E | E 13.87 |
| 13 euro 87 | euro 13.87 |
| fifty dollars | dollars 50 |
| ₹1,23,455 | ₹ 123455 |
Amounts written out in words are normalized only when they’re in English and English is selected in the skill settings.
12 ttt.
Related topics
Text field
Add a Text field, choose a data type, and configure recognition properties.
Labeling documents
Guidelines for labeling structured and semi-structured documents during training.
Supported recognition languages
Full list of OCR languages supported across Vantage skills.
