With the German activity configured, set up the second Extraction Rules activity for Dutch and Belgian sick notes. Dutch and Belgian sick notes have a different structure from the German documents and vary widely across the class, so Fast Learning isn’t viable here. Some fields are unique to these documents; you’ll add them to the data form as you configure the activity. Start with fields that appear on every document, then extend the form for the new ones.
Extracting the issue date
Dates in these documents are easy to extract with the Date element, so this time we will use the search element that is created automatically for this field.
Add the Date field
- Open the Manage Fields dialog on the Fields tab and select the “Date” field to use in this activity. Click Save.
- Go to the Search Elements tab. A search element of type Date has been created for the “Date” field and mapped automatically.
Create the IssueDateGroup with the kwDate label
- Create a Group search element called “IssueDateGroup” and make it optional.
- Add a Static Text element called “kwDate” inside the group — this finds the label that helps locate the actual date.
- Since this document class contains documents in Dutch or French, enter the label text options on separate lines in the Text to find dialog: “Date” on the first line and “Datum” on the second line.
- Disable the Search for parts of words option.
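Outside Vantage, the effect of listing both label variants and disabling Search for parts of words can be approximated with a whole-word regular expression. The following is only an illustrative sketch: the function name `find_date_label` is hypothetical, the match is assumed to be case-sensitive, and Vantage's own matching behavior is not reproduced here.

```python
import re

# Label variants from the Text to find dialog, one per line.
LABELS = ["Date", "Datum"]

# The \b anchors allow whole-word matches only, which roughly mirrors
# disabling "Search for parts of words".
LABEL_RE = re.compile(r"\b(?:" + "|".join(map(re.escape, LABELS)) + r")\b")


def find_date_label(line: str):
    """Return the matched label, or None if the line has no whole-word label."""
    match = LABEL_RE.search(line)
    return match.group(0) if match else None
```

With this sketch, a line such as "Datum: 12/03/2021" yields "Datum", while "Datums en tijden" yields no match because the label is only part of a longer word.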
Add the Date element to the group
Configure the Date element's search area
- Delete the Nearest to relation that was automatically added when the element was created.
- Select the “kwDate” element as the one nearest to the element being searched for.
- The date can be located to the right of the keyword or below it. Specify the search area below the “kwDate” element.
- The search area should also include the line on which the keyword is located. Click the bottom boundary icon to the right of the element name and select Top Boundary of Region. The lines may be uneven, so set the Below value to -10 to extend the search area slightly above the line.
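The geometry of this search area can be pictured with a small sketch. It assumes image coordinates in which y grows downward, an offset expressed in the same units as the coordinates, and "nearest" measured between rectangle centers; the `Rect` class and helper names are hypothetical and are not part of Vantage.

```python
from dataclasses import dataclass
from math import hypot


@dataclass(frozen=True)
class Rect:
    left: float
    top: float
    right: float
    bottom: float

    @property
    def center(self):
        return ((self.left + self.right) / 2, (self.top + self.bottom) / 2)


def below_top_boundary(keyword: Rect, page: Rect, offset: float = -10) -> Rect:
    """Search area that starts at the keyword's top boundary plus the offset.

    y grows downward, so a negative offset moves the area's top edge up.
    """
    return Rect(page.left, keyword.top + offset, page.right, page.bottom)


def contains(area: Rect, r: Rect) -> bool:
    """True if rectangle r lies entirely inside the area."""
    return (area.left <= r.left and r.right <= area.right
            and area.top <= r.top and r.bottom <= area.bottom)


def nearest_in_area(keyword: Rect, area: Rect, candidates):
    """Candidate inside the area whose center is closest to the keyword's center."""
    inside = [c for c in candidates if contains(area, c)]
    if not inside:
        return None
    kx, ky = keyword.center
    return min(inside, key=lambda c: hypot(c.center[0] - kx, c.center[1] - ky))
```

In this model, a date printed on the same line as the keyword but a few units higher falls inside the area only because of the negative offset. With an offset of 0 it would be excluded, and a more distant date lower on the page would be picked instead.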
Extracting the sickness dates
The Key Value element searches for both a static text label and its value, which would make it a natural choice for these dates. However, it tolerates little variation in the location and properties of the value. In these documents, each component of a sickness date sits in a separate table cell. The cells can appear in non-standard places on each document, but their position relative to one another is always the same. The cell boundaries may not be very clear, but we will still use the Table Cell element because it allows for fuzzy borders and will be convenient if we decide to train the activity on more documents. We’ll use Group elements to organize the search element hierarchy.
Add the Start Date and End Date fields
- Start Date
- End Date
Create the StartDateGroup with Table Cell elements
| Parameter | Value |
|---|---|
| Group search element: | |
| Name | StartDateGroup |
| Static Text search element: | |
| Name | kwStartDate |
| Text to find | Vanaf / From, A partir du, Van |
| Search for parts of words | Disabled |
| Table Cell search element: | |
| Name | StartDateDay |
| Search pattern | Number |
| Character count | {1, 1, 3, 3} |
| Search for parts of words | Disabled |
| Search area | Below the “kwStartDate” element, nearest to “kwStartDate” |
| Table Cell search element: | |
| Name | StartDateMonth |
| Search pattern | Number |
| Character count | {1, 1, 3, 3} |
| Search for parts of words | Disabled |
| Search area | Below the “kwStartDate” element, right of “StartDateDay”, nearest to “StartDateDay” |
| Table Cell search element: | |
| Name | StartDateYear |
| Search pattern | Number |
| Character count | {2, 2, 4, 4} |
| Search for parts of words | Disabled |
| Search area | Below the “kwStartDate” element, right of “StartDateMonth”, nearest to “StartDateMonth” |
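
The way the search areas in this table chain together (day nearest to the keyword, month to the right of the day, year to the right of the month) can be sketched as follows. This is a simplified model under stated assumptions: tokens are hypothetical `(text, x, y)` tuples with y growing downward, "nearest" is plain Euclidean distance, and the Character count setting is not modeled.

```python
from math import hypot


def nearest(anchor, candidates):
    """Candidate closest to the anchor; each item is a (text, x, y) tuple."""
    return min(
        candidates,
        key=lambda c: hypot(c[1] - anchor[1], c[2] - anchor[2]),
        default=None,
    )


def find_date_cells(keyword, tokens):
    """Pick day, month, and year cells by chaining positional constraints."""
    # Only numeric tokens located below the keyword qualify.
    below = [t for t in tokens if t[0].isdigit() and t[2] > keyword[2]]
    day = nearest(keyword, below)
    if day is None:
        return None
    # Each subsequent cell must lie to the right of the previous one.
    month = nearest(day, [t for t in below if t[1] > day[1]])
    if month is None:
        return None
    year = nearest(month, [t for t in below if t[1] > month[1]])
    if year is None:
        return None
    return day[0], month[0], year[0]
```

Because each element is anchored to the previous one rather than to a fixed page position, the chain still resolves when the whole table shifts on the page.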
Create the EndDateGroup as a copy
- Create a copy of the “StartDateGroup” element and rename it to “EndDateGroup”.
- Rename the group’s sub-elements: “kwStartDate” to “kwEndDate”, “StartDateDay” to “EndDateDay”, “StartDateMonth” to “EndDateMonth”, “StartDateYear” to “EndDateYear”.
- Change the text to find of the “kwEndDate” element to “Tot en met / Till and incl., Jusqu’ au, Tot en met”.
- Specify the search area for the “EndDateDay” element: below the “kwEndDate” element and nearest to it. Delete the other relations.
Add Data Composition fields for the dates
- Open the Manage Fields dialog and add a Data Composition Field called “Start Date Composed”. Map the following elements to the fields:
  - “StartDateDay” to Day
  - “StartDateMonth” to Month
  - “StartDateYear” to Year
- Create a Data Composition Field called “End Date Composed”. Map the following elements:
  - “EndDateDay” to Day
  - “EndDateMonth” to Month
  - “EndDateYear” to Year
- Map the “Start Date Composed” and “End Date Composed” data composition fields to the “Start Date” and “End Date” fields.
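
Conceptually, a data composition field joins the separately extracted day, month, and year into one date value. The sketch below shows that idea in plain Python. It is not how Vantage implements composition; the function name `compose_date` and the rule for expanding two-digit years (adding 2000) are assumptions made for illustration.

```python
from datetime import date


def compose_date(day: str, month: str, year: str, century: int = 2000):
    """Combine separately extracted day, month, and year strings into a date.

    Returns None when the parts do not form a valid calendar date.
    A one- or two-digit year is expanded by adding `century` (an assumption).
    """
    try:
        d, m, y = int(day), int(month), int(year)
    except ValueError:
        # A misrecognized character (for example "l2") cannot be parsed.
        return None
    if len(year.strip()) <= 2:
        y += century
    try:
        return date(y, m, d)
    except ValueError:
        # Out-of-range parts such as 31 February.
        return None
```

Returning `None` for invalid combinations gives a simple way to flag dates that need manual review.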

Extracting the type of sick note
We’ll extract the type of sick note using a checkmark, in the same way as we did for the German documents.
Enable the Primary and Secondary checkmarks
Create the TypeOfSickNoteGroup and PrimaryGroup label
- Create a Group element called “TypeOfSickNoteGroup”.
- Create a copy of this group, rename it to “PrimaryGroup”, and place it inside “TypeOfSickNoteGroup”.
- Add a Static Text element called “kwCheckmark” to the “PrimaryGroup” group.
- Set the text to find to “eerste / Primary, première, primair”.
Configure the Checkmark, XMark, and CheckmarkRegion elements
| Parameter | Value |
|---|---|
| Static Text search element: | |
| Name | Checkmark |
| Text to find | X |
| Character count | {1, 1, 3, 3} |
| Search for parts of words | Disabled |
| Search area | Right of “kwCheckmark”, nearest to “kwCheckmark” |
| Static Text search element: | |
| Name | XMark |
| Text to find | X |
| Character count | {1, 1, 3, 3} |
| Search for parts of words | Disabled |
| Search area | Below the “kwCheckmark” top boundary, Below value = -15, Left of “kwCheckmark”, Above the “kwCheckmark” bottom boundary, Above value = -15, Nearest to “kwCheckmark” |
| Under what conditions | Do not find element if “Checkmark” is found |
| Region search element: | |
| Name | CheckmarkRegion |
| Search Conditions section of the Code Editor | if Checkmark.IsFound then RSA: Checkmark.Rect; else if XMark.IsFound then RSA: XMark.Rect; else DontFind; |
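
The fallback order expressed in the Search Conditions row (use the “Checkmark” rectangle if it was found, otherwise the “XMark” rectangle, otherwise find nothing) can be restated in Python. This is only a restatement of the logic for readers unfamiliar with the Code Editor syntax; the function name and the tuple representation of a rectangle are hypothetical.

```python
from typing import Optional, Tuple

Rect = Tuple[int, int, int, int]  # (left, top, right, bottom)


def checkmark_region(checkmark: Optional[Rect], xmark: Optional[Rect]) -> Optional[Rect]:
    """Mirror the search condition: Checkmark first, then XMark, else not found.

    None stands for an element that was not found.
    """
    if checkmark is not None:
        return checkmark
    if xmark is not None:
        return xmark
    return None
```

The region is therefore found only when one of the two mark elements was found, with “Checkmark” taking priority.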
Create the SecondaryGroup and RelapseGroup
- Create a copy of “PrimaryGroup” and rename it to “SecondaryGroup”. Change the text to find of its “kwCheckmark” element to “prolongation”, “verlenging”.
- Dutch and Belgian sick notes are divided into three types — ‘relapse’ is an additional type compared to German sick notes. Create another copy of “PrimaryGroup” and rename it to “RelapseGroup”.
- Change the text to find of the RelapseGroup’s “kwCheckmark” element to “Herval” and enable the Match case option to exclude words occurring in the middle of a sentence.

Add the Relapse checkmark and map fields
- Open the Manage Fields dialog and add a “Relapse” checkmark to the “Type of Sick Note” checkmark group. Enable all checkmarks in the group for use in the current activity and click Save.
- Map the checkmarks to the corresponding Region elements and delete the elements that were automatically created when enabling the fields.

