Detect pages in multipage files and assemble them into documents
Documents can be sent to Vantage for processing as follows:
As a set of files, where each document corresponds to a single file
As one multipage file containing one or several documents
As multiple single-page files
The last two require documents to be assembled using a special process activity.The Assemble activity is used to detect pages in multipage files and assemble them into documents.There are three different document assembly types. You can select the appropriate assembly type in the Actions pane once you have added the activity:
By files (used by default): each source file is assigned to a single corresponding document.
One document: all files are assembled together into a single document.
By classification: documents are assembled based on page classification.
By Document Splitter Skill: documents are split according to the rules described in the By Document Splitter Skill section below.
The By classification assembly type is carried out based on classes assigned to pages by a Classification skill. Vantage assigns each page a class and merges consecutive pages of the same class into one document.Once you have selected the assembly type, click the Settings button located under the assembly type name to specify additional settings for it:
Select an appropriate Classification skill.
Map classes to document types and specify classes to be treated as first pages of documents if applicable.
Note: If some of the classes have not been mapped to document types, the Assemble activity will assemble pages of those classes into separate documents for each such class.
Click Save to save the settings.
By Classification assembly rules are as follows:
Consecutive pages of the same class are assigned to a single document.
A page that is of a different class than the previous page is assigned to be the first page of a new document.
Pages that could not be classified are assembled into the document currently being assembled (i.e. do not interrupt a chain of consecutive pages of the same class).
Pages that were classified as first pages are used to start new documents even if the two new documents are both of the same class.
This option involves the use of specific skills named Document Splitter skills. These skills split file(s) with multiple documents into specific documents and may assign a type to each document to determine how the document will be processed. For more information about creating and editing Document Splitter skills, see the Advanced Designer Guide.Once you have selected the By Document Splitter Skill assembly type (in Process skill Designer of Vantage), perform the following:
Click the Select Skill button located under the assembly type name to select the necessary Document Splitter skill:You can preview the skill by clicking on it. The Preview window displays the respective classes specified in the skill. If no Document Splitter skill is found, the user will be prompted to select a skill from the Skill Catalog.Once a skill is selected, the Assemble activity will be configured. The necessary skill will be displayed in the Actions pane.
Configure data extraction using the Extract activity. This activity allows the skill to extract documents of several types simultaneously depending on a class that corresponds to the document type.
If the Manual Review activity is added, check the created workflow in Manual Review client.