Skip to main content
A Document Splitter skill splits a PDF (or a stream of pages from multiple files) into separate documents. The skill’s Splitter Script activity decides whether each page is the first page of a new document, based on the type and the data the page carries.
Unlike Document skills, Document Splitter skills can only be created and edited in Advanced Designer.

When to use a Document Splitter skill

Use a Document Splitter skill to:
  • Assemble pages into documents.
  • Separate annexes from primary documents.
  • Determine each document’s type while splitting.
  • Reorder pages and remove blank or garbage pages.
For worked scenarios, see Split pages into separate documents in Advanced Designer.

Configure a Document Splitter skill

After you create a Document Splitter skill on the start page, configure it in this order:
1

Skill settings

Click the settings button next to the skill name to view and adjust Document Splitter skill settings.
2

Upload documents

On the Documents tab, upload the documents the skill will work with.
3

Configure activities

On the Activities tab, build the document processing flow, ending with a Splitter Script activity.
4

Configure the Splitter Script activity

Define the rules that turn the flow of pages into a set of documents — including which page numbers belong to each document. See Splitter Script activity.
5

Test the skill

On the Documents tab, choose Test Skill for selected documents, or Test Skill Using This Set for a test set, and check how the skill splits and labels the page stream before you publish.
6

Publish

On the Publish tab, publish the skill to make it available in the Skill Catalog in ABBYY Vantage.
To adjust the skill’s behavior without retraining and republishing — connect data catalogs, supply values for regular expressions, or toggle Boolean flags — configure skill parameters.

Use a Document Splitter skill

A published Document Splitter skill is consumed by the Assemble activity of a Process skill in ABBYY Vantage. In the Process skill flow, the Assemble activity invokes the published splitter to break the incoming page stream into separate documents before downstream activities run.

Next steps

Splitting use cases

Worked scenarios for the four splitter patterns.

Splitter Script activity

Reference for the script that defines document boundaries.

Activities

Configure the upstream classification and extraction activities.

Assemble activity

Plug a published Document Splitter skill into a Process skill.