A Document Splitter skill splits a PDF (or a stream of pages from multiple files) into separate documents. The skill’s Splitter Script activity decides whether each page starts a new document, based on the page’s type and the data it carries.
Unlike Document skills, Document Splitter skills can only be created and edited in Advanced Designer.
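
The first-page decision described above can be illustrated with a minimal Python sketch. This is not the Splitter Script API: the `Page` record, the `printed_page_number` field, and the boundary rule itself are all assumptions made up for illustration. The sketch only shows the general idea of walking a page stream and opening a new document whenever a page looks like a first page.

```python
from dataclasses import dataclass, field


@dataclass
class Page:
    """Hypothetical page record; not the Vantage object model."""
    index: int                                  # position in the incoming page stream
    page_type: str                              # class assigned by an upstream activity
    data: dict = field(default_factory=dict)    # extracted field values (assumed shape)


def starts_new_document(page, previous):
    """Illustrative rule: decide whether `page` opens a new document."""
    if previous is None:
        return True                             # the first page always opens a document
    if page.page_type != previous.page_type:
        return True                             # a type change marks a boundary
    # Same type: a printed "page 1" marker signals a fresh document.
    return page.data.get("printed_page_number") == 1


def split_pages(pages):
    """Group a page stream into documents, one list of pages per document."""
    documents = []
    previous = None
    for page in pages:
        if starts_new_document(page, previous):
            documents.append([])
        documents[-1].append(page)
        previous = page
    return documents


stream = [
    Page(0, "Invoice", {"printed_page_number": 1}),
    Page(1, "Invoice", {"printed_page_number": 2}),
    Page(2, "Invoice", {"printed_page_number": 1}),
    Page(3, "Contract"),
]
boundaries = [[p.index for p in doc] for doc in split_pages(stream)]
print(boundaries)  # [[0, 1], [2], [3]]
```

Keeping the boundary rule in its own function means the grouping loop stays the same when the rule changes, for example when a different field or page type signals a first page.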

When to use a Document Splitter skill

Use a Document Splitter skill to:
  • Assemble pages into documents.
  • Separate annexes from primary documents.
  • Determine each document’s type while splitting.
  • Reorder pages and remove blank or garbage pages.
For worked scenarios, see Split pages into separate documents in Advanced Designer.
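
As a rough illustration of the last item in the list, the sketch below drops blank pages and restores page order. The dictionary keys `scan_index`, `is_blank`, and `printed_page_number` are hypothetical names chosen for this example and do not come from the Vantage documentation; a real skill would rely on whatever its upstream activities provide.

```python
def clean_and_reorder(pages):
    """Drop blank pages, then sort the rest by printed page number."""
    kept = [p for p in pages if not p["is_blank"]]
    # sorted() is stable, so pages with equal numbers keep their scan order.
    return sorted(kept, key=lambda p: p["printed_page_number"])


# Pages in the order they were scanned (assumed input shape).
scanned = [
    {"scan_index": 0, "is_blank": False, "printed_page_number": 2},
    {"scan_index": 1, "is_blank": True, "printed_page_number": 0},
    {"scan_index": 2, "is_blank": False, "printed_page_number": 1},
]
ordered = [p["scan_index"] for p in clean_and_reorder(scanned)]
print(ordered)  # [2, 0]
```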

Configure a Document Splitter skill

After you create a Document Splitter skill on the start page, configure it in this order:
1. Skill settings
   Click the settings button next to the skill name to view and adjust Document Splitter skill settings.
2. Upload documents
   On the Documents tab, upload the documents the skill will work with.
3. Configure activities
   On the Activities tab, build the document processing flow, ending with a Splitter Script activity.
4. Configure the Splitter Script activity
   Define the rules that turn the flow of pages into a set of documents, including which page numbers belong to each document. See Splitter Script activity.
5. Publish
   On the Publish tab, publish the skill to make it available in the Skill Catalog in ABBYY Vantage.

Use a Document Splitter skill

A published Document Splitter skill is consumed by the Assemble activity of a Process skill in ABBYY Vantage. In the Process skill flow, the Assemble activity invokes the published splitter to break the incoming page stream into separate documents before downstream activities run.

Next steps

Splitting use cases

Worked scenarios for the four splitter patterns.

Splitter Script activity

Reference for the script that defines document boundaries.

Activities

Configure the upstream classification and extraction activities.

Assemble activity

Plug a published Document Splitter skill into a Process skill.