Note: This activity can’t extract complex structures (for example, nested tables, which are repeating structures inside other tables) and fields of type other than Text. To extract such structures, use the Extraction Rules activity.
Use Cases
Add this activity to your document processing flow when:- Your skill will be used to process multiple variants of a certain document type.
- You are planning to process document variants on which your skill has not yet been trained. For example, you may have a Document skill with a Fast Learning activity which has been trained to extract fields from loan agreements (with different field structures) coming from several different banks. If you decide to use this existing skill to process loan agreements from a new bank yet unknown to the skill, the extraction quality may be below par. To improve extraction quality, you can use a Deep Learning activity instead of a Fast Learning activity.
How It Works
Deep Learning combines Convolutional Neural Networks (CNNs), Recurrent Neural Networks (RNNs), and Natural Language Processing (NLP) tokens. Through this combination, Deep Learning understands image patterns, the structure of documents, field contents, and surrounding labels. It requires a large number of documents to train, but it generalizes to new document layouts it has not encountered yet, providing a true templateless approach to extraction, which is the only way to deal with documents for which no exhaustive set of layouts is available at the training stage.Training Requirements
For best results, it is essential to correctly label as many documents as possible. The number of sample documents used for training significantly affects the quality of field extraction. The recommended number of sample documents is as follows:- For high-variability documents: At least 200-300 sample documents (2-3 sample documents per variant) are required.
- For low-variability documents: At least 100 sample documents are required.
