Create a Standard Forms extraction model in the Control Room.
Prerequisites
- Ensure your Control Room has the Document Automation
(Number of pages) product license. For more details on licenses, see Understanding licenses in Document Automation.
- You must have the
AAE_IQ Bot Admin
or
AAE_IQ Bot
Services
roles to complete this task
- Identify between five and fifteen training documents
- The maximum limit of Standard Forms document type
is 157286400 bytes.
- For limitations about using custom neural models, see Document Intelligence custom neural model.
Procedure
-
In the Control Room, navigate to , and click Create
model.
The Projects window opens in a new tab.
-
Click Create project.
-
Enter a name for the project, click Browse to upload
documents to train the extraction model, and click
Create.
-
Define the field tags:
-
Click the plus (+) icon on the upper-right corner of the screen and select
Field.
-
Enter a name for the tag and select Enter. For example,
Invoice Number
or Invoice Date
.
Note: Each field tag and its corresponding location tag
is assigned a unique color so that it is easy to associate them when
distinguishing or validating data visually. Clicking a field tag will highlight
the corresponding location tag on the document.
-
Define the table tags:
-
Click the plus (+) icon on the upper-right corner of the screen and select
Table.
-
Enter a name for the table and click Create.
-
For each column in the table, select the column name and perform one of the
following actions:
- Rename column: Select this option to rename the
column and select Enter to rename.
- Insert column: Select this option to insert a
column after the selected column.
- Delete column: Select this option to delete the
selected column.
-
Update the column details.
- Optional:
Click the plus (+) icon to add more rows.
-
Click the close (x) button on the upper-right corner of the screen to exit
the table view.
-
Click the Region option on the upper-left corner of the
screen and define the location of the data for each field:
-
For the form fields, highlight the location of the data to extract and select
the corresponding field name from the list.
-
For each cell in the table, highlight the location of the data to extract,
then select the cell. Click the close (x) button on the upper-right corner of
the screen to exit the table view.
-
Select the next document and repeat the steps to define the locations of the
form and table fields.
-
Click the Train icon and enter a name for the model.
-
Select one of the following models:
- Template: Custom template is an easy-to-train document
model that accurately extracts labeled key-value pairs, selection marks, tables,
regions, and signatures from documents. This model is typically used to extract
data from structured documents that have consistent layout and defined visual
templates. See Document Intelligence custom template model.
- Neural: Custom neural model is a deep learned model
type that combines layout and language features to accurately extract labeled
fields from documents. This model is more suitable to extract data from
semi-structured and unstructured documents. See Document Intelligence custom neural model.
Important: As the custom neural model uses deep
learning technology, it might take some time for completing the training for
the model. Therefore, you might not see the model immediately on the custom
models page. We recommend that you wait for the training to be complete and
then select this model for use.
-
Click Train.
-
Click the Analyze icon, click Browse for
file, and select a document, and click Run
analysis to test the extraction model.
Note: As of
Automation 360 v.29, you can create
and train new models in
Azure AI Document Intelligence v.3.0 only. If
you are in
Azure AI Document Intelligence v.2.1, then you must update to
Azure AI Document Intelligence v.3.0 by following the below steps:
- Open Project.
- Click the Train icon.
- Click Train. The model is created in the Azure AI Document Intelligence v.3.0.