Using the Train Advanced Classifier action
- Updated: 2023/02/02
Use the Train Advanced Classifier action to create a model file. The Classify Document, Classify Pages, and Split Document actions use this model file to sort input documents into the required categories.
Prerequisites
Before building the bot, collect example documents and categorize them into folders. Ensure that the set of example documents meets the following requirements:
- Has at least two categories.
- Has at least 15 files per category (20 files per category are recommended).
- There is no restriction on the maximum number of categories. However, classification performance can decline as the training data set and the corresponding model size increase. For optimal performance, keep the number of categories to 150 or fewer per model file.
- The supported file formats are as follows:
- .tiff
- .bitmap
- .jpeg
- .png
- .txt
- We recommend that you provide images with a resolution of 300 dpi (dots per inch). The minimum acceptable resolution is 200 dpi.
Note:
If these minimum requirements are not met, an error message is displayed at bot runtime.
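The file-count and file-format requirements above can be checked before training with a small script. The following is a minimal sketch in Python and is not part of the product. It assumes the example documents sit in one subfolder per category under a single root folder. The function name check_training_folder and the constants are hypothetical, the extension list is copied from the list above without adding variants such as .tif or .jpg, and treating an unsupported file as an error rather than a warning is an assumption. Image resolution is not checked.

```python
from pathlib import Path

# Values taken from the prerequisites above. The extension set mirrors the
# documented list only; variants such as .tif or .jpg are not confirmed.
SUPPORTED_EXTENSIONS = {".tiff", ".bitmap", ".jpeg", ".png", ".txt"}
MIN_CATEGORIES = 2
MIN_FILES_PER_CATEGORY = 15
RECOMMENDED_FILES_PER_CATEGORY = 20
RECOMMENDED_MAX_CATEGORIES = 150


def check_training_folder(root):
    """Return (errors, warnings) for a root folder with one subfolder per category.

    Hypothetical helper: the folder layout is an assumption, not a product rule.
    """
    errors, warnings = [], []
    categories = sorted(p for p in Path(root).iterdir() if p.is_dir())

    if len(categories) < MIN_CATEGORIES:
        errors.append(
            f"found {len(categories)} categories, need at least {MIN_CATEGORIES}"
        )
    if len(categories) > RECOMMENDED_MAX_CATEGORIES:
        warnings.append(
            f"found {len(categories)} categories, more than the recommended "
            f"{RECOMMENDED_MAX_CATEGORIES} per model file"
        )

    for category in categories:
        files = [f for f in category.iterdir() if f.is_file()]
        unsupported = sorted(
            f.name for f in files if f.suffix.lower() not in SUPPORTED_EXTENSIONS
        )
        supported_count = len(files) - len(unsupported)

        # Assumption: an unsupported file is reported as an error.
        if unsupported:
            errors.append(
                f"{category.name}: unsupported files: {', '.join(unsupported)}"
            )
        if supported_count < MIN_FILES_PER_CATEGORY:
            errors.append(
                f"{category.name}: {supported_count} supported files, "
                f"need at least {MIN_FILES_PER_CATEGORY}"
            )
        elif supported_count < RECOMMENDED_FILES_PER_CATEGORY:
            warnings.append(
                f"{category.name}: {supported_count} supported files, "
                f"{RECOMMENDED_FILES_PER_CATEGORY} recommended"
            )

    return errors, warnings
```

Calling check_training_folder on the root folder returns two lists of messages. An empty errors list means the counted requirements are met. The warnings list covers only the recommendations (20 files per category, 150 categories per model file).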
Procedure
Next steps
After creating the model, build a bot to classify input documents. For more information, see Using the Classify Document action.