Select an OCR engine
- Updated: 2024/03/14
Select an OCR engine
You can select an OCR engines that suits your requirement for data extraction based on your document types. Restarting IQ Bot services is not necessary for implementing an engine change.
During IQ Bot installation, the system sets the latest version of Tesseract Optical Character Reader as the default OCR engine. This is also the default setting for the product. However, you can manually set the OCR engine in the Settings.txt file, which becomes the default engine. Similar to the prior releases of IQ Bot, you can continue to manually update the Settings.txt file with the OCR engine name you want to set as default.
- Selecting an OCR engine in the interface overrides the settings in the Settings.txt file.
- As Tegaki API OCR requires a separate On-Premises set
up that is not supported in Automation 360 IQ Bot
Cloud, all other OCR engines except
the Tegaki API OCR are available.
You will always have the latest version of the OCR engines supported by Automation 360 IQ Bot Cloud, but cannot select a specific OCR version.
The following table lists the various OCR engines supported in IQ Bot and the corresponding options:
Qualifiers | OCR Version | Supported installation | Handwritten | Languages Supported | Document Quality | Document Type |
---|---|---|---|---|---|---|
Tesseract OCR | 4 | Cloud and On-Premises | N/A |
English German Spanish Italian French |
No noise No dark background No stamps/ watermarks 200+ dpi |
Invoices, POs, etc. Semi-structured formats |
ABBYY FineReader Engine | 12.3, or 12.4 | Cloud and On-Premises | N/A |
English All Latin+ Chinese Japanese Korean |
Less noise Dark background with white fonts Has stamps/ watermarks 96+ dpi |
Invoices, POs, etc. Semi-structured formats Mortgage Forms, Tax Forms Unstructured Formats |
Microsoft Azure Computer Vision OCR engine | 2.0 or 3.2 | Cloud and On-Premises | English only |
English All Latin+ Chinese Japanese Korean |
Less noise Dark background Has stamps/ watermarks 96+ dpi |
Invoices, POs, etc. Semi-structured formats Passports, Driving license, etc. KYC documents |
Google Vision API | Version is updated automatically to match current release | Cloud and On-Premises | N/A |
English All Latin+ Asian |
Less noise Dark background Has stamps/ watermarks 96+ dpi |
Invoices, POs, etc. Semi-structured formats Mortgage Forms, Tax Forms Unstructured Formats |