IQ Bot 11.x: Use Google Vision API OCR engine

Use the Google Vision API OCR engine with IQ Bot to improve the accuracy of the optical character recognition (OCR) results for training documents in Asian languages, particularly in Japanese and Korean. Google Vision API OCR engine in IQ Bot supports all languages supported by the engine.

Prerequisites

Verify that the default port 443 is available to enable communication for the API endpoint using TCP.

Note: See a list of allowed service URLs for using Google Vision API in the example section below.
Google Vision API files are installed on your machine automatically during the IQ Bot installation and is provided as a built-in plug-in, with a single-step installation, and is available for all users:
  • It is a cloud OCR engine.
  • Google Vision API does not support documents with more than one language. Before using this feature, ensure you want to primarily extract Japanese or Korean text only.
  • Languages that are written right to left, rather than left to right, are not supported within IQ Bot for document extraction. For example, Arabic, Aramaic, Azeri, Divehi, Fula, Hebrew, Kurdish, N'ko, Persian, Rohingya, Syriac, and Urdu.

    Prior to this release, the engine primarily supported data extraction for Japanese and Korean text only.

    IQ Bot 11.x: List of languages in IQ Bot

  • Data is not stored in Google Cloud and is removed immediately.

    For additional information on privacy for data retention see: Google Data Usage

IQ Bot provides you with the license to use the Google Vision API OCR engine. You can also use your own license key.

IQ Bot 11.x: Use your own license keys for Google Vision API OCR engine

You have the following options for using this OCR engine:
Options Description
Option 1: Set Google Vision API as your default engine Configure the Settings.txt file.
Restriction: Configuring the settings.txt file is not applicable to Automation 360 IQ Bot Cloud
Option 2: Select Google Vision API engine directly from the UI when creating a learning instance IQ Bot 11.x: Selecting an OCR engine

Procedure

To set Google Vision API as your default OCR engine, configure the Settings.txt file as follows:

Restriction: Configuring the settings.txt file is not applicable to Automation 360 IQ Bot Cloud

  1. Run the IQ Bot installer.
  2. Navigate to ..\Automation Anywhere IQ Bot\Configurations\Settings.txt, and modify the following value: OCREngine=GoogleVisionAPI.
    This will set Google Vision API OCR engine as the default engine for your environment.
  3. The GoogleOCREngineSettings.json file is generated within the ..\Automation Anywhere IQ Bot\Configurations folder with default values as follows:
    • Use the following with Google Vision API:
      
      {
       "ADCJson": null,
        "EngineType": 2,
        "EnableDebugging": false,
        "DebuggingFolder": "C:\\Users\\Public\\Documents\\Automation Anywhere IQBot Platform\\Logs\\Engine\\Google",
        "EnableCustomParse": false,
        "DistanceThreshold": 15
       }
      
    • Use the following with Google Vision API in IQ Bot Version 11.3.5 and later:
      
      {
       "ADCJson": null,
        "EngineType": 2,
        "EnableDebugging": false,
        "DebuggingFolder": "C:\\Users\\Public\\Documents\\Automation Anywhere IQBot Platform\\Logs\\Engine\\Google",
        }
      
  4. For Japanese and Korean language documents, update the field: "EnableCustomParse": true.
    Note: A service restart is not required after making the change.
List of service URLs used for Google Vision API (optional)
Use the following allowed URLs if required:
Note: All URLs point to a global service region in the US.
  • API URL: https://vision.googleapis.com/
  • Auth URL: https://accounts.google.com/o/oauth2/auth
  • Token URL: https://oauth2.googleapis.com/token
  • Auth Provider URL: https://www.googleapis.com/oauth2/v1/certs
  • Client Auth URL: https://www.googleapis.com/robot/v1/metadata/x509

Next steps

  1. Create a new learning instance with the Asian language document you want to train.
  2. Train the document and set your learning instance to production.
  3. Upload Asian language files and run the bot.
  4. Download and view the accuracy of the extraction results.