Document Automation high-level data flow

Document Automation uses different data flows for different features available in the Control Room. Review the high-level data flow to understand how data traverses for each feature through different components.

Data extraction using generative AI providers

The following image shows the end-to-end data flow through different components for generative AI providers:
On-Premises
Data flow diagram for generative AI providers in Document Automation On-Premises
Cloud
Data flow diagram for generative AI providers in Document Automation Cloud

The following sections represent different stages of data flow through different components when using generative AI providers:

Stage 1: Uploading files to the Control Room
On-Premises
Data flow diagram showing file upload to Control Room in On-Premises deployment
Cloud
Data flow diagram showing file upload to Control Room in Cloud deployment

The user uploads files to the Control Room or a scheduler bot uploads the file from a shared location. The files are temporarily uploaded to the Control Room storage services.

Stage 2: Document extraction process using
On-Premises
Data flow diagram of document processing using OCR in On-Premises deployment
Cloud
Data flow diagram of document processing using OCR in Cloud deployment

The first step involved in document extraction is processing documents for OCR. Documents are processed using ABBYY OCR on the Bot Runner device or using Google Vision OCR via the Automation Anywhere proxy gateway.

On-Premises
Data flow diagram of document extraction process using generative AI providers in On-Premises deployment
Cloud
Data flow diagram of document extraction process using generative AI providers in Cloud deployment

The Bot Runner device initiates the data extraction process using either the Automation Anywhere pre-trained models or third-party Cloud extraction services.

  • Automation Anywhere pre-trained models: Data extraction is processed on the Bot Runner device.
  • Third-party Cloud extraction:
    • Document Automation subscriptions: Data extraction requests are sent and received from the third-party Cloud extraction services via the Automation Anywhere proxy gateway.
    • Bring your own key (BYOK): Data extraction requests are sent and received directly from the third-party Cloud extraction services without using the Automation Anywhere proxy gateway.
Stage 3: Downloading output
On-Premises
Data flow diagram of downloading output in On-Premises deployment
Cloud
Data flow diagram of downloading output in Cloud deployment

The data extraction results are downloaded on the network path defined by the user as CSV or JSON. Customers typically create bots to upload this information to downstream applications or system of records.

Data extraction using Microsoft Azure AI Document Intelligence

The following image shows the end-to-end data flow through different components for Microsoft Azure AI Document Intelligence:
On-Premises
Data flow diagram for Microsoft Azure AI Document Intelligence in On-Premises deployment
Cloud
Data flow diagram for Microsoft Azure AI Document Intelligence in Cloud deployment
Stage 1: Uploading files and fetching configuration details
On-Premises
Data flow diagram showing file upload to the On-Premises Control Room
Cloud
Data flow diagram showing file upload to the Cloud Control Room

The user uploads files to the Control Room or a scheduler bot uploads the file from a shared location. The files are temporarily uploaded to the Control Room storage services.

Stage 2: Data extraction process
On-Premises
Data flow diagram of document extraction process using Microsoft Azure AI Document Intelligence services in On-Premises deployment
Cloud
Data flow diagram of document extraction process using Microsoft Azure AI Document Intelligence services in Cloud deployment

The Bot Runner device initiates the OCR and data extraction process using Microsoft Azure AI Document Intelligence services. Data extraction requests are sent and received directly from Microsoft Azure AI Document Intelligence services for both Document Automation subscriptions and BYOK.

Stage 3: Downloading output
On-Premises
Data flow diagram of downloading output in On-Premises deployment
Cloud
Data flow diagram of downloading output in Cloud deployment

The data extraction results are downloaded on the network path defined by the user as CSV or JSON. Customers typically create bots to upload this information to downstream applications or system of records.

Video: Document Automation architecture

On-Premises deployments using generative AI providers

The following video shows how the data flows in the On-Premises deployment when using generative AI providers:

Cloud deployments using generative AI providers

The following video shows how the data flows in the Cloud deployment when using generative AI providers: