Language support matrix and OCR engines

Document Automation supports various languages for document extraction. This topic provides lists of supported languages in Document Automation based on various document types and OCR engines.

Document type

The following table provides a list of supported languages in Document Automation for various document types:

Language Document type
Invoices Receipts Standard Forms Unstructured document User-defined Document types such as Arrival Notice, Bill of Landing, Packing List, Utility bill, Waybill
English X X X X X X
German X X X X X
French X X X X X
Spanish X X X X X
Afrikaans X X X
Albanian X X X
Arabic X
Azerbaijani X
Basque X
Belarusian X
Bosnian X
Bulgarian X
Catalan X
Cebuano X
Chinese (Simplified) X X X
Chinese (Traditional) X X X
Croatian X X X
Czech X X X
Danish X X X
Dutch X X X X X
Esperanto X
Estonian X X X X
Filipino X
Finnish X X X
Flemish
Gailician X
Greek X
Haitian Creole X
Hebrew X
Hindi X
Hungarian X X X
Icelandic X X X
Indonesian X X X
Irish X
Italian X X X X
Japanese X X X X
Javanese X
Kazakh X
Korean X X X
Kyrgyz X
Latin X
Latvian X X
Lithuanian X X
Macedonian X
Malay X X X
Maltese X
Marathi X
Mongolian X
Nepali X
Norwegian X X X
Pashto X
Persian X
Polish X X X
Portuguese X X X X
Romanian X X X X
Russian X X X
Sanskrit X
Serbian (Latin) X
Slovak X X X
Slovenian X X X
Swahili X
Swedish X X X X
Tagalog X X X
Turkish X
Ukrainian X
Urdu X
Uzbek X
Vietnamese X
Welsh X
Yiddish X
Zulu X

OCR Engine

The following table provides the links to supported languages for all Document Automation supported OCR engines:

Document Automation supported OCR engines List of supported languages
ABBYY FineReader Engine ABBYY FineReader Engine OCR supported languages
Google Vision API OCR Language Support