Supported OCR languages
The default OCR engine for Konfuzio Server supports the following languages:
Print text
Language | Code | Language | Code |
---|---|---|---|
Afrikaans | | Khasi | |
Albanian | | K’iche’ | |
Angika (Devanagari) | | Korean | |
Arabic | | Korku | |
Asturian | | Koryak | |
Awadhi-Hindi (Devanagari) | | Kosraean | |
Azerbaijani (Latin) | | Kumyk (Cyrillic) | |
Bagheli | | Kurdish (Arabic) | |
Basque | | Kurdish (Latin) | |
Belarusian (Cyrillic) | | Kurukh (Devanagari) | |
Belarusian (Latin) | | Kyrgyz (Cyrillic) | |
Bhojpuri-Hindi (Devanagari) | | Lakota | |
Bislama | | Latin | |
Bodo (Devanagari) | | Lithuanian | |
Bosnian (Latin) | | Lower Sorbian | |
Brajbha | | Lule Sami | |
Breton | | Luxembourgish | |
Bulgarian | | Mahasu Pahari (Devanagari) | |
Bundeli | | Malay (Latin) | |
Buryat (Cyrillic) | | Maltese | |
Catalan | | Malto (Devanagari) | |
Cebuano | | Manx | |
Chamling | | Maori | |
Chamorro | | Marathi | |
Chhattisgarhi (Devanagari) | | Mongolian (Cyrillic) | |
Chinese Simplified | | Montenegrin (Cyrillic) | |
Chinese Traditional | | Montenegrin (Latin) | |
Cornish | | Neapolitan | |
Corsican | | Nepali | |
Crimean Tatar (Latin) | | Niuean | |
Croatian | | Nogay | |
Czech | | Northern Sami (Latin) | |
Danish | | Norwegian | |
Dari | | Occitan | |
Dhimal (Devanagari) | | Ossetic | |
Dogri (Devanagari) | | Pashto | |
Dutch | | Persian | |
English | | Polish | |
Erzya (Cyrillic) | | Portuguese | |
Estonian | | Punjabi (Arabic) | |
Faroese | | Ripuarian | |
Fijian | | Romanian | |
Filipino | | Romansh | |
Finnish | | Russian | |
French | | Sadri (Devanagari) | |
Friulian | | Samoan (Latin) | |
Gagauz (Latin) | | Sanskrit (Devanagari) | |
Galician | | Santali (Devanagari) | |
German | | Scots | |
Gilbertese | | Scottish Gaelic | |
Gondi (Devanagari) | | Serbian (Latin) | |
Greenlandic | | Sherpa (Devanagari) | |
Gurung (Devanagari) | | Sirmauri (Devanagari) | |
Haitian Creole | | Skolt Sami | |
Halbi (Devanagari) | | Slovak | |
Hani | | Slovenian | |
Haryanvi | | Somali (Arabic) | |
Hawaiian | | Southern Sami | |
Hindi | | Spanish | |
Hmong Daw (Latin) | | Swahili (Latin) | |
Ho (Devanagari) | | Swedish | |
Hungarian | | Tajik (Cyrillic) | |
Icelandic | | Tatar (Latin) | |
Inari Sami | | Tetum | |
Indonesian | | Thangmi | |
Interlingua | | Tongan | |
Inuktitut (Latin) | | Turkish | |
Irish | | Turkmen (Latin) | |
Italian | | Tuvan | |
Japanese | | Upper Sorbian | |
Jaunsari (Devanagari) | | Urdu | |
Javanese | | Uyghur (Arabic) | |
Kabuverdianu | | Uzbek (Arabic) | |
Kachin (Latin) | | Uzbek (Cyrillic) | |
Kangri (Devanagari) | | Uzbek (Latin) | |
Karachay-Balkar | | Volapük | |
Kara-Kalpak (Cyrillic) | | Walser | |
Kara-Kalpak (Latin) | | Welsh | |
Kashubian | | Western Frisian | |
Kazakh (Cyrillic) | | Yucatec Maya | |
Kazakh (Latin) | | Zhuang | |
Khaling | | Zulu | |
Handwritten text
The detection of handwritten text is supported for the following languages:
Language | Language code | Language | Language code |
---|---|---|---|
English | | Japanese | |
Chinese Simplified | | Korean | |
French | | Portuguese | |
German | | Spanish | |
Italian | | | |
The availability of OCR languages depends on the selected OCR engine and may differ between configurations (e.g. on-premise installations).
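
Because handwriting detection covers far fewer languages than print text, it can help to check a document's language before relying on handwritten OCR. The following is a minimal sketch in Python, not part of the Konfuzio Server API: the names `HANDWRITTEN_LANGUAGES` and `supports_handwriting` are hypothetical, and the set is simply copied from the handwritten-text list above. It matches on language names because no language codes are given here, and it assumes the default OCR engine; other engines or configurations may support a different set.

```python
# Hypothetical helper, not a Konfuzio API: the set below is copied from the
# handwritten-text list in this section and assumes the default OCR engine.
HANDWRITTEN_LANGUAGES = frozenset({
    "English",
    "Chinese Simplified",
    "French",
    "German",
    "Italian",
    "Japanese",
    "Korean",
    "Portuguese",
    "Spanish",
})


def supports_handwriting(language: str) -> bool:
    """Return True if `language` is in the handwritten-text list.

    The comparison ignores case and surrounding whitespace.
    """
    wanted = language.strip().casefold()
    return any(wanted == name.casefold() for name in HANDWRITTEN_LANGUAGES)
```

A call such as `supports_handwriting("German")` looks up the name in the list, while a print-only language such as Dutch is not found.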