Skip to main content
Here is the list of internal names of the predefined languages that are supported in ABBYY FineReader Engine. Availability of this or that predefined recognition language depends on the availability of the corresponding modules among ABBYY FineReader Engine files. See the Installation section to know which recognition languages correspond to which ABBYY FineReader Engine modules. ABBYY FineReader Engine provides core recognition languages for OCR and ICR with full built-in dictionary support. Some recognition languages are available only for OCR, or do not have full built-in dictionary support. See details in the table below.
ABBYY FineReader Engine for Windows also provides a set of specific recognition languages. These languages contain special language units (addresses, date and time, human names, etc.). Such languages can be used for field recognition. See the list of special predefined languages for more information.
Internal nameRecognition languageCan be used for OCRFull dictionary support availableHandwritten (++) or handprinted (+) text supported1Can be used for text-based classification2Can be used for BCR
AbkhazAbkhaz+
AdygheAdyghe+
AfrikaansAfrikaans++
AgulAgul+
AlbanianAlbanian++
AltaicAltaic+
ArabicArabic (Saudi Arabia)++3+
ArmenianEasternArmenian (Eastern)+++
ArmenianGrabarArmenian (Grabar)+++
ArmenianWesternArmenian (Western)+++
AwarAvar+
AymaraAymara++
AzeriCyrillicAzerbaijani (Cyrillic)+
AzeriLatinAzerbaijani (Latin)++++
BanglaBangla+
BashkirBashkir+++
BasicBasic programming language+
BasqueBasque++
BelarusianBelarussian+
BembaBemba++
BlackfootBlackfoot++
BretonBreton++
BugotuBugotu++
BulgarianBulgarian++++
BurmeseBurmese+
BuryatBuryat++
C++C/C++ programming language+
CatalanCatalan+++
ChamorroChamorro++
ChechenChechen+
ChemistrySimple chemical formulas+
ChinesePRCChinese Simplified++
ChineseTaiwanChinese Traditional++
ChukchaChukcha+
ChuvashChuvash+
CMC7For MICR (CMC-7) text type4+
COBOLCOBOL programming language+
CorsicanCorsican++
CrimeanTatarCrimean Tatar++
CroatianCroatian++++
CrowCrow++
CzechCzech+++++
DanishDanish+++++
DargwaDargwa+
DigitsNumbers++
DunganDungan+
DutchDutch (Netherlands)+++++
DutchBelgianDutch (Belgium)++++
E13BFor MICR (E-13B) text type4+
EnglishEnglish++++<br />(including handwritten)++
EskimoCyrillicEskimo (Cyrillic)+
EskimoLatinEskimo (Latin)+
EsperantoEsperanto+
EstonianEstonian+++++
EvenEven++
EvenkiEvenki++
FaeroeseFaeroese+
FarsiFarsi+++
FijianFijian++
FinnishFinnish+++++
FortranFortran programming language+
FrenchFrench++++<br />(including handwritten)++
FrisianFrisian++
FriulianFriulian++
GaelicScottishScottish Gaelic5++
GagauzGagauz+
GalicianGalician++
GandaGanda++
GeorgianGeorgian6+
GermanGerman++++<br />(including handwritten)++
GermanLuxembourgGerman (Luxembourg)++
GermanNewSpellingGerman (new spelling)++++
GreekGreek+++++
GuaraniGuarani++
HaniHani++
HausaHausa+
HawaiianHawaiian++
HebrewHebrew+++
HungarianHungarian+++++
IcelandicIcelandic+
IdoIdo++
IndonesianIndonesian+++++
IngushIngush+
InterlinguaInterlingua++
IrishIrish5++
ItalianItalian+++++
JapaneseJapanese++++<br />(including handwritten)++
JapaneseModernJapanese (Modern)++++
JavaJava programming language+
KabardianKabardian+
KalmykKalmyk+
KarachayBalkarKarachay-Balkar++
KarakalpakKarakalpak+
KasubKasub++
KawaKawa++
KazakhKazakh++
KhakasKhakas+
KhantyKhanty+
KikuyuKikuyu+
KirgizKirghiz++
KongoKongo++
KoreanKorean++++
KoreanHangulKorean (Hangul)+++
KoryakKoryak+
KpelleKpelle++
KumykKumyk++
KurdishKurdish++
LakLak+
LappishSami (Lappish)++
LatinLatin++++
LatvianLatvian++++
LatvianGothicLatvian language written in Gothic script+
LezginLezgin+
LithuanianLithuanian++++
LubaLuba++
MacedonianMacedonian+
MalagasyMalagasy++
MalayMalay+
MalinkeMalinke++
MalteseMaltese+
MansiMansi+
MaoriMaori++
MathematicalMathematical+
MariMari+
MayaMaya++
MiaoMiao++
MinankabawMinangkabau++
MohawkMohawk++
MongolMongol++
MordvinMordvin++
NahuatlNahuatl++
NenetsNenets++
NivkhNivkh++
NogayNogay++
NorwegianNorwegianNynorsk and NorwegianBokmal+++++
NorwegianBokmalNorwegian (Bokmal)+++++
NorwegianNynorskNorwegian (Nynorsk)+++++
NyanjaNyanja++
OccidentalOccidental+
OcrAFor OCR-A text type+
OcrBFor OCR-B text type+
OjibwayOjibway++
OldEnglishOld English++++
OldFrenchOld French++++
OldGermanOld German++++
OldItalianOld Italian++++
OldSlavonicOld Slavonic+
OldSpanishOld Spanish++++
OsseticOssetian+
PapiamentoPapiamento++
PascalPascal programming language+
PidginEnglishTok Pisin++
PolishPolish+++++
PortugueseBrazilianPortuguese (Brazil)+++++
PortugueseStandardPortuguese (Portugal)+++++
ProvencalProvencal+
QuechuaQuechua++
RhaetoRomanicRhaeto-Romanic++
RomanianRomanian++++
RomanianMoldaviaRomanian (Moldavia)++
RomanyRomany++
RuandaRuanda++
RundiRundi++
RussianOldSpellingRussian (old spelling)+++
RussianRussian+++++
RussianWithAccentRussian (with accents marking stress position)+++
SamoanSamoan++
SelkupSelkup++
SerbianCyrillicSerbian (Cyrillic)++
SerbianLatinSerbian (Latin)++
ShonaShona+
SiouxSioux (Dakota)++
SlovakSlovak++++
SlovenianSlovenian++++
SomaliSomali++
SorbianSorbian+
SothoSotho++
SpanishSpanish++++<br />(including handwritten)++
SundaSunda+
SwahiliSwahili++
SwaziSwazi++
SwedishSwedish+++++
TabassaranTabassaran+
TagalogTagalog++
TahitianTahitian++
TajikTajik++
TatarTatar+++
ThaiThai+++
TinpoJingpo++
TonganTongan++
TswanaTswana++
TunTun++
TurkishTurkish+++++
TurkmenTurkmen+
TurkmenLatinTurkmen (Latin)++
TuvinTuvan++
UdmurtUdmurt+
UighurCyrillicUighur (Cyrillic)+
UighurLatinUighur (Latin)++
UkrainianUkrainian+++++
UzbekCyrillicUzbek (Cyrillic)+
UzbekLatinUzbek (Latin)++
VietnameseVietnamese+++
VisayanCebuano++
WelshWelsh+
WolofWolof++
XhosaXhosa++
YakutYakut+
YiddishYiddish+7
ZapotecZapotec++
ZuluZulu+
1Several languages support recognizing handwritten text: English, German, French, Japanese, and Spanish. Other languages marked in this column support only handprinted text. The same settings (IPageAnalysisParams::DetectHandwritten and IRecognizerParams::TextTypes = TT_Handwritten) enable recognizing handwritten or handprinted text, depending on which option the language supports. 2The classifier which uses only image characteristics can be used for documents in any language. The text-based classifiers (ClassifierTypeEnum::CT_Combined, ClassifierTypeEnum::CT_Text) are only available for recognized documents in languages which have full dictionary support. 3 Arabic ICR is not supported. However, handprinted Arabic digits can be recognized. See Recognizing Handprinted Arabic Digits. 4 If you would like to recognize a block with the MICR text type, use only languages with the Latin characters and not the combinations of Latin and CJK languages. 5 FineReader Engine doesn’t support some of the special symbols with diacritics in Gaelic Scottish and Irish languages. 6 The Nuskhuri and Mtavruli characters are recognized separately from each other, but both types of the characters are saved in the Unicode strings for Nuskhuri. 7 A few standard characters (veys בֿ, pasekh alef אַ, komets alef אָ, pasekh tsvey yudn ײַ, melupm vov וּ) are not supported in the predefined Yiddish language. To recognize these characters, create a new custom language and add these characters to it using the LetterSet property of the TextLanguage object (see Working with Languages), then set the new language as recognition language. For Windows, use the scenario described in Recognizing with Training and Training to recognize ligatures. See also LanguageIdEnum Working with Languages