Additional OCR Language Packs
IronOCR supports 125 international languages, but only English is installed as standard.
Additional language packs can be added to your C#, VB or ASP.NET project via NuGet, or as DLLs which can be downloaded and added as project references.
Code Examples
International Language Example
//PM> Install-Package IronOcr.Languages.ChineseSimplified
using IronOcr;
IronTesseract ocr = new IronTesseract();
ocr.Language = OcrLanguage.ChineseSimplified;
using (var input = new OcrInput())
{
    input.AddImage("img/chinese.gif");
    // Add image filters if needed
    // input.Deskew();
    // input.DeNoise();
    OcrResult result = ocr.Read(input);
    string TestResult = result.Text;
    // The console may not display Unicode correctly. Save to disk instead.
    result.SaveAsTextFile("chinese.txt");
}
'PM> Install-Package IronOcr.Languages.ChineseSimplified
Imports IronOcr
Dim ocr As New IronTesseract()
ocr.Language = OcrLanguage.ChineseSimplified
Using input = New OcrInput()
    input.AddImage("img/chinese.gif")
    ' Add image filters if needed
    ' input.Deskew()
    ' input.DeNoise()
    Dim result As OcrResult = ocr.Read(input)
    Dim TestResult As String = result.Text
    ' The console may not display Unicode correctly. Save to disk instead.
    result.SaveAsTextFile("chinese.txt")
End Using
Vertically Written Language Example
Dictionaries tuned for vertically written text. Use the 'Vertical' variant of the Korean or Japanese OcrLanguage.
using IronOcr;
IronTesseract ocr = new IronTesseract();
ocr.Language = OcrLanguage.JapaneseVertical;
using (var input = new OcrInput(@"images\image.png"))
{
    OcrResult result = ocr.Read(input);
    Console.WriteLine(result.Text);
}
Imports IronOcr
Dim ocr As New IronTesseract()
ocr.Language = OcrLanguage.JapaneseVertical
Using input = New OcrInput("images\image.png")
    Dim result As OcrResult = ocr.Read(input)
    Console.WriteLine(result.Text)
End Using
Custom Language Example
Use any Tesseract .traineddata language file that you have downloaded or trained yourself.
using IronOcr;
IronTesseract ocr = new IronTesseract();
ocr.UseCustomTesseractLanguageFile("custom_tesseract_files/custom.traineddata");
using (var input = new OcrInput(@"images\image.png"))
{
    OcrResult result = ocr.Read(input);
    Console.WriteLine(result.Text);
}
Imports IronOcr
Dim ocr As New IronTesseract()
ocr.UseCustomTesseractLanguageFile("custom_tesseract_files/custom.traineddata")
Using input = New OcrInput("images\image.png")
    Dim result As OcrResult = ocr.Read(input)
    Console.WriteLine(result.Text)
End Using
Multiple Language Example
Read more than one language at a time.
//PM> Install-Package IronOcr.Languages.Arabic
using IronOcr;
IronTesseract ocr = new IronTesseract();
ocr.Language = OcrLanguage.English;
ocr.AddSecondaryLanguage(OcrLanguage.Arabic);
// Add any number of languages
using (var input = new OcrInput(@"images\multi-lang.pdf"))
{
    var Result = ocr.Read(input);
    Console.WriteLine(Result.Text);
}
'PM> Install-Package IronOcr.Languages.Arabic
Imports IronOcr
Dim ocr As New IronTesseract()
ocr.Language = OcrLanguage.English
ocr.AddSecondaryLanguage(OcrLanguage.Arabic)
' Add any number of languages
Using input = New OcrInput("images\multi-lang.pdf")
    Dim Result = ocr.Read(input)
    Console.WriteLine(Result.Text)
End Using
Faster Language Example
Dictionaries tuned for speed. Use the 'Fast' variant of any OcrLanguage.
using IronOcr;
IronTesseract ocr = new IronTesseract();
ocr.Language = OcrLanguage.EnglishFast;
using (var input = new OcrInput(@"images\image.png"))
{
    OcrResult result = ocr.Read(input);
    Console.WriteLine(result.Text);
}
Imports IronOcr
Dim ocr As New IronTesseract()
ocr.Language = OcrLanguage.EnglishFast
Using input = New OcrInput("images\image.png")
    Dim result As OcrResult = ocr.Read(input)
    Console.WriteLine(result.Text)
End Using
Higher Accuracy Detail Language Example
Dictionaries tuned for accuracy, at the cost of much slower results. Use the 'Best' variant of any OcrLanguage.
//PM> Install-Package IronOcr.Languages.French
using IronOcr;
IronTesseract ocr = new IronTesseract();
ocr.Language = OcrLanguage.FrenchBest;
using (var input = new OcrInput(@"images\image.png"))
{
    OcrResult result = ocr.Read(input);
    Console.WriteLine(result.Text);
}
'PM> Install-Package IronOcr.Languages.French
Imports IronOcr
Dim ocr As New IronTesseract()
ocr.Language = OcrLanguage.FrenchBest
Using input = New OcrInput("images\image.png")
    Dim result As OcrResult = ocr.Read(input)
    Console.WriteLine(result.Text)
End Using
How To Install OCR Language Packs
Additional OCR language packs are available for download below. Either:
- Install the NuGet package. Search NuGet for IronOcr Languages.
- Or download the "ocrdata" file and add it to your .NET project in any folder you like. Set CopyToOutputDirectory = CopyIfNewer.
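For the manual download route, the copy setting can also be written directly in the project file. The fragment below is a minimal sketch, assuming an SDK-style .csproj. The folder and file name are hypothetical placeholders for wherever you saved the downloaded file. In MSBuild, the Visual Studio option "Copy if newer" is written as PreserveNewest.

```xml
<ItemGroup>
  <!-- Hypothetical path: replace with the location of your downloaded language file -->
  <None Update="ocrdata\example.ocrdata">
    <!-- PreserveNewest is the MSBuild value behind "Copy if newer" -->
    <CopyToOutputDirectory>PreserveNewest</CopyToOutputDirectory>
  </None>
</ItemGroup>
```

After a build, the file should appear in the output directory under the same relative folder, next to your compiled assembly.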
Download OCR Language Packs
- Ancient Greek Language Pack (Ἑλληνική): Zip, NuGet
- Assamese Language Pack (অসমীয়া): Zip, NuGet
- Breton Language Pack (brezhoneg): Zip, NuGet
- Canadian Aboriginal Alphabet Language Pack (Canadian First Nations): Zip, NuGet
- Cebuano Language Pack (Bisaya): Zip, NuGet
- Cherokee Language Pack (ᏣᎳᎩ ᎦᏬᏂᎯᏍᏗ): Zip, NuGet
- Cyrillic Language Pack (Cyrillic scripts): Zip, NuGet
- Devanagari Language Pack (Nagari): Zip, NuGet
- Divehi Language Pack (ދިވެހި): Zip, NuGet
- Dzongkha Language Pack (རྫོང་ཁ): Zip, NuGet
- Faroese Language Pack (føroyskt): Zip, NuGet
- Filipino Language Pack (The Philippines): Zip, NuGet
- Financial Language Pack (Spreadsheets & Numbers): Zip, NuGet
- Fraktur Language Pack (Generic Fraktur): Zip, NuGet
- Frankish Language Pack (Frenkisk): Zip, NuGet
- Gurmukhi Alphabet Language Pack (Gurmukhī): Zip, NuGet
- Hangul Language Pack (Hangul Alphabet): Zip, NuGet
- Inuktitut Language Pack (ᐃᓄᒃᑎᑐᑦ): Zip, NuGet
- Javanese Language Pack (basa Jawa): Zip, NuGet
- Malay Language Pack (bahasa Melayu): Zip, NuGet
- Malayalam Language Pack (മലയാളം): Zip, NuGet
- Maori Language Pack (te reo Māori): Zip, NuGet
- MICR Language Pack (Magnetic Ink Character Recognition): Zip, NuGet
- Middle English Language Pack (English, 1100-1500 AD): Zip, NuGet
- Middle French Language Pack (Moyen Français): Zip, NuGet
- Myanmar Language Pack (Burmese): Zip, NuGet
- Northern Kurdish Language Pack (Kurmanji): Zip, NuGet
- Occitan Language Pack (occitan): Zip, NuGet
- Quechua Language Pack (Runa Simi): Zip, NuGet
- Sanskrit Language Pack (संस्कृतम्): Zip, NuGet
- Scottish Gaelic Language Pack (Gàidhlig): Zip, NuGet
- Syriac Language Pack (Syrian): Zip, NuGet
- Tibetan Language Pack (Tibetan Standard): Zip, NuGet
- Tonga Language Pack (faka Tonga): Zip, NuGet
- Yoruba Language Pack (Yorùbá): Zip, NuGet
Help
If the language you want to read is not available in the list above, please get in touch with us. Many other languages are available on request.
Priority for production resources is given to IronOCR licensees, so please also consider licensing IronOCR for access to your desired language pack.