Dodatkowe pakiety językowe OCR
IronOCR obsługuje 125 języków międzynarodowych, ale tylko angielski jest zainstalowany w IronOCR standardowo.
Dodatkowe pakiety językowe można łatwo dodać do swojego projektu C#, VB lub ASP .NET poprzez NuGet lub jako DLL, które można pobrać i dodać jako odniesienia do projektu.
Przykłady kodu
Przykład języka międzynarodowego
Install-Package IronOcr.Languages.ChineseSimplified
using IronOcr;
var ocr = new IronTesseract();
// Set the OCR to use Chinese Simplified
ocr.Language = OcrLanguage.ChineseSimplified;
using (var input = new OcrInput())
{
// Add an image to be processed
input.AddImage("img/chinese.gif");
// Optional: Enhance the input by deskewing or denoising the image
// input.Deskew();
// input.DeNoise();
// Process the image and retrieve the result
var result = ocr.Read(input);
// Store the recognized text in a string
string testResult = result.Text;
// Save the recognized text to a file since the console might not display Unicode characters properly
result.SaveAsTextFile("chinese.txt");
}
using IronOcr;
var ocr = new IronTesseract();
// Set the OCR to use Chinese Simplified
ocr.Language = OcrLanguage.ChineseSimplified;
using (var input = new OcrInput())
{
// Add an image to be processed
input.AddImage("img/chinese.gif");
// Optional: Enhance the input by deskewing or denoising the image
// input.Deskew();
// input.DeNoise();
// Process the image and retrieve the result
var result = ocr.Read(input);
// Store the recognized text in a string
string testResult = result.Text;
// Save the recognized text to a file since the console might not display Unicode characters properly
result.SaveAsTextFile("chinese.txt");
}
Imports IronOcr
Private ocr = New IronTesseract()
' Set the OCR to use Chinese Simplified
ocr.Language = OcrLanguage.ChineseSimplified
Using input = New OcrInput()
' Add an image to be processed
input.AddImage("img/chinese.gif")
' Optional: Enhance the input by deskewing or denoising the image
' input.Deskew();
' input.DeNoise();
' Process the image and retrieve the result
Dim result = ocr.Read(input)
' Store the recognized text in a string
Dim testResult As String = result.Text
' Save the recognized text to a file since the console might not display Unicode characters properly
result.SaveAsTextFile("chinese.txt")
End Using
Przykład języka pisanego pionowo
Słowniki dostosowane do języków pisanych pionowo. Użyj wariantu 'Vertical' dla koreańskiego i japońskiego OcrLanguage.
using IronOcr;
var ocr = new IronTesseract();
// Set the OCR to use Japanese Vertical language
ocr.Language = OcrLanguage.JapaneseVertical;
using (var input = new OcrInput(@"images\image.png"))
{
// Process the image and get the OCR result
var result = ocr.Read(input);
// Output the recognized text to the console
Console.WriteLine(result.Text);
}
using IronOcr;
var ocr = new IronTesseract();
// Set the OCR to use Japanese Vertical language
ocr.Language = OcrLanguage.JapaneseVertical;
using (var input = new OcrInput(@"images\image.png"))
{
// Process the image and get the OCR result
var result = ocr.Read(input);
// Output the recognized text to the console
Console.WriteLine(result.Text);
}
Imports IronOcr
Private ocr = New IronTesseract()
' Set the OCR to use Japanese Vertical language
ocr.Language = OcrLanguage.JapaneseVertical
Using input = New OcrInput("images\image.png")
' Process the image and get the OCR result
Dim result = ocr.Read(input)
' Output the recognized text to the console
Console.WriteLine(result.Text)
End Using
Przykład języka niestandardowego
Dla użycia dowolnego pliku językowego .traineddata Tesseract, który pobrałeś lub wyszkoliłeś samodzielnie.
using IronOcr;
var ocr = new IronTesseract();
// Use a custom Tesseract language file
ocr.UseCustomTesseractLanguageFile("custom_tesseract_files/custom.traineddata");
using (var input = new OcrInput(@"images\image.png"))
{
// Process the image and get the OCR result
var result = ocr.Read(input);
// Output the recognized text to the console
Console.WriteLine(result.Text);
}
using IronOcr;
var ocr = new IronTesseract();
// Use a custom Tesseract language file
ocr.UseCustomTesseractLanguageFile("custom_tesseract_files/custom.traineddata");
using (var input = new OcrInput(@"images\image.png"))
{
// Process the image and get the OCR result
var result = ocr.Read(input);
// Output the recognized text to the console
Console.WriteLine(result.Text);
}
Imports IronOcr
Private ocr = New IronTesseract()
' Use a custom Tesseract language file
ocr.UseCustomTesseractLanguageFile("custom_tesseract_files/custom.traineddata")
Using input = New OcrInput("images\image.png")
' Process the image and get the OCR result
Dim result = ocr.Read(input)
' Output the recognized text to the console
Console.WriteLine(result.Text)
End Using
Przykład wielojęzyczny
Więcej niż jeden język na raz.
Install-Package IronOcr.Languages.Arabic
using IronOcr;
var ocr = new IronTesseract();
// Set the primary language to English
ocr.Language = OcrLanguage.English;
// Add Arabic as a secondary language
ocr.AddSecondaryLanguage(OcrLanguage.Arabic);
// Add any number of languages
using (var input = new OcrInput(@"images\multi-lang.pdf"))
{
// Process the PDF and get the OCR result
var result = ocr.Read(input);
// Output the recognized text to the console
Console.WriteLine(result.Text);
}
using IronOcr;
var ocr = new IronTesseract();
// Set the primary language to English
ocr.Language = OcrLanguage.English;
// Add Arabic as a secondary language
ocr.AddSecondaryLanguage(OcrLanguage.Arabic);
// Add any number of languages
using (var input = new OcrInput(@"images\multi-lang.pdf"))
{
// Process the PDF and get the OCR result
var result = ocr.Read(input);
// Output the recognized text to the console
Console.WriteLine(result.Text);
}
Imports IronOcr
Private ocr = New IronTesseract()
' Set the primary language to English
ocr.Language = OcrLanguage.English
' Add Arabic as a secondary language
ocr.AddSecondaryLanguage(OcrLanguage.Arabic)
' Add any number of languages
Using input = New OcrInput("images\multi-lang.pdf")
' Process the PDF and get the OCR result
Dim result = ocr.Read(input)
' Output the recognized text to the console
Console.WriteLine(result.Text)
End Using
Przykład szybszego języka
Słowniki dostosowane do szybkiego działania. Użyj wariantu 'Fast' dla dowolnego OcrLanguage.
using IronOcr;
var ocr = new IronTesseract();
// Set the OCR to use the fast variant of English
ocr.Language = OcrLanguage.EnglishFast;
using (var input = new OcrInput(@"images\image.png"))
{
// Process the image and get the OCR result
var result = ocr.Read(input);
// Output the recognized text to the console
Console.WriteLine(result.Text);
}
using IronOcr;
var ocr = new IronTesseract();
// Set the OCR to use the fast variant of English
ocr.Language = OcrLanguage.EnglishFast;
using (var input = new OcrInput(@"images\image.png"))
{
// Process the image and get the OCR result
var result = ocr.Read(input);
// Output the recognized text to the console
Console.WriteLine(result.Text);
}
Imports IronOcr
Private ocr = New IronTesseract()
' Set the OCR to use the fast variant of English
ocr.Language = OcrLanguage.EnglishFast
Using input = New OcrInput("images\image.png")
' Process the image and get the OCR result
Dim result = ocr.Read(input)
' Output the recognized text to the console
Console.WriteLine(result.Text)
End Using
Przykład języka z wyższą dokładnością
Słowniki dostosowane do dokładności, ale dużo wolniejsze wyniki. Użyj wariantu 'Best' dla dowolnego OcrLanguage.
Install-Package IronOcr.Languages.French
using IronOcr;
var ocr = new IronTesseract();
// Set the OCR to use the best variant of French
ocr.Language = OcrLanguage.FrenchBest;
using (var input = new OcrInput(@"images\image.png"))
{
// Process the image and get the OCR result
var result = ocr.Read(input);
// Output the recognized text to the console
Console.WriteLine(result.Text);
}
using IronOcr;
var ocr = new IronTesseract();
// Set the OCR to use the best variant of French
ocr.Language = OcrLanguage.FrenchBest;
using (var input = new OcrInput(@"images\image.png"))
{
// Process the image and get the OCR result
var result = ocr.Read(input);
// Output the recognized text to the console
Console.WriteLine(result.Text);
}
Imports IronOcr
Private ocr = New IronTesseract()
' Set the OCR to use the best variant of French
ocr.Language = OcrLanguage.FrenchBest
Using input = New OcrInput("images\image.png")
' Process the image and get the OCR result
Dim result = ocr.Read(input)
' Output the recognized text to the console
Console.WriteLine(result.Text)
End Using
Jak zainstalować pakiety językowe OCR
Dodatkowe pakiety językowe OCR są dostępne do pobrania poniżej. Albo
- Zainstaluj pakiet NuGet. Wyszukaj NuGet dla języków IronOcr.
- Lub pobierz plik "ocrdata" i dodaj go do swojego projektu .NET w dowolnym folderze, który chcesz. Ustaw
CopyToOutputDirectory = CopyIfNewer
Pobierz pakiety językowe OCR
- Ancient Greek Language Pack Ἑλληνική Zip NuGet
- Assamese Language Pack অসমীযা Zip NuGet
- Breton Language Pack brezhoneg Zip NuGet
- Canadian Aboriginal Alphabet Language Pack Canadian First Nations Zip NuGet
- Cebuano Language Pack Bisaya Zip NuGet
- Cherokee Language Pack ᏣᎳᎩ ᎦᏬᏂᎯᏍᏗ Zip NuGet
- Cyrillic Language Pack Cyrillic scripts Zip NuGet
- Devanagari Language Pack Nagair Zip NuGet
- Faroese Language Pack føroyskt Zip NuGet
- Filipino Language Pack The Philippines Zip NuGet
- Financial Language Pack Spreadsheets & Numbers Zip NuGet
- Fraktur Language Pack Generic Fraktur Zip NuGet
- Frankish Language Pack Frenkisk Zip NuGet
- Gurmukhi Alphabet Language Pack Gurmukhī Zip NuGet
- Hangul Language Pack Hangul Alphabet Zip NuGet
- Inuktitut Language Pack ᐃᓄᒃᑎᑐᑦ Zip NuGet
- Javanese Language Pack basa Jawa Zip NuGet
- Latin Alphabet Language Pack latine Zip NuGet
- Malay Language Pack bahasa Melayu Zip NuGet
- Malayalam Language Pack മലയാളം Zip NuGet
- Maori Language Pack te reo Māori Zip NuGet
- MICR Language Pack Magnetic Ink Character Recognition Zip NuGet
- Middle English Language Pack English (1100-1500 AD) Zip NuGet
- Middle French Language Pack Moyen Français Zip NuGet
- Myanmar Language Pack Burmese Zip NuGet
- Northern Kurdish Language Pack Kurmanji Zip NuGet
- Norwegian Language Pack Norsk Zip NuGet
- Occitan Language Pack occitan Zip NuGet
- Quechua Language Pack Runa Simi Zip NuGet
- Sanskrit Language Pack ससकतम Zip NuGet
- Scottish Gaelic Language Pack Gàidhlig Zip NuGet
- Syriac Language Pack Syrian Zip NuGet
- Tibetan Language Pack Tibetan Standard Zip NuGet
- Tonga Language Pack faka Tonga Zip NuGet
Pomoc
Jeśli język, który chcesz przeczytać, nie jest dostępny na powyższej liście, prosimy skontaktować się z nami. Wiele innych języków jest dostępnych na życzenie.
Priorytet w dostępie do zasobów produkcyjnych mają posiadacze licencji IronOCR, dlatego prosimy również rozważyć licencjonowanie IronOCR, aby uzyskać dostęp do żądanego pakietu językowego.

