Custom OCR Language Packs with IronOCR
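How do I create a custom language pack for IronOCR?
Creating a custom language pack requires training a new Tesseract 4 LSTM language file/dictionary from a font.
Many online tutorials explain the steps involved. The process is not simple, but fortunately it is well documented.
As a starting point, we recommend the YouTube tutorial by Gabriel Garcia (who has no affiliation with us) and the GitHub repository it links to.
Once training is complete, the output is a .traineddata file.
The .traineddata file can then be referenced in IronOCR as follows:
Documentation: IronOCR Custom Languages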
using System;
using IronOcr;

class Program
{
    static void Main()
    {
        // Initialize the IronTesseract OCR engine
        var Ocr = new IronTesseract();

        // Load your custom Tesseract language file (the trained .traineddata file)
        Ocr.UseCustomTesseractLanguageFile("mydir/custom.traineddata"); // <--- your new font
        // Multiple fonts can be used by calling the method multiple times with different files

        // Load an image into the OCR input for processing
        using (var Input = new OcrInput(@"images\image.png"))
        {
            // Perform OCR on the input image
            var Result = Ocr.Read(Input);

            // Output the recognized text to the console
            Console.WriteLine(Result.Text);
        }
    }
}
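The comment in the sample above notes that several fonts can be loaded by calling the method once per file. A minimal sketch of that pattern follows. The second file name (`custom-bold.traineddata`) is hypothetical and used only for illustration, and the `File.Exists` check is an added precaution of ours, not something IronOCR requires.

```csharp
using System;
using System.IO;
using IronOcr;

class Program
{
    static void Main()
    {
        // Hypothetical paths: replace with your own trained .traineddata files
        string[] languageFiles =
        {
            "mydir/custom.traineddata",
            "mydir/custom-bold.traineddata"
        };

        var ocr = new IronTesseract();

        foreach (var path in languageFiles)
        {
            // Fail early with a clear message if a trained file is missing
            if (!File.Exists(path))
            {
                throw new FileNotFoundException("Custom language file not found", path);
            }

            // One call per file, as described in the sample above
            ocr.UseCustomTesseractLanguageFile(path);
        }

        // Same image path as in the sample above
        using (var input = new OcrInput(@"images\image.png"))
        {
            var result = ocr.Read(input);
            Console.WriteLine(result.Text);
        }
    }
}
```

Checking for the files up front keeps a path typo from surfacing later as a harder-to-read OCR error.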