Using Custom OCR Language Packs in IronOCR
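How do I create a custom language pack for IronOCR?
Creating a custom language pack requires training a new Tesseract 4 LSTM language file/dictionary from a font.
Many tutorials online walk through the required steps in detail. The process is not simple, but fortunately it is well documented.
As a starting point, we recommend this YouTube tutorial by Gabriel Garcia (no affiliation) and the GitHub repository linked from it.
When training is complete, the output is a .traineddata file.
The .traineddata file can then be referenced in IronOCR as follows:
Documentation: IronOCR Custom Languages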
using System;
using IronOcr;

class Program
{
    static void Main()
    {
        // Initialize the IronTesseract OCR engine
        var Ocr = new IronTesseract();
        // Load your custom Tesseract language file (trained .traineddata file)
        Ocr.UseCustomTesseractLanguageFile("mydir/custom.traineddata"); //<--- your new font
        // Multiple fonts can be used by calling the method multiple times with different files
        // Load an image into the OCR Input for processing
        using (var Input = new OcrInput(@"images\image.png"))
        {
            // Perform OCR on the input image
            var Result = Ocr.Read(Input);
            // Output the recognized text to the console
            Console.WriteLine(Result.Text);
        }
    }
}
Imports IronOcr

Friend Class Program
    Shared Sub Main()
        ' Initialize the IronTesseract OCR engine
        Dim Ocr = New IronTesseract()
        ' Load your custom Tesseract language file (trained .traineddata file)
        Ocr.UseCustomTesseractLanguageFile("mydir/custom.traineddata") '<--- your new font
        ' Multiple fonts can be used by calling the method multiple times with different files
        ' Load an image into the OCR Input for processing
        Using Input = New OcrInput("images\image.png")
            ' Perform OCR on the input image
            Dim Result = Ocr.Read(Input)
            ' Output the recognized text to the console
            Console.WriteLine(Result.Text)
        End Using
    End Sub
End Class
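The code comments above note that several custom fonts can be loaded by calling UseCustomTesseractLanguageFile once per file. The following is a minimal sketch of that pattern, not an official sample. The second file name (custom2.traineddata) is hypothetical, and the File.Exists check is an added assumption about how you might want to fail early when a path is wrong.

```csharp
using System;
using System.IO;
using IronOcr;

class Program
{
    static void Main()
    {
        // Hypothetical paths: replace with the locations of your own trained files
        string[] languageFiles =
        {
            "mydir/custom.traineddata",
            "mydir/custom2.traineddata"
        };

        var ocr = new IronTesseract();

        foreach (var path in languageFiles)
        {
            // Added check (an assumption, not part of the original sample):
            // stop with a clear message if a trained file is missing
            if (!File.Exists(path))
            {
                throw new FileNotFoundException("Custom language file not found", path);
            }

            // One call per .traineddata file
            ocr.UseCustomTesseractLanguageFile(path);
        }

        // Same input and read steps as in the sample above
        using (var input = new OcrInput(@"images\image.png"))
        {
            var result = ocr.Read(input);
            Console.WriteLine(result.Text);
        }
    }
}
```

Keeping the paths in one array makes it easier to add or remove trained fonts without touching the OCR logic.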

