IronOCR的金融語言包支援:提升財務文件 OCR辨識與擷取圖片文字效果
This article was translated from English: Does it need improvement?
Translated
View the article in English
我可以使用金融語言包來輔助IronOCR讀取資料表嗎?
是的,使用財務語言包並保存為可搜尋的 PDF 文件是實現此目的的非常有效的方法。
IronOCR金融語言包
範例程式碼
// Import the IronOcr library
using IronOcr;
class Program
{
static void Main()
{
// Initialize a new instance of IronTesseract
var Ocr = new IronTesseract();
// Set the OCR language to Financial
Ocr.Language = OcrLanguage.Financial;
// Using statement to ensure proper disposal of resources
using (var input = new OcrInput())
{
// Add a PDF file to be processed by OCR
input.AddPdf("TestPdf_new.pdf");
// Optional: Add image filters if needed to improve OCR
// input.Deskew(); // Corrects rotation and skewing
// input.DeNoise(); // Reduces noise and improves readability
// Perform OCR on the input PDF
var Result = Ocr.Read(input);
// Extract the recognized text
string TestResult = Result.Text;
// Save the result as a searchable PDF using the Financial language pack
Result.SaveAsSearchablePdf("Output_using_Financial_language_pack.pdf");
}
}
}
// Import the IronOcr library
using IronOcr;
class Program
{
static void Main()
{
// Initialize a new instance of IronTesseract
var Ocr = new IronTesseract();
// Set the OCR language to Financial
Ocr.Language = OcrLanguage.Financial;
// Using statement to ensure proper disposal of resources
using (var input = new OcrInput())
{
// Add a PDF file to be processed by OCR
input.AddPdf("TestPdf_new.pdf");
// Optional: Add image filters if needed to improve OCR
// input.Deskew(); // Corrects rotation and skewing
// input.DeNoise(); // Reduces noise and improves readability
// Perform OCR on the input PDF
var Result = Ocr.Read(input);
// Extract the recognized text
string TestResult = Result.Text;
// Save the result as a searchable PDF using the Financial language pack
Result.SaveAsSearchablePdf("Output_using_Financial_language_pack.pdf");
}
}
}
' Import the IronOcr library
Imports IronOcr
Friend Class Program
Shared Sub Main()
' Initialize a new instance of IronTesseract
Dim Ocr = New IronTesseract()
' Set the OCR language to Financial
Ocr.Language = OcrLanguage.Financial
' Using statement to ensure proper disposal of resources
Using input = New OcrInput()
' Add a PDF file to be processed by OCR
input.AddPdf("TestPdf_new.pdf")
' Optional: Add image filters if needed to improve OCR
' input.Deskew(); // Corrects rotation and skewing
' input.DeNoise(); // Reduces noise and improves readability
' Perform OCR on the input PDF
Dim Result = Ocr.Read(input)
' Extract the recognized text
Dim TestResult As String = Result.Text
' Save the result as a searchable PDF using the Financial language pack
Result.SaveAsSearchablePdf("Output_using_Financial_language_pack.pdf")
End Using
End Sub
End Class
$vbLabelText
$csharpLabel
準備好開始了嗎?
Nuget 下載 5,585,834 | 版本: 2026.4 剛剛發布

