Financial Language Pack Support in IronOCR


Can I use the Financial language pack to help IronOCR read data tables?

Yes. Running OCR with the Financial language pack and saving the result as a searchable PDF is a very effective way to do this.

IronOCR Financial language pack

IronOCR Financial language pack documentation

Example code

// Import the IronOcr library
using IronOcr;

class Program
{
    static void Main()
    {
        // Initialize a new instance of IronTesseract
        var ocr = new IronTesseract();

        // Set the OCR language to Financial
        ocr.Language = OcrLanguage.Financial;

        // Using statement to ensure proper disposal of resources
        using (var input = new OcrInput())
        {
            // Add a PDF file to be processed by OCR
            input.AddPdf("TestPdf_new.pdf");

            // Optional: apply image filters if needed to improve OCR accuracy
            // input.Deskew();    // Corrects rotation and skewing
            // input.DeNoise();   // Reduces noise and improves readability

            // Perform OCR on the input PDF
            var result = ocr.Read(input);

            // Extract the recognized text
            string text = result.Text;

            // Save the result as a searchable PDF (recognized with the Financial language pack)
            result.SaveAsSearchablePdf("Output_using_Financial_language_pack.pdf");
        }
    }
}
' VB.NET version of the same example
' Import the IronOcr library
Imports IronOcr

Friend Class Program
	Shared Sub Main()
		' Initialize a new instance of IronTesseract
		Dim ocr = New IronTesseract()

		' Set the OCR language to Financial
		ocr.Language = OcrLanguage.Financial

		' Using statement to ensure proper disposal of resources
		Using input = New OcrInput()
			' Add a PDF file to be processed by OCR
			input.AddPdf("TestPdf_new.pdf")

			' Optional: apply image filters if needed to improve OCR accuracy
			' input.Deskew()    ' Corrects rotation and skewing
			' input.DeNoise()   ' Reduces noise and improves readability

			' Perform OCR on the input PDF
			Dim result = ocr.Read(input)

			' Extract the recognized text
			Dim text As String = result.Text

			' Save the result as a searchable PDF (recognized with the Financial language pack)
			result.SaveAsSearchablePdf("Output_using_Financial_language_pack.pdf")
		End Using
	End Sub
End Class
Curtis Chau
Technical Writer

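Curtis Chau holds a bachelor's degree in computer science from Carleton University and specializes in front-end development, with expertise in Node.js, TypeScript, JavaScript, and React. He is passionate about building intuitive, attractive user interfaces, and enjoys working with modern frameworks and creating well-structured, visually appealing manuals.

Beyond development, Curtis has a strong interest in the Internet of Things (IoT) and explores new ways to integrate hardware and software. In his free time, he enjoys gaming and building Discord bots, combining his love of technology with creativity.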
