Latin Alphabet OCR in C# and .NET

Curtis Chau

已更新:2026年4月22日

Translated

View the article in English

还有126种语言

IronOCR 是一个 C# 软件组件，允许 .NET 程序员从图像和 PDF 文档中读取 126 种语言（包括拉丁字母）的文本。

它是 Tesseract 的一个高级分支，专为 .NET 开发人员构建，在速度和准确性方面通常优于其他 Tesseract 引擎。

IronOcr.Languages.LatinAlphabet 的内容

此软件包包含 64 种适用于 .NET 的 OCR 语言：

拉丁字母拉丁字母Best

拉丁字母速记

下载

拉丁字母语言包 [latine]

下载ZIP文件
使用NuGet安装

安装

我们首先需要做的是将我们的拉丁字母OCR 包安装到您的 .NET 项目中。

Install-Package IronOcr.Languages.LatinAlphabet

代码示例

此 C# 代码示例从图像或 PDF 文档中读取拉丁字母文本。

// Install the IronOCR.languages.LatinAlphabet package first
using IronOcr;

var Ocr = new IronTesseract(); // Initialize IronTesseract instance

// Set the OCR language to LatinAlphabet
Ocr.Language = OcrLanguage.LatinAlphabet;

// Define the input image or PDF you want to read
using (var Input = new OcrInput(@"images\LatinAlphabet.png"))
{
    // Perform OCR reading on the input
    var Result = Ocr.Read(Input);

    // Extract the recognized text
    var AllText = Result.Text;

    // Output the recognized text
    Console.WriteLine(AllText);
}

// Install the IronOCR.languages.LatinAlphabet package first
using IronOcr;

var Ocr = new IronTesseract(); // Initialize IronTesseract instance

// Set the OCR language to LatinAlphabet
Ocr.Language = OcrLanguage.LatinAlphabet;

// Define the input image or PDF you want to read
using (var Input = new OcrInput(@"images\LatinAlphabet.png"))
{
    // Perform OCR reading on the input
    var Result = Ocr.Read(Input);

    // Extract the recognized text
    var AllText = Result.Text;

    // Output the recognized text
    Console.WriteLine(AllText);
}

' Install the IronOCR.languages.LatinAlphabet package first
Imports IronOcr

Private Ocr = New IronTesseract() ' Initialize IronTesseract instance

' Set the OCR language to LatinAlphabet
Ocr.Language = OcrLanguage.LatinAlphabet

' Define the input image or PDF you want to read
Using Input = New OcrInput("images\LatinAlphabet.png")
	' Perform OCR reading on the input
	Dim Result = Ocr.Read(Input)

	' Extract the recognized text
	Dim AllText = Result.Text

	' Output the recognized text
	Console.WriteLine(AllText)
End Using

$vbLabelText $csharpLabel

解释

IronTesseract 初始化：初始化 IronTesseract 的实例，该实例将负责处理 OCR 任务。
语言设置：OCR 语言已设置为 LatinAlphabet，这是 IronOCR 软件包中可用的语言之一。
输入规范：创建一个 OcrInput 对象，指定用于提取文本的图像或 PDF 文件的路径。
OCR 执行：调用 Read 实例的 IronTesseract 方法来处理 OcrInput。这将返回一个包含提取文本的 Result 对象。
文本提取：Text 对象的 Result 属性用于访问识别出的文本。

6.输出：将识别出的文本打印到控制台进行验证。

请确保 OcrInput 中的文件路径正确指向您的图片或 PDF 文件，以避免出现"文件未找到"的异常。

客户亮点：

开发者焦点：

网络研讨会：

开始免费 30 天试用

本页内容

Latin Alphabet OCR in C# and .NET

IronOcr.Languages.LatinAlphabet 的内容

下载

安装

代码示例

解释

钢铁支援团队

开始免费 30 天试用

本页内容

Latin Alphabet OCR in C# and .NET

IronOcr.Languages.LatinAlphabet 的内容

下载

安装

代码示例

解释

下一步：开始免费 30 天试用

Thank You

下一步：开始免费 30 天试用

Want to deploy IronSuite to a live project for FREE?

What’s included?

深受全球数百万工程师信赖

钢铁支援团队