Hindi OCR in C# and .NET

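IronOCR is a C# software component that lets .NET developers read text from images and PDF documents in 126 languages, including Hindi.

It is an advanced fork of Tesseract, built specifically for .NET developers, and it generally outperforms other Tesseract engines in both speed and accuracy.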

Contents of IronOcr.Languages.Hindi

This package contains 40 OCR languages for .NET:

  • HindiBest
  • HindiFast

Download

Hindi Language Pack [Hindi]

Installation

The first step is to install the Hindi OCR package into your .NET project.

Install-Package IronOCR.Languages.Hindi

Code Example

This C# code example reads Hindi text from an image or PDF document.

// Make sure to install IronOcr.Languages.Hindi package before running the code
using IronOcr;

var Ocr = new IronTesseract(); // Create a new instance of the OCR engine
Ocr.Language = OcrLanguage.Hindi; // Set the language to Hindi

// Load the image file containing Hindi text
using (var Input = new OcrInput(@"images\Hindi.png"))
{
    // Perform OCR on the image
    var Result = Ocr.Read(Input);

    // Extract the recognized text
    var AllText = Result.Text;

    // Optionally, you can print the extracted text
    Console.WriteLine(AllText);
}

The equivalent VB.NET code:

' Make sure to install the IronOcr.Languages.Hindi package before running the code
Imports System
Imports IronOcr

Module Program
	Sub Main()
		Dim Ocr = New IronTesseract() ' Create a new instance of the OCR engine
		Ocr.Language = OcrLanguage.Hindi ' Set the language to Hindi

		' Load the image file containing Hindi text
		Using Input = New OcrInput("images\Hindi.png")
			' Perform OCR on the image
			Dim Result = Ocr.Read(Input)

			' Extract the recognized text
			Dim AllText = Result.Text

			' Optionally, print the extracted text
			Console.WriteLine(AllText)
		End Using
	End Sub
End Module

Note: Make sure the file path to Hindi.png is correct and that the required packages are installed.
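
One way to act on this note is to check that the image exists before handing it to the OCR engine, so a wrong path produces a clear message instead of a library exception. The following is a minimal sketch, not part of the original example: the imagePath variable and the error message are illustrative, and the OcrInput constructor is used the same way as in the code above, which may differ between IronOCR versions.

```csharp
using System;
using System.IO;
using IronOcr;

// Illustrative path, matching the example above; adjust it for your project
string imagePath = Path.Combine("images", "Hindi.png");

// Stop early with a readable message if the image cannot be found
if (!File.Exists(imagePath))
{
    Console.Error.WriteLine($"Image not found: {Path.GetFullPath(imagePath)}");
    return;
}

// Same engine setup as in the main example
var ocr = new IronTesseract { Language = OcrLanguage.Hindi };

using (var input = new OcrInput(imagePath))
{
    // Run OCR and print the recognized Hindi text
    var result = ocr.Read(input);
    Console.WriteLine(result.Text);
}
```

Printing the full path in the error message makes it easier to see which working directory the relative path was resolved against.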