Portuguese OCR in C# and .NET

IronOCR is a C# software component allowing .NET coders to read text from images and PDF documents in 126 languages, including Portuguese.

It is an advanced fork of Tesseract, built exclusively for .NET developers, and it regularly outperforms other Tesseract engines in both speed and accuracy.

Contents of IronOcr.Languages.Portuguese

This package contains the following OCR languages for .NET:

  • Portuguese
  • PortugueseBest
  • PortugueseFast

Download

Portuguese Language Pack [português]

Installation

First, install the Portuguese OCR package in your .NET project.

Install-Package IronOcr.Languages.Portuguese

Code Example

This C# code example reads Portuguese text from an image or PDF document.

// Required using directives for Console and IronOcr
using System;
using IronOcr;

var Ocr = new IronTesseract();

// Specify the language for OCR as Portuguese
Ocr.Language = OcrLanguage.Portuguese;

// Load the image or PDF from which to read the text
using (var Input = new OcrInput(@"images\Portuguese.png"))
{
    // Perform OCR on the input
    var Result = Ocr.Read(Input);

    // Retrieve the recognized text
    var AllText = Result.Text;

    // Output the extracted text
    Console.WriteLine(AllText);
}

The same example in VB.NET:

' Required Imports statement for IronOcr
Imports IronOcr

Dim Ocr = New IronTesseract()

' Specify the language for OCR as Portuguese
Ocr.Language = OcrLanguage.Portuguese

' Load the image or PDF from which to read the text
Using Input = New OcrInput("images\Portuguese.png")
	' Perform OCR on the input
	Dim Result = Ocr.Read(Input)

	' Retrieve the recognized text
	Dim AllText = Result.Text

	' Output the extracted text
	Console.WriteLine(AllText)
End Using

This code demonstrates how to set up and use the IronOCR library to read Portuguese text from an image. Ensure that the path to the image or PDF document is correct. The recognized text is stored in the AllText variable and then printed to the console.
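
The pack also lists PortugueseBest and PortugueseFast. The sketch below shows how one of them might be selected in place of the default. It assumes these variants are exposed as OcrLanguage.PortugueseBest and OcrLanguage.PortugueseFast, and that, as with Tesseract's "best" and "fast" trained models, the first favors accuracy and the second favors speed. This page does not confirm either point, so check the behavior against your own documents. The image path is the same placeholder used in the example above.

```csharp
using System;
using IronOcr;

var ocr = new IronTesseract();

// Assumption: PortugueseBest trades speed for accuracy.
// Swap in OcrLanguage.PortugueseFast to favor speed instead.
ocr.Language = OcrLanguage.PortugueseBest;

// Placeholder path: point this at your own image or PDF
using (var input = new OcrInput(@"images\Portuguese.png"))
{
    var result = ocr.Read(input);
    Console.WriteLine(result.Text);
}
```

Running the same document through each variant and comparing the output is a simple way to decide which one suits a given workload.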