Portuguese OCR in C# and .NET
Other versions of this document:
IronOCR is a C# software component allowing .NET coders to read text from images and PDF documents in 126 languages, including Portuguese.
It is an advanced fork of Tesseract, built exclusively for .NET developers and regularly outperforms other Tesseract engines in both speed and accuracy.
Contents of IronOcr.Languages.Portuguese
This package contains 55 OCR languages for .NET:
- Portuguese
- PortugueseBest
- PortugueseFast
Download
Portuguese Language Pack [português]
Installation
The first thing we have to do is install our Portuguese OCR package to your .NET project.
Install-Package IronOCR.Languages.Portuguese
Code Example
This C# code example reads Portuguese text from an Image or PDF document.
// Required using directive for IronOcr
using IronOcr;
var Ocr = new IronTesseract();
// Specify the language for OCR as Portuguese
Ocr.Language = OcrLanguage.Portuguese;
// Load the image or PDF from which to read the text
using (var Input = new OcrInput(@"images\Portuguese.png"))
{
// Perform OCR on the input
var Result = Ocr.Read(Input);
// Retrieve the recognized text
var AllText = Result.Text;
// Output the extracted text
Console.WriteLine(AllText);
}
// Required using directive for IronOcr
using IronOcr;
var Ocr = new IronTesseract();
// Specify the language for OCR as Portuguese
Ocr.Language = OcrLanguage.Portuguese;
// Load the image or PDF from which to read the text
using (var Input = new OcrInput(@"images\Portuguese.png"))
{
// Perform OCR on the input
var Result = Ocr.Read(Input);
// Retrieve the recognized text
var AllText = Result.Text;
// Output the extracted text
Console.WriteLine(AllText);
}
' Required using directive for IronOcr
Imports IronOcr
Private Ocr = New IronTesseract()
' Specify the language for OCR as Portuguese
Ocr.Language = OcrLanguage.Portuguese
' Load the image or PDF from which to read the text
Using Input = New OcrInput("images\Portuguese.png")
' Perform OCR on the input
Dim Result = Ocr.Read(Input)
' Retrieve the recognized text
Dim AllText = Result.Text
' Output the extracted text
Console.WriteLine(AllText)
End Using
This code demonstrates how to set up and use the IronOCR library to read Portuguese text from an image. Ensure that the path to the image or PDF document is correct. The recognized text will be stored in the AllText
variable, and it is printed to the console.