Corsican OCR in C# and .NET

IronOCR is a C# software component allowing .NET coders to read text from images and PDF documents in 126 languages, including Corsican.

It is an advanced fork of Tesseract, built exclusively for .NET developers, and it regularly outperforms other Tesseract engines in both speed and accuracy.

Contents of IronOcr.Languages.Corsican

This package contains three Corsican OCR language variants for .NET:

  • Corsican
  • CorsicanBest
  • CorsicanFast

Download

Corsican Language Pack [corsu]

Installation

First, install the Corsican OCR package into your .NET project.

Install-Package IronOcr.Languages.Corsican

Code Example

This C# code example reads Corsican text from an image or PDF document.

// First, ensure the IronOcr.Languages.Corsican package is installed
// You can use the NuGet Package Manager console:
// PM> Install-Package IronOcr.Languages.Corsican

using IronOcr;

var Ocr = new IronTesseract();

// Set the OCR language to Corsican
Ocr.Language = OcrLanguage.Corsican;

// Reading text from an image file
using (var Input = new OcrInput(@"images\Corsican.png"))
{
    // Perform OCR on the input image
    var Result = Ocr.Read(Input);

    // Extract all recognized text from the OCR result
    var AllText = Result.Text;
}

The same example in VB.NET:

' First, ensure the IronOcr.Languages.Corsican package is installed
' You can use the NuGet Package Manager console:
' PM> Install-Package IronOcr.Languages.Corsican

Imports IronOcr

Dim Ocr = New IronTesseract()

' Set the OCR language to Corsican
Ocr.Language = OcrLanguage.Corsican

' Reading text from an image file
Using Input = New OcrInput("images\Corsican.png")
	' Perform OCR on the input image
	Dim Result = Ocr.Read(Input)

	' Extract all recognized text from the OCR result
	Dim AllText = Result.Text
End Using

In this code:

  • We create an instance of IronTesseract which is used to perform OCR.
  • We specify the language as Corsican using OcrLanguage.Corsican.
  • We read from an input image called Corsican.png.
  • The Read method of Ocr performs the OCR and returns the result, from which we can extract the recognized text.
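
The package contents above also list CorsicanBest and CorsicanFast. The sketch below shows one way to switch between them. It assumes these variants are exposed as OcrLanguage.CorsicanBest and OcrLanguage.CorsicanFast, mirroring OcrLanguage.Corsican. The preferSpeed flag is a hypothetical name used only for illustration, and the image path is the one from the example above.

```csharp
using System;
using IronOcr;

var ocr = new IronTesseract();

// Hypothetical flag for this sketch: trade accuracy for speed, or the reverse.
bool preferSpeed = true;

// Assumption: the Fast and Best variants listed in the package contents are
// available as OcrLanguage members alongside OcrLanguage.Corsican.
ocr.Language = preferSpeed
    ? OcrLanguage.CorsicanFast
    : OcrLanguage.CorsicanBest;

// Same input image as in the main example.
using (var input = new OcrInput(@"images\Corsican.png"))
{
    // Run OCR with the selected variant and print the recognized text.
    var result = ocr.Read(input);
    Console.WriteLine(result.Text);
}
```

As a rough guide, a Fast variant is likely to suit large batches of clean scans, while a Best variant is likely to suit difficult or low-quality images. It is worth measuring both on your own documents before settling on one.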