Latvian OCR in C# and .NET

Other versions of this document:

IronOCR is a C# software component allowing .NET coders to read text from images and PDF documents in 126 languages, including Latvian.

It is an advanced fork of Tesseract, built exclusively for .NET developers and regularly outperforms other Tesseract engines for both speed and accuracy.

Contents of IronOcr.Languages.Latvian

This package contains 46 OCR languages for .NET:

  • Latvian
  • LatvianBest
  • LatvianFast

Download

Latvian Language Pack [latviešu valoda]

Installation

The first thing you need to do is install the Latvian OCR package into your .NET project.

Install-Package IronOCR.Languages.Latvian

Code Example

This C# code example reads Latvian text from an image or PDF document.

using IronOcr;

class LatvianOCRExample
{
    static void Main()
    {
        // Create an instance of IronTesseract for OCR
        var Ocr = new IronTesseract();

        // Set the OCR language to Latvian
        Ocr.Language = OcrLanguage.Latvian;

        // Define the input image file path
        using (var Input = new OcrInput(@"images\Latvian.png"))
        {
            // Perform OCR on the input image
            var Result = Ocr.Read(Input);

            // Extract all recognized text from the OCR result
            var AllText = Result.Text;

            // Output the recognized text to the console
            Console.WriteLine(AllText);
        }
    }
}
using IronOcr;

class LatvianOCRExample
{
    static void Main()
    {
        // Create an instance of IronTesseract for OCR
        var Ocr = new IronTesseract();

        // Set the OCR language to Latvian
        Ocr.Language = OcrLanguage.Latvian;

        // Define the input image file path
        using (var Input = new OcrInput(@"images\Latvian.png"))
        {
            // Perform OCR on the input image
            var Result = Ocr.Read(Input);

            // Extract all recognized text from the OCR result
            var AllText = Result.Text;

            // Output the recognized text to the console
            Console.WriteLine(AllText);
        }
    }
}
Imports IronOcr

Friend Class LatvianOCRExample
	Shared Sub Main()
		' Create an instance of IronTesseract for OCR
		Dim Ocr = New IronTesseract()

		' Set the OCR language to Latvian
		Ocr.Language = OcrLanguage.Latvian

		' Define the input image file path
		Using Input = New OcrInput("images\Latvian.png")
			' Perform OCR on the input image
			Dim Result = Ocr.Read(Input)

			' Extract all recognized text from the OCR result
			Dim AllText = Result.Text

			' Output the recognized text to the console
			Console.WriteLine(AllText)
		End Using
	End Sub
End Class
$vbLabelText   $csharpLabel

Explanation

  • IronOcr Namespace: The example begins by importing the IronOcr namespace, necessary for accessing OCR functionalities.
  • IronTesseract Instance: The IronTesseract object (Ocr) is created to handle image reading and text extraction.
  • Language Configuration: The OCR process is configured to read Latvian text by setting the Ocr.Language property.
  • OcrInput Object: An OcrInput object is created, referencing the image file to be processed.
  • Reading the Image: The Read method is called on the Ocr instance to process the image and extract text, stored in the Result variable.
  • Result Extraction: The OCR results are accessed via Result.Text and stored in AllText for further use or display.
  • Console Output: The recognized Latvian text is printed to the console, providing a visual confirmation of text extraction.