German OCR in C# and .NET

Other versions of this document:

IronOCR is a C# software component allowing .NET coders to read text from images and PDF documents in 126 languages, including German.

It is an advanced fork of Tesseract, built exclusively for .NET developers and regularly outperforms other Tesseract engines for both speed and accuracy.

Contents of IronOcr.Languages.German

This package contains 61 OCR languages for .NET:

  • German
  • GermanBest
  • GermanFast
  • GermanFraktur

Download

German Language Pack [Deutsch]

Installation

The first thing we have to do is install our German OCR package to your .NET project.

Install-Package IronOCR.Languages.German

Code Example

This C# code example reads German text from an image or PDF document.

using IronOcr;

var Ocr = new IronTesseract();
// Set the OCR language to German. This must match the language of the document you are scanning.
Ocr.Language = OcrLanguage.German;

using (var Input = new OcrInput(@"images\German.png"))
{
    // Perform OCR on the provided image and get the result.
    var Result = Ocr.Read(Input);
    // Extract all recognized text from the OCR result.
    var AllText = Result.Text;
    // Optionally, output the recognized text to the console for verification.
    Console.WriteLine(AllText);
}
using IronOcr;

var Ocr = new IronTesseract();
// Set the OCR language to German. This must match the language of the document you are scanning.
Ocr.Language = OcrLanguage.German;

using (var Input = new OcrInput(@"images\German.png"))
{
    // Perform OCR on the provided image and get the result.
    var Result = Ocr.Read(Input);
    // Extract all recognized text from the OCR result.
    var AllText = Result.Text;
    // Optionally, output the recognized text to the console for verification.
    Console.WriteLine(AllText);
}
Imports IronOcr

Private Ocr = New IronTesseract()
' Set the OCR language to German. This must match the language of the document you are scanning.
Ocr.Language = OcrLanguage.German

Using Input = New OcrInput("images\German.png")
	' Perform OCR on the provided image and get the result.
	Dim Result = Ocr.Read(Input)
	' Extract all recognized text from the OCR result.
	Dim AllText = Result.Text
	' Optionally, output the recognized text to the console for verification.
	Console.WriteLine(AllText)
End Using
$vbLabelText   $csharpLabel

In this example, IronTesseract is configured to use the German language for OCR, which is necessary for processing images or PDFs containing German text. The OcrInput class is used to specify the image file, and the Read method performs the OCR operation, returning the extracted text.