Han Simplified Alphabet OCR in C# and .Net

126 More Langauges

IronOCR is a C# software component allowing .NET coders to read text from images and PDF documents in 126 language, including Han Simplified Alphabet.

It is an advanced fork of Tesseract, built exclusively for the .NET developers and regularly outperforms other Tesseract engines for both speed and accuracy.

Contents of IronOcr.Languages.Han

This package contain 400 OCR languages for .NET:

  • HanSimplifiedAlphabet
  • HanSimplifiedAlphabetBest
  • HanSimplifiedAlphabetFast
  • HanSimplifiedVerticalAlphabet
  • HanSimplifiedVerticalAlphabetBest
  • HanSimplifiedVerticalAlphabetFast
  • HanTraditionalAlphabet
  • HanTraditionalAlphabetBest
  • HanTraditionalAlphabetFast
  • HanTraditionalVerticalAlphabet
  • HanTraditionalVerticalAlphabetBest
  • HanTraditionalVerticalAlphabetFast

Download

Han Simplified Alphabet Language Pack [Samhan]

Installation

The first thing we have to do is install our Han Simplified Alphabet OCR package to your .NET project.

PM> Install-Package IronOCR.Languages.Han

Code Example

This C# code example reads Han Simplified Alphabet text from an Image or PDF document.

//PM> Install-Package IronOcr.Languages.Han
using IronOcr;

var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.Han;
using (var Input = new OcrInput(@"images\Han.png"))
{
    var Result = Ocr.Read(Input);
    Var AllText =  Result.Text
}
//PM> Install-Package IronOcr.Languages.Han
using IronOcr;

var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.Han;
using (var Input = new OcrInput(@"images\Han.png"))
{
    var Result = Ocr.Read(Input);
    Var AllText =  Result.Text
}
'PM> Install-Package IronOcr.Languages.Han
Imports IronOcr

Private Ocr = New IronTesseract()
Ocr.Language = OcrLanguage.Han
Using Input = New OcrInput("images\Han.png")
	Dim Result = Ocr.Read(Input)
	Dim AllText As Var = Result.Text
End Using
VB   C#