Korean OCR in C# and .NET

Other versions of this document:

IronOCR is a C# software component allowing .NET coders to read text from images and PDF documents in 126 language, including Korean.

It is an advanced fork of Tesseract, built exclusively for the .NET developers and regularly outperforms other Tesseract engines for both speed and accuracy.

Contents of IronOcr.Languages.Korean

This package contains 108 OCR languages for .NET:

  • Korean
  • KoreanBest
  • KoreanFast
  • KoreanVertical
  • KoreanVerticalBest
  • KoreanVerticalFast

Download

Korean Language Pack [한국어 (韓國語)]

Installation

The first thing we have to do is install our Korean OCR package to your .NET project.

PM> Install-Package IronOCR.Languages.Korean

Code Example

This C# code example reads Korean text from an Image or PDF document.

//PM> Install-Package IronOcr.Languages.Korean
using IronOcr;

var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.Korean;
using (var Input = new OcrInput(@"images\Korean.png"))
{
    var Result = Ocr.Read(Input);
    var AllText =  Result.Text;
}
//PM> Install-Package IronOcr.Languages.Korean
using IronOcr;

var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.Korean;
using (var Input = new OcrInput(@"images\Korean.png"))
{
    var Result = Ocr.Read(Input);
    var AllText =  Result.Text;
}
'PM> Install-Package IronOcr.Languages.Korean
Imports IronOcr

Private Ocr = New IronTesseract()
Ocr.Language = OcrLanguage.Korean
Using Input = New OcrInput("images\Korean.png")
	Dim Result = Ocr.Read(Input)
	Dim AllText = Result.Text
End Using
VB   C#