Hangul Alphabet OCR in C# and .NET

IronOCR is a C# software component allowing .NET coders to read text from images and PDF documents in 126 languages, including the Hangul Alphabet.

It is an advanced fork of Tesseract, built exclusively for .NET developers, and it regularly outperforms other Tesseract engines in both speed and accuracy.

Contents of IronOcr.Languages.Hangul

This package contains 6 OCR language variants for .NET:

  • HangulAlphabet
  • HangulAlphabetBest
  • HangulAlphabetFast
  • HangulVerticalAlphabet
  • HangulVerticalAlphabetBest
  • HangulVerticalAlphabetFast

Download

Hangul Alphabet Language Pack [Korean Alphabet]

Installation

The first step is to install the Hangul Alphabet OCR package in your .NET project.

Install-Package IronOcr.Languages.Hangul

Code Example

This C# code example reads Hangul Alphabet text from an image or PDF document.

// Ensure the IronOCR library is installed
// PM> Install-Package IronOcr.Languages.Hangul

using System; // Needed for Console
using IronOcr;

class Program
{
    static void Main()
    {
        // Create a new instance of IronTesseract for performing OCR
        var Ocr = new IronTesseract();

        // Specify the Hangul language pack for better accuracy with Korean texts
        Ocr.Language = OcrLanguage.Hangul;

        // Load the image or PDF you want to read
        using (var Input = new OcrInput(@"images\Hangul.png"))
        {
            // Perform OCR to read the text from the image
            var Result = Ocr.Read(Input);

            // Extract the recognized text
            var AllText = Result.Text;

            // Output the text to the console
            Console.WriteLine(AllText);
        }
    }
}
' VB.NET version of the same example
' Ensure the IronOCR library is installed
' PM> Install-Package IronOcr.Languages.Hangul

Imports IronOcr

Friend Class Program
	Shared Sub Main()
		' Create a new instance of IronTesseract for performing OCR
		Dim Ocr = New IronTesseract()

		' Specify the Hangul language pack for better accuracy with Korean texts
		Ocr.Language = OcrLanguage.Hangul

		' Load the image or PDF you want to read
		Using Input = New OcrInput("images\Hangul.png")
			' Perform OCR to read the text from the image
			Dim Result = Ocr.Read(Input)

			' Extract the recognized text
			Dim AllText = Result.Text

			' Output the text to the console
			Console.WriteLine(AllText)
		End Using
	End Sub
End Class

Explanation:

  • IronTesseract: This is the main class used to perform OCR operations.
  • OcrLanguage.Hangul: This specifies that the OCR engine should use the Hangul language pack, which optimizes the engine for Korean text recognition.
  • OcrInput: This is a container for the images or PDFs to be processed.
  • Ocr.Read(): This method performs the OCR operation and returns a result object containing the recognized text.
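
The four pieces listed above can be combined into a small reusable helper. The following is a minimal sketch, not an official IronOCR pattern: it uses only the calls shown in the example on this page (IronTesseract, OcrLanguage.Hangul, OcrInput, Read, and Text), while the class name HangulOcrSketch, the method name ReadHangulText, and the early file check are hypothetical additions for illustration.

```csharp
using System;
using System.IO;
using IronOcr;

static class HangulOcrSketch
{
    // Hypothetical helper (not part of IronOCR): returns the text recognized in one image file.
    public static string ReadHangulText(string imagePath)
    {
        // Fail early with a clear message rather than handing a bad path to the OCR engine.
        if (!File.Exists(imagePath))
            throw new FileNotFoundException("Input image not found.", imagePath);

        // Same setup as the example above: one engine instance, Hangul language pack.
        var ocr = new IronTesseract();
        ocr.Language = OcrLanguage.Hangul;

        // OcrInput is disposed as soon as the read completes.
        using (var input = new OcrInput(imagePath))
        {
            var result = ocr.Read(input);
            return result.Text;
        }
    }

    static void Main(string[] args)
    {
        // Use the first command-line argument if given, otherwise the sample path from this page.
        string path = args.Length > 0 ? args[0] : @"images\Hangul.png";
        Console.WriteLine(ReadHangulText(path));
    }
}
```

Keeping the OCR call behind a single method makes it easier to reuse from other code and to decide in one place how a missing input file should be reported.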