Thai Alphabet OCR in C# and .NET

Other versions of this document:

IronOCR is a C# software component allowing .NET coders to read text from images and PDF documents in 126 languages, including the Thai Alphabet.

It is an advanced fork of Tesseract, built exclusively for .NET developers and regularly outperforms other Tesseract engines for both speed and accuracy.

Contents of IronOcr.Languages.Thai

This package contains 96 OCR languages for .NET:

  • Thai
  • ThaiBest
  • ThaiFast
  • ThaiAlphabet
  • ThaiAlphabetBest
  • ThaiAlphabetFast

Download

Thai Alphabet Language Pack [ไทย]

Installation

The first thing we have to do is install our Thai Alphabet OCR package to your .NET project.

Install-Package IronOCR.Languages.Thai

Code Example

This C# code example reads Thai Alphabet text from an Image or PDF document.

// Ensure you have installed the IronOCR.Languages.Thai package via NuGet.
// Import the IronOcr namespace to work with IronOCR classes.
using IronOcr; 

class ThaiOcrExample
{
    static void Main()
    {
        // Create a new instance of IronTesseract for OCR processing
        var ocr = new IronTesseract();

        // Set the language to Thai for Optical Character Recognition
        ocr.Language = OcrLanguage.Thai;

        // Using the 'using' statement ensures that resources are properly disposed.
        using (var input = new OcrInput(@"images\Thai.png"))
        {
            // Perform OCR to read the text from the input image
            var result = ocr.Read(input);

            // Retrieve and store all recognized text from the image
            string allText = result.Text;

            // Optionally, you can output the text to console or log it as needed
            System.Console.WriteLine(allText);
        }
    }
}
// Ensure you have installed the IronOCR.Languages.Thai package via NuGet.
// Import the IronOcr namespace to work with IronOCR classes.
using IronOcr; 

class ThaiOcrExample
{
    static void Main()
    {
        // Create a new instance of IronTesseract for OCR processing
        var ocr = new IronTesseract();

        // Set the language to Thai for Optical Character Recognition
        ocr.Language = OcrLanguage.Thai;

        // Using the 'using' statement ensures that resources are properly disposed.
        using (var input = new OcrInput(@"images\Thai.png"))
        {
            // Perform OCR to read the text from the input image
            var result = ocr.Read(input);

            // Retrieve and store all recognized text from the image
            string allText = result.Text;

            // Optionally, you can output the text to console or log it as needed
            System.Console.WriteLine(allText);
        }
    }
}
' Ensure you have installed the IronOCR.Languages.Thai package via NuGet.
' Import the IronOcr namespace to work with IronOCR classes.
Imports IronOcr

Friend Class ThaiOcrExample
	Shared Sub Main()
		' Create a new instance of IronTesseract for OCR processing
		Dim ocr = New IronTesseract()

		' Set the language to Thai for Optical Character Recognition
		ocr.Language = OcrLanguage.Thai

		' Using the 'using' statement ensures that resources are properly disposed.
		Using input = New OcrInput("images\Thai.png")
			' Perform OCR to read the text from the input image
			Dim result = ocr.Read(input)

			' Retrieve and store all recognized text from the image
			Dim allText As String = result.Text

			' Optionally, you can output the text to console or log it as needed
			System.Console.WriteLine(allText)
		End Using
	End Sub
End Class
$vbLabelText   $csharpLabel

In this example, we read Thai text from an image named Thai.png located in the images folder. Make sure to replace the file path with your actual image location. The OCR language is set to Thai using OcrLanguage.Thai to specify the Thai language package for recognition.