Sanskrit OCR in C# and .NET

126 More Languages

IronOCR is a C# software component allowing .NET coders to read text from images and PDF documents in 126 language, including Sanskrit.

It is an advanced fork of Tesseract, built exclusively for the .NET developers and regularly outperforms other Tesseract engines for both speed and accuracy.

Contents of IronOcr.Languages.Sanskrit

This package contains 49 OCR languages for .NET:

  • Sanskrit
  • SanskritBest
  • SanskritFast

Download

Sanskrit Language Pack [संस्कृतम्]

Installation

The first thing we have to do is install our Sanskrit OCR package to your .NET project.

PM> Install-Package IronOCR.Languages.Sanskrit

Code Example

This C# code example reads Sanskrit text from an Image or PDF document.

//PM> Install-Package IronOcr.Languages.Sanskrit
using IronOcr;

var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.Sanskrit;
using (var Input = new OcrInput(@"images\Sanskrit.png"))
{
    var Result = Ocr.Read(Input);
    var AllText =  Result.Text;
}
//PM> Install-Package IronOcr.Languages.Sanskrit
using IronOcr;

var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.Sanskrit;
using (var Input = new OcrInput(@"images\Sanskrit.png"))
{
    var Result = Ocr.Read(Input);
    var AllText =  Result.Text;
}
'PM> Install-Package IronOcr.Languages.Sanskrit
Imports IronOcr

Private Ocr = New IronTesseract()
Ocr.Language = OcrLanguage.Sanskrit
Using Input = New OcrInput("images\Sanskrit.png")
	Dim Result = Ocr.Read(Input)
	Dim AllText = Result.Text
End Using
VB   C#