Japanese OCR in C# and .Net
Other versions of this document:
IronOCR is a C# software component allowing .NET coders to read text from images and PDF documents in 126 language, including Japanese.
It is an advanced fork of Tesseract, built exclusively for the .NET developers and regularly outperforms other Tesseract engines for both speed and accuracy.
Contents of IronOcr.Languages.Japanese
This package contain 286 OCR languages for .NET:
- JapaneseAlphabet
- JapaneseAlphabetBest
- JapaneseAlphabetFast
- JapaneseVerticalAlphabet
- JapaneseVerticalAlphabetBest
- JapaneseVerticalAlphabetFast
- Japanese
- JapaneseBest
- JapaneseFast
- JapaneseVertical
- JapaneseVerticalBest
- JapaneseVerticalFast
Download
Japanese Language Pack [日本語 (にほんご)]
Installation
The first thing we have to do is install our Japanese OCR package to your .NET project.
PM> Install-Package IronOCR.Languages.Japanese
Code Example
This C# code example reads Japanese text from an Image or PDF document.
//PM> Install-Package IronOcr.Languages.Japanese
using IronOcr;
var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.Japanese;
using (var Input = new OcrInput(@"images\Japanese.png"))
{
var Result = Ocr.Read(Input);
Var AllText = Result.Text
}
//PM> Install-Package IronOcr.Languages.Japanese
using IronOcr;
var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.Japanese;
using (var Input = new OcrInput(@"images\Japanese.png"))
{
var Result = Ocr.Read(Input);
Var AllText = Result.Text
}
'PM> Install-Package IronOcr.Languages.Japanese
Imports IronOcr
Private Ocr = New IronTesseract()
Ocr.Language = OcrLanguage.Japanese
Using Input = New OcrInput("images\Japanese.png")
Dim Result = Ocr.Read(Input)
Dim AllText As Var = Result.Text
End Using