C#と.NETにおける漢簡アルファベットOCR

カーティス・チャウ

更新日:2025年10月27日

Translated

View the article in English

他126の言語

IronOCR は、.NET コーダーが簡体字漢字を含む 126 の言語で画像や PDF ドキュメントからテキストを読み取ることができる C# ソフトウェアコンポーネントです。

これはTesseractの高度なフォークであり、.NET開発者専用に構築され、速度と精度の両方で他のTesseractエンジンを定期的に上回ります。

IronOcr.Languages.Han の内容

このパッケージには、.NET 用の 400 の OCR 言語が含まれています。

簡体字アルファベット
簡体字アルファベットベスト
簡体字アルファベット高速
簡体字縦書きアルファベット
簡体字漢字垂直アルファベットベスト
簡体字漢字垂直アルファベットファスト
繁体字漢字アルファベット
繁体字漢字アルファベットベスト
繁体字漢字アルファベットファスト
繁体字漢字垂直アルファベット
漢数字縦書きアルファベットベスト
ハングル繁体字縦書きアルファベット高速

ダウンロード

簡体字漢字言語パック [Samhan]

Download as Zip
NuGetでインストール

インストール

最初に、Han Simplified Alphabet OCR パッケージを .NET プロジェクトにインストールする必要があります。

パッケージマネージャーコンソールで次のコマンドを実行します。

Install-Package IronOCR.Languages.Han

Code Example

この C# コード例は、画像または PDF ドキュメントから Han Simplified Alphabet のテキストを読み取ります。

// Reference the IronOcr library
using IronOcr;

class Program
{
    static void Main()
    {
        // Create an IronTesseract OCR engine
        var Ocr = new IronTesseract();

        // Load the Han language for OCR processing
        Ocr.Language = OcrLanguage.Han;

        // Using a 'using' statement for resource management
        using (var Input = new OcrInput(@"images\Han.png"))
        {
            // Process the image to extract text
            var Result = Ocr.Read(Input);

            // Retrieve and display the extracted text
            string AllText = Result.Text;
            System.Console.WriteLine(AllText);
        }
    }
}

// Reference the IronOcr library
using IronOcr;

class Program
{
    static void Main()
    {
        // Create an IronTesseract OCR engine
        var Ocr = new IronTesseract();

        // Load the Han language for OCR processing
        Ocr.Language = OcrLanguage.Han;

        // Using a 'using' statement for resource management
        using (var Input = new OcrInput(@"images\Han.png"))
        {
            // Process the image to extract text
            var Result = Ocr.Read(Input);

            // Retrieve and display the extracted text
            string AllText = Result.Text;
            System.Console.WriteLine(AllText);
        }
    }
}

' Reference the IronOcr library
Imports IronOcr

Friend Class Program
	Shared Sub Main()
		' Create an IronTesseract OCR engine
		Dim Ocr = New IronTesseract()

		' Load the Han language for OCR processing
		Ocr.Language = OcrLanguage.Han

		' Using a 'using' statement for resource management
		Using Input = New OcrInput("images\Han.png")
			' Process the image to extract text
			Dim Result = Ocr.Read(Input)

			' Retrieve and display the extracted text
			Dim AllText As String = Result.Text
			System.Console.WriteLine(AllText)
		End Using
	End Sub
End Class

$vbLabelText $csharpLabel

説明

まず、IronOcr ライブラリを参照して、OCR 機能を使用します。
画像/PDF ドキュメントを処理するためにIronTesseractのインスタンスが作成されます。
OCR プロセスの言語は、 Ocr.Languageを使用してHanに設定されます。
画像はOcrInputを使用して読み込まれ、 Ocr.Read()を呼び出して処理されます。
OCR プロセスの結果はResult.Textに保存され、ドキュメントから抽出されたテキストが含まれます。
最後に、テキストをコンソールに出力します。

適切な using ディレクティブを持ち、特にファイルストリームのような未管理リソースを扱う際に、using ステートメントでリソースを効率的に管理することを確認してください。

顧客ハイライト:

開発者スポットライト:

ウェビナー:

無料30日間のトライアルを開始

このページでは

C#と.NETにおける漢簡アルファベットOCR

IronOcr.Languages.Han の内容

ダウンロード

インストール

Code Example

説明

無料30日間のトライアルを開始

このページでは

C#と.NETにおける漢簡アルファベットOCR

IronOcr.Languages.Han の内容

ダウンロード

インストール

Code Example

説明

Next step: Start free 30-day Trial

Next step: Start free 30-day Trial

世界中の数百万人のエンジニアから信頼されています。