Finnish OCR in C# and .NET

This article was translated from English: Does it need improvement?
Translated
View the article in English
Inne wersje tego dokumentu:

IronOCR to komponent oprogramowania w C#, umożliwiający programistom .NET odczytywanie tekstu z obrazów i dokumentów PDF w 126 językach, w tym w języku fińskim.

Jest to zaawansowany fork Tesseracta, zbudowany wyłącznie dla deweloperów .NET i regularnie przewyższający inne silniki Tesseract pod względem szybkości i dokładności.

Zawartość IronOcr.Languages.Finnish

Ten pakiet zawiera 46 języków OCR dla .NET:

  • Finnish
  • FinnishBest
  • FinnishFast

Pobieranie

Finnish Language Pack [suomi]

Instalacja

Pierwszą rzeczą, którą musimy zrobić, jest zainstalowanie naszego pakietu OCR Fiński w Twoim projekcie .NET.

Install-Package IronOcr.Languages.Finnish

Przyklad kodu

Ten przykład kodu C# odczytuje tekst fiński z obrazu lub dokumentu PDF.

// Ensure you have installed the IronOcr.Languages.Finnish package before using this code.
// You can install it via NuGet package manager.

using IronOcr;

var Ocr = new IronTesseract(); // Create a new instance of the IronTesseract OCR engine
Ocr.Language = OcrLanguage.Finnish; // Set the language of the OCR engine to Finnish

// Using a using statement to ensure the OcrInput gets disposed of correctly
using (var Input = new OcrInput(@"images\Finnish.png")) // Path to the image/PDF containing Finnish text
{
    var Result = Ocr.Read(Input); // Perform OCR on the input file
    var AllText = Result.Text; // Extract the recognized text as a string

    // Output or process the extracted text as required
    Console.WriteLine(AllText); // Example of outputting the text to the console
}
// Ensure you have installed the IronOcr.Languages.Finnish package before using this code.
// You can install it via NuGet package manager.

using IronOcr;

var Ocr = new IronTesseract(); // Create a new instance of the IronTesseract OCR engine
Ocr.Language = OcrLanguage.Finnish; // Set the language of the OCR engine to Finnish

// Using a using statement to ensure the OcrInput gets disposed of correctly
using (var Input = new OcrInput(@"images\Finnish.png")) // Path to the image/PDF containing Finnish text
{
    var Result = Ocr.Read(Input); // Perform OCR on the input file
    var AllText = Result.Text; // Extract the recognized text as a string

    // Output or process the extracted text as required
    Console.WriteLine(AllText); // Example of outputting the text to the console
}
' Ensure you have installed the IronOcr.Languages.Finnish package before using this code.
' You can install it via NuGet package manager.

Imports IronOcr

Private Ocr = New IronTesseract() ' Create a new instance of the IronTesseract OCR engine
Ocr.Language = OcrLanguage.Finnish ' Set the language of the OCR engine to Finnish

' Using a using statement to ensure the OcrInput gets disposed of correctly
Using Input = New OcrInput("images\Finnish.png") ' Path to the image/PDF containing Finnish text
	Dim Result = Ocr.Read(Input) ' Perform OCR on the input file
	Dim AllText = Result.Text ' Extract the recognized text as a string

	' Output or process the extracted text as required
	Console.WriteLine(AllText) ' Example of outputting the text to the console
End Using
$vbLabelText   $csharpLabel