using IronOcr; using IronSoftware.Drawing; // We can delve deep into OCR results as an object model of // Pages, Barcodes, Paragraphs, Lines, Words and Characters // This allows us to explore, export and draw OCR content using other APIs/ var ocrTesseract = new IronTesseract(); ocrTesseract.Configuration.ReadBarCodes = true; using var ocrInput = new OcrInput(); var pages = new int[] { 1, 2 }; ocrInput.LoadImageFrames("example.tiff", pages); OcrResult ocrResult = ocrTesseract.Read(ocrInput); foreach (var page in ocrResult.Pages) { // Page object int PageNumber = page.PageNumber; string PageText = page.Text; int PageWordCount = page.WordCount; // null if we dont set Ocr.Configuration.ReadBarCodes = true; OcrResult.Barcode[] Barcodes = page.Barcodes; AnyBitmap PageImage = page.ToBitmap(ocrInput); double PageWidth = page.Width; double PageHeight = page.Height; double PageRotation = page.Rotation; // angular correction in degrees from OcrInput.Deskew() foreach (var paragraph in page.Paragraphs) { // Pages -> Paragraphs int ParagraphNumber = paragraph.ParagraphNumber; string ParagraphText = paragraph.Text; AnyBitmap ParagraphImage = paragraph.ToBitmap(ocrInput); int ParagraphX_location = paragraph.X; int ParagraphY_location = paragraph.Y; int ParagraphWidth = paragraph.Width; int ParagraphHeight = paragraph.Height; double ParagraphOcrAccuracy = paragraph.Confidence; OcrResult.TextFlow paragrapthText_direction = paragraph.TextDirection; foreach (var line in paragraph.Lines) { // Pages -> Paragraphs -> Lines int LineNumber = line.LineNumber; string LineText = line.Text; AnyBitmap LineImage = line.ToBitmap(ocrInput); int LineX_location = line.X; int LineY_location = line.Y; int LineWidth = line.Width; int LineHeight = line.Height; double LineOcrAccuracy = line.Confidence; double LineSkew = line.BaselineAngle; double LineOffset = line.BaselineOffset; foreach (var word in line.Words) { // Pages -> Paragraphs -> Lines -> Words int WordNumber = word.WordNumber; string WordText = word.Text; AnyBitmap WordImage = word.ToBitmap(ocrInput); int WordX_location = word.X; int WordY_location = word.Y; int WordWidth = word.Width; int WordHeight = word.Height; double WordOcrAccuracy = word.Confidence; foreach (var character in word.Characters) { // Pages -> Paragraphs -> Lines -> Words -> Characters int CharacterNumber = character.CharacterNumber; string CharacterText = character.Text; AnyBitmap CharacterImage = character.ToBitmap(ocrInput); int CharacterX_location = character.X; int CharacterY_location = character.Y; int CharacterWidth = character.Width; int CharacterHeight = character.Height; double CharacterOcrAccuracy = character.Confidence; // Output alternative symbols choices and their probability. // Very useful for spellchecking OcrResult.Choice[] Choices = character.Choices; } } } } }

OCR TOOLS

How to Scan Writing into Text (Beginner Tutorial)

Published November 29, 2023

Introduction

In the world of digital information, the ability to convert handwriting or printed text from scanned documents into editable and searchable formats has become paramount. Optical Character Recognition (OCR) technology has been a key player in this process, enabling the extraction of textual information from images.

In this article, we'll explore the fundamentals of Scan Writing to Text using Tesseract, an open-source OCR engine, and later introduce IronOCR as a powerful alternative with advanced capabilities for converting handwriting, digital text, and scanned document to editable text accompanied by a code example.

Understanding OCR Technology

OCR technology utilizes sophisticated algorithms to recognize and interpret the patterns of printed or handwritten characters within an image. It essentially bridges the gap between physical and digital worlds, allowing us to capture and digitize text from a variety of sources, including scanned documents, scanned handwriting, PDFs, scanned image files and even recognize handwriting.

Tesseract: An Open-Source OCR Solution

Understanding Tesseract

Tesseract, developed by Google, is an open-source OCR engine widely used for converting various types of scanned documents including handwritten text, scanned images, and PDF documents into machine-readable editable text. It supports multiple languages and has gained popularity for its accuracy and versatility. Let's delve into the key features and steps involved in using Tesseract for Scan Writing to Text.

Key Features of Tesseract

Versatility: Tesseract supports a wide range of input formats, making it suitable for diverse applications, including scanned documents, images, and PDFs.
Language Support: Tesseract is designed to recognize text in multiple languages, making it a flexible solution for international users and developers working with documents in different linguistic contexts.
Open-Source Community: The open-source nature of Tesseract has led to a robust community of developers contributing to its improvement. Regular updates and enhancements ensure that Tesseract remains a competitive OCR solution.
OCR Accuracy: Tesseract is known for its accuracy in recognizing text, even in complex scenarios with varying fonts, styles, and image qualities.

Steps to Use Tesseract

Using Tesseract OCR for converting handwritten notes, or deciphering illegible handwriting in Windows involves a few steps. Here's a basic guide:

Install Tesseract OCR:
- Download the Tesseract installer for Windows from the official GitHub UB Mannheim repository: Tesseract OCR exe.
- Run the installer and follow the on-screen instructions to complete the installation.
- Select the location and remember the path of installation as it will be used later to set in Path variable.
Set Up Environment Variables:
- Add the Tesseract installation directory to the system's PATH environment variable. This ensures that the Tesseract executable can be accessed from any command prompt window.
Command-Line Usage:
- Open a command prompt window and navigate to the directory containing your images or scanned handwritten documents.
- Use the following command to perform OCR on an image and output the result to a text file:
```
tesseract input_image.png output_text.txt
```
```
tesseract input_image.png output_text.txt
```
```
'INSTANT VB TODO TASK: The following line uses invalid syntax:
'tesseract input_image.png output_text.txt
```
VB C#
Replace input_image.png with the name of your image file and output_text.txt with the desired name for the output text file.
Post-Processing: Depending on the requirements, post-processing steps may be needed to refine the extracted text, such as removing unnecessary characters or formatting.

While Tesseract is a powerful tool, developers often seek alternatives that offer additional features, customization options, and ease of integration into their applications. This is where IronOCR comes into play.

Introducing IronOCR: Elevating Scan Writing into Text

Understanding IronOCR

IronOCR is a .NET OCR library that goes beyond the capabilities of Tesseract, offering advanced features and customization options for developers. Whether working with scanned documents, images, or scanned PDFs, IronOCR provides a robust solution for accurate text extraction. Let's explore the key features of IronOCR and how it can enhance the Scan Writing to Text process.

Key Features of IronOCR

Accuracy: IronOCR leverages state-of-the-art OCR algorithms, ensuring high accuracy in text recognition. It excels in challenging scenarios, such as low-quality images or complex fonts.
Versatility: Supporting various input formats, including images and PDF files, IronOCR adapts to diverse use cases, making it a versatile choice for developers.
Language Support: IronOCR caters to a global audience by supporting multiple international languages, ensuring effective OCR processing for documents in different linguistic contexts.
Ease of Integration: One of IronOCR's standout features is its seamless integration into .NET applications. Developers can easily incorporate IronOCR into their projects, allowing for efficient implementation of OCR functionality.

IronOCR: A Code Example

Let's have a simple IronTesseract 5 code snippet for using IronOCR in a .NET application:

using IronOcr;

class Program
{
    static void Main()
    {
        var ocrTesseract = new IronTesseract();

    // This is done by default and can be omitted:
    // ocrTesseract.Configuration.TesseractVersion = TesseractVersion.Tesseract5;

    using (var ocrInput = new OcrInput(@"images\image.png"))
    {
        var ocrResult = ocrTesseract.Read(ocrInput);
        Console.WriteLine(ocrResult.Text);
    }
    }
}

using IronOcr;

class Program
{
    static void Main()
    {
        var ocrTesseract = new IronTesseract();

    // This is done by default and can be omitted:
    // ocrTesseract.Configuration.TesseractVersion = TesseractVersion.Tesseract5;

    using (var ocrInput = new OcrInput(@"images\image.png"))
    {
        var ocrResult = ocrTesseract.Read(ocrInput);
        Console.WriteLine(ocrResult.Text);
    }
    }
}

Imports IronOcr

Friend Class Program
	Shared Sub Main()
		Dim ocrTesseract = New IronTesseract()

	' This is done by default and can be omitted:
	' ocrTesseract.Configuration.TesseractVersion = TesseractVersion.Tesseract5;

	Using ocrInput As New OcrInput("images\image.png")
		Dim ocrResult = ocrTesseract.Read(ocrInput)
		Console.WriteLine(ocrResult.Text)
	End Using
	End Sub
End Class

VB C#

In this example, IronOCR provides a more straightforward and object-oriented approach, allowing developers to read printed or handwritten text directly from the image with the efficient IronTesseract 5 OCR engine. For more detailed information, please visit the documentation page.

Conclusion

While Tesseract remains a robust open-source OCR engine, IronOCR offers enhanced features, customization options, and ease of integration for developers working within the .NET ecosystem. The choice between Tesseract and IronOCR depends on the specific requirements of the project and the desired level of control over the OCR process. As the demand for accurate text extraction from scanned documents continues to grow, OCR tools like IronOCR play a pivotal role in shaping the future of information accessibility and digital document management.

IronOCR provides a free trial for users to experience its advanced OCR capabilities, while a commercial license is required for professional and commercial use. To explore the full potential of IronOCR, download the software library directly from the official website.

< PREVIOUS
Arabic Text Recognition (Free & Online Tools)

NEXT >
How to Build an OCR in Python