OcrInput Class

Introducing a better code structure with OcrInput which allows more developer control of OCR.

This structure is the building block on which high-performing OCR applications for C# can be built.

Features include automatic or explicit image optimizations, managed safety from memory leaks, multithreading, multipage documents, OCR to PDF, OCR to HTML, and support for OCR PDF and multipage TIFF files.

Below is an example of how to use the OcrInput class effectively:

// C# Example demonstrating OcrInput usage

using System;
using SomeOcrLibrary; // Replace with the actual OCR library you're using

namespace OcrExample
{
    class Program
    {
        static void Main(string[] args)
        {
            // Initialize the OCR input object
            OcrInput ocrInput = new OcrInput();

            // Add an image to be processed by OCR
            // Replace with the path to your image file
            ocrInput.AddImage("path/to/your/image.jpg"); 

            // Set options for OCR processing if needed (e.g. language, optimizations)
            ocrInput.SetLanguage("eng"); // Setting the language for OCR

            // Adding error handling to manage possible exceptions
            try
            {
                // Process the image(s) with OCR
                string resultText = ocrInput.Process();

                // Output the recognized text
                Console.WriteLine("OCR Result: ");
                Console.WriteLine(resultText);
            }
            catch (Exception ex)
            {
                // Handle exceptions such as file not found or OCR errors
                Console.WriteLine($"An error occurred: {ex.Message}");
            }
        }
    }
}
// C# Example demonstrating OcrInput usage

using System;
using SomeOcrLibrary; // Replace with the actual OCR library you're using

namespace OcrExample
{
    class Program
    {
        static void Main(string[] args)
        {
            // Initialize the OCR input object
            OcrInput ocrInput = new OcrInput();

            // Add an image to be processed by OCR
            // Replace with the path to your image file
            ocrInput.AddImage("path/to/your/image.jpg"); 

            // Set options for OCR processing if needed (e.g. language, optimizations)
            ocrInput.SetLanguage("eng"); // Setting the language for OCR

            // Adding error handling to manage possible exceptions
            try
            {
                // Process the image(s) with OCR
                string resultText = ocrInput.Process();

                // Output the recognized text
                Console.WriteLine("OCR Result: ");
                Console.WriteLine(resultText);
            }
            catch (Exception ex)
            {
                // Handle exceptions such as file not found or OCR errors
                Console.WriteLine($"An error occurred: {ex.Message}");
            }
        }
    }
}
' C# Example demonstrating OcrInput usage

Imports System
Imports SomeOcrLibrary ' Replace with the actual OCR library you're using

Namespace OcrExample
	Friend Class Program
		Shared Sub Main(ByVal args() As String)
			' Initialize the OCR input object
			Dim ocrInput As New OcrInput()

			' Add an image to be processed by OCR
			' Replace with the path to your image file
			ocrInput.AddImage("path/to/your/image.jpg")

			' Set options for OCR processing if needed (e.g. language, optimizations)
			ocrInput.SetLanguage("eng") ' Setting the language for OCR

			' Adding error handling to manage possible exceptions
			Try
				' Process the image(s) with OCR
				Dim resultText As String = ocrInput.Process()

				' Output the recognized text
				Console.WriteLine("OCR Result: ")
				Console.WriteLine(resultText)
			Catch ex As Exception
				' Handle exceptions such as file not found or OCR errors
				Console.WriteLine($"An error occurred: {ex.Message}")
			End Try
		End Sub
	End Class
End Namespace
$vbLabelText   $csharpLabel

Key Concepts Explained:

  • OcrInput Initialization: Instantiates an OcrInput object that handles input images for OCR processing.
  • AddImage Method: Adds an image file to the OcrInput object to be processed. This method can be called multiple times to handle multipage documents.
  • SetLanguage Method: Configures the language to be used by the OCR engine for processing the text.
  • Process Method: Executes the OCR operation on the added images and returns the extracted text.
  • Error Handling: Protects the application from unexpected issues such as missing files or OCR failures, providing clarity on what errors occurred.

Make sure to replace "SomeOcrLibrary" with the actual namespace of your OCR library. The sample code is a general guide and may need adaptations based on the specific library or framework you're using.