Search Results for

    Show / Hide Table of Contents

    Namespace IronOcr

    Classes

    Installation

    A static class providing everything to need to configure the installation and licensing of IronOcr

    IronTesseract

    IronTesseract is a comprehensive managed class for performing Tesseract OCR in .Net applications.

    IronTesseract natively supports Tesseract 3, 4 and 5 engines, and will automatically install all required binaries and language packs (tessdata) files.

    OcrInput

    OcrInput provides a robust class for preparing one or more Image Files, PDFs, System.Drawing Objects, Streams and Binary Image data for OCR. Instances of OcrInput can be read by the IronTesseract class.

    We recognise that much of the quality of OCR results depends on preparing images to be read. This class allows developers to enhance their scanned documents provide faster, more accurate OCR results using filters such as: EnhanceResolution(Int32), DeNoise(), ToGrayScale(), Deskew(Int32, Boolean), Rotate(Double) and Sharpen().

    Supports for multi-paged OCR input.

    OcrInput.Page

    A full page (one image) within an OcrInput object.

    OcrResult

    A full document object model (DOM) for results when IronTesseract reads an image or OcrInput.

    Gives access to Text, Pages, Words, Paragraphs, Lines, Words, Characters, Images, Barcodes, Coordinates, Font information in granular detail.

    OcrResult.Barcode

    Represents a barcode discovered during OCR. See ReadBarCodes which must be set true to use this feature.

    OcrResult.Block

    Represents a block of text containing zero or more paragraphs.

    OcrResult.Character

    Represents a single character (char) of text.

    OcrResult.Choice

    OcrResult.Line

    A line of text within an Paragraph

    OcrResult.OcrFont

    Detailed font information returned when using Tesseract OEM engine modes.

    OcrResult.OcrResultElement

    Lowest level of abstract OcrResult DOM element. All result objects extend from this including OcrResult.Barcode, OcrResult.Character, OcrResult.Word, OcrResult.Line, OcrResult.Paragraph ...

    OcrResult.OcrResultTextElement

    An abstract class representing discovered OCR DOM text elements such as OcrResult.Character, OcrResult.Word, OcrResult.Line, OcrResult.Paragraph ...

    OcrResult.Page

    Represents a single page within an OcrResult object.

    OcrResult.Paragraph

    Represents a paragraph of text with an OcrResult object.

    OcrResult.Word

    Represents a Word discovered during OCR.

    TesseractConfiguration

    A configuration object that fine-tunes Tesseract behavior at an Instance level. Gives access to every option available to tesseract command line or C++ API users.

    Enums

    Installation.LoggingModes

    Different ways to Log developer notices and warnings

    OcrLanguage

    The natural language in which OCR documents will be read.

    Any number of languages may be used simultaneously using the IronTessseract.AddSecondaryLanguage() Method.

    To use a custom Tesseract language file (.traineddata) please see the IronTessseract.UseCustomLanguageFile() Method

    OcrResult.BarcodeEncoding

    Barcode Encoding Types.

    OcrResult.TextFlow

    Describes text direction

    TesseractEngineMode

    Allows the developer to choose the algorithm Tesseract will use for OCR. TesseractAndLstm is the recommended behavior for IronOCR.

    TesseractPageSegmentationMode

    Page segmentation mode defines how your text will be searched for by Tesseract.

    TesseractVersion

    Selects between different binary versions of Tesseract.

    ☀
    ☾
    In This Article
    Back to top
    Install with Nuget