Namespace IronOcr
Classes
Installation
A static class providing everything to need to configure the installation and licensing of IronOcr
IronTesseract
IronTesseract is a comprehensive managed class for performing Tesseract OCR in .Net applications.
IronTesseract natively supports Tesseract 3, 4 and 5 engines, and will automatically install all required binaries and language packs (tessdata) files.
OcrInput
OcrInput provides a robust class for preparing one or more Image Files, PDFs, System.Drawing Objects, Streams and Binary Image data for OCR. Instances of OcrInput can be read by the IronTesseract class.
We recognise that much of the quality of OCR results depends on preparing images to be read. This class allows developers to enhance their scanned documents provide faster, more accurate OCR results using filters such as: EnhanceResolution(Int32), DeNoise(), ToGrayScale(), Deskew(Int32, Boolean), Rotate(Double) and Sharpen().
Supports for multi-paged OCR input.
OcrInput.Page
A full page (one image) within an OcrInput object.
OcrResult
A full document object model (DOM) for results when IronTesseract reads an image or OcrInput.
Gives access to Text, Pages, Words, Paragraphs, Lines, Words, Characters, Images, Barcodes, Coordinates, Font information in granular detail.
OcrResult.Barcode
Represents a barcode discovered during OCR. See ReadBarCodes which must be set true to use this feature.
OcrResult.Block
Represents a block of text containing zero or more paragraphs.
OcrResult.Character
Represents a single character (char) of text.
OcrResult.Choice
OcrResult.Line
A line of text within an Paragraph
OcrResult.OcrFont
Detailed font information returned when using Tesseract OEM engine modes.
OcrResult.OcrResultElement
Lowest level of abstract OcrResult DOM element. All result objects extend from this including OcrResult.Barcode, OcrResult.Character, OcrResult.Word, OcrResult.Line, OcrResult.Paragraph ...
OcrResult.OcrResultTextElement
An abstract class representing discovered OCR DOM text elements such as OcrResult.Character, OcrResult.Word, OcrResult.Line, OcrResult.Paragraph ...
OcrResult.Page
Represents a single page within an OcrResult object.
OcrResult.Paragraph
Represents a paragraph of text with an OcrResult object.
OcrResult.Word
Represents a Word discovered during OCR.
TesseractConfiguration
A configuration object that fine-tunes Tesseract behavior at an Instance level. Gives access to every option available to tesseract command line or C++ API users.
Enums
Installation.LoggingModes
Different ways to Log developer notices and warnings
OcrLanguage
The natural language in which OCR documents will be read.
Any number of languages may be used simultaneously using the IronTessseract.AddSecondaryLanguage() Method.
To use a custom Tesseract language file (.traineddata) please see the IronTessseract.UseCustomLanguageFile() Method
OcrResult.BarcodeEncoding
Barcode Encoding Types.
OcrResult.TextFlow
Describes text direction
TesseractEngineMode
Allows the developer to choose the algorithm Tesseract will use for OCR. TesseractAndLstm is the recommended behavior for IronOCR.
TesseractPageSegmentationMode
Page segmentation mode defines how your text will be searched for by Tesseract.
TesseractVersion
Selects between different binary versions of Tesseract.