Class OcrInput

Stores OCR input data and allows OCR of PDF documents or any image format.

Also provides various image filter methods which can improve OCR accuracy.

Inheritance

System.Object

OcrInputBase

OcrInput

Implements

System.IDisposable

Inherited Members

OcrInputBase.Finalize()

OcrInputBase.Dispose()

OcrInputBase.SaveAsImages(String, AnyBitmap.ImageFormat)

OcrInputBase.HighlightTextAndSaveAsImages(IronTesseract, String, ResultHighlightType)

OcrInputBase.FindTextRegion()

OcrInputBase.FindMultipleTextRegions()

OcrInputBase.StampCropRectanglesAndSaveAs(Rectangle[], Color, String, AnyBitmap.ImageFormat)

OcrInputBase.StampCropRectangleAndSaveAs(Rectangle, Color, String, AnyBitmap.ImageFormat)

OcrInputBase.StampCropRectangle(OcrInput, Rectangle, Color)

OcrInputBase.StampCropRectangles(OcrInput, Rectangle[], Color)

OcrInputBase.ApplyMultipleFilters(OcrFilters, Double, Int32, Int32, Int32, Boolean, Nullable<Int32>)

OcrInputBase.WithTitle(String)

OcrInputBase.Scale(Int32, Boolean)

OcrInputBase.EnhanceResolution(Int32)

OcrInputBase.AdaptiveThreshold(Nullable<Single>)

OcrInputBase.Rotate(Double)

OcrInputBase.Binarize()

OcrInputBase.ToGrayScale()

OcrInputBase.Sharpen()

OcrInputBase.ReplaceColor(Color, Color, Int32)

OcrInputBase.SelectTextColor(Color, Int32)

OcrInputBase.SelectTextColors(IEnumerable<Color>, Int32)

OcrInputBase.Contrast(Single)

OcrInputBase.DeNoise(Boolean)

OcrInputBase.Despeckle(Boolean)

OcrInputBase.Invert(Boolean)

OcrInputBase.Erode(Boolean)

OcrInputBase.Open(Boolean)

OcrInputBase.Close(Boolean)

OcrInputBase.HoughTransformStraighten(Int32)

OcrInputBase.Deskew(Int32)

OcrInputBase.DetectPageOrientation(OrientationDetectionMode)

OcrInputBase.Scale(Int32, Int32, Boolean)

OcrInputBase.Dilate(Boolean)

OcrInputBase.GetPages()

OcrInputBase.RemovePages(IEnumerable<Int32>)

OcrInputBase.RemovePage(Int32)

OcrInputBase.PageCount()

OcrInputBase.TargetDPI

OcrInputBase.Title

Namespace: IronOcr

Assembly: IronOcr.dll

Syntax

public class OcrInput : OcrInputBase, IDisposable

Remarks

Also see OcrPdfInput and OcrImageInput

Constructors

OcrInput()

Creates an OcrInput Object which holds pages of rasterized input media (images, PDFs, TIFs, GIFs)

You may Load one or more media and apply image correction filters to loaded pages such as Deskew or Binarize.

Please use the `using` keyword:

using var input = new OcrInput();
input.LoadImage("input.png");
input.Deskew();
var result = new IronTesseract().Read(input);

Declaration

public OcrInput()

Methods

Add(IEnumerable<OcrInputPage>, Rectangle)

Please migrate to using: LoadPages(OcrInputPage[], Rectangle)

Declaration

public void Add(IEnumerable<OcrInputPage> imagesAsOcrInputPages, Rectangle ContentArea)

Parameters

Type	Name	Description
System.Collections.Generic.IEnumerable<OcrInputPage>	imagesAsOcrInputPages
IronSoftware.Drawing.Rectangle	ContentArea

Load(Object, Rectangle)

Method that will attempt to read an input from an object. It is recommended to use the explicit Load methods instead for more customization and stability. See LoadPdf and LoadImage.

Declaration

public void Load(object inputObject, Rectangle contentArea = null)

Parameters

Type	Name	Description
System.Object	inputObject	Object to be loaded
IronSoftware.Drawing.Rectangle	contentArea	Optional cropped area of the page to be added. Will be ignored if onlyPdfImages set to true.

LoadImage(AnyBitmap, Rectangle)

Loads image into this OcrInput object.

Accepts: PNG. JPG, BMP, TIFF, GIF, WEBP, and other common Image formats.

Declaration

public void LoadImage(AnyBitmap imageBitmap, Rectangle contentArea = null)

Parameters

Type	Name	Description
IronSoftware.Drawing.AnyBitmap	imageBitmap	Image as an AnyBitmap
IronSoftware.Drawing.Rectangle	contentArea	Optional cropped area of the page to be added.

Remarks

If multiple pages exist in the image such as TIFF/GIF frames, all will be added as their own OcrInputPage

LoadImage(Byte[], Rectangle)

Loads image into this OcrInput object.

Accepts: PNG. JPG, BMP, TIFF, GIF, WEBP, and other common Image formats.

Declaration

public void LoadImage(byte[] imageBytes, Rectangle contentArea = null)

Parameters

Type	Name	Description
System.Byte[]	imageBytes	Image as a byte array
IronSoftware.Drawing.Rectangle	contentArea	Optional cropped area of the page to be added.

Remarks

If multiple pages exist in the image such as TIFF/GIF frames, all will be added as their own OcrInputPage

LoadImage(Stream, Rectangle)

Loads image into this OcrInput object.

Accepts: PNG. JPG, BMP, TIFF, GIF, WEBP, and other common Image formats.

Declaration

public void LoadImage(Stream imageStream, Rectangle contentArea = null)

Parameters

Type	Name	Description
System.IO.Stream	imageStream	Image as a Stream
IronSoftware.Drawing.Rectangle	contentArea	Optional cropped area of the page to be added.

Remarks

If multiple pages exist in the image such as TIFF/GIF frames, all will be added as their own OcrInputPage

LoadImage(String, Rectangle)

Loads image into this OcrInput object.

Accepts: PNG. JPG, BMP, TIFF, GIF, WEBP, and other common Image formats.

Declaration

public void LoadImage(string imageFilePath, Rectangle contentArea = null)

Parameters

Type	Name	Description
System.String	imageFilePath	File Path of the Image
IronSoftware.Drawing.Rectangle	contentArea	Optional cropped area of the page to be added.

Remarks

If multiple pages exist in the image such as TIFF/GIF frames, all will be added as their own OcrInputPage