Class OcrInput.Page
Represents a page within an OcrInput object. A page can correspond to one of the appended images, a page of a PDF or TIFF, or a frame of a GIF.
Inherited Members
Namespace: IronOcr
Assembly: IronOcr.dll
Syntax
public class Page : OcrInputPage
OcrInput.Page is a single page inside an OcrInput, the unit you work with when an input holds more than one image. A page can correspond to an appended image, a page of a PDF or TIFF, or a frame of a GIF. It is how you address an individual sheet of a multi-page or multi-frame document, before or after reading.
You obtain pages by enumerating the Pages collection of an OcrInput you have loaded and appended content to. Each page carries the members it inherits from OcrInputPage: Index for its position in the input, Width and Height for its pixel size, and HorizontalDPI and VerticalDPI for its resolution. ContentArea sets or reports the rectangle the engine should read, which is the member to use when you only need part of a page.
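As a minimal sketch of the enumeration described above, the following loads a PDF and prints each page's position, size, and resolution. It rests on several assumptions: the file name scan.pdf is a placeholder, LoadPdf is assumed to be the loading method (some releases name it differently), and Pages is assumed to be directly enumerable on the input. Check the OcrInput reference for your installed version before relying on these names.

```csharp
using System;
using IronOcr;

// "scan.pdf" is a hypothetical path used only for illustration.
using var input = new OcrInput();
input.LoadPdf("scan.pdf"); // assumed loader name; may differ between releases

// Assumes Pages is an enumerable collection of page objects, as described above.
foreach (var page in input.Pages)
{
    // Index, Width, Height, HorizontalDPI and VerticalDPI are the inherited members named in this section.
    Console.WriteLine(
        $"Page {page.Index}: {page.Width}x{page.Height} px, " +
        $"{page.HorizontalDPI}x{page.VerticalDPI} DPI");
}
```

Using var for the loop variable keeps the sketch independent of the exact element type the collection declares.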
For preparing or inspecting a page, ToBitmap renders it to an AnyBitmap, SaveAsImage writes it out (optionally with an AnyBitmap.ImageFormat), and GetTextRegions, FindTextRegion, and FindMultipleTextRegions locate text areas on the page. Set page-level options such as ContentArea before calling IronTesseract.Read on the parent input.
The input PDFs how-to covers reading multi-page documents, and the OCR a region how-to uses the page content area.