Search Results for

    Show / Hide Table of Contents

    Class PageIterator

    Inheritance
    System.Object
    DisposableBase
    PageIterator
    ResultIterator
    Implements
    System.IDisposable
    Inherited Members
    DisposableBase.Dispose()
    DisposableBase.VerifyNotDisposed()
    DisposableBase.IsDisposed
    Namespace: DynamicTesseract
    Assembly: IronOcr.dll
    Syntax
    public class PageIterator : DisposableBase

    Walking the layout of a recognized page, block by block down to individual symbols, runs through PageIterator. It moves a cursor over the regions Tesseract found and reports where each one sits, so code can map the structure of a page without decoding its text. When the text itself is needed too, ResultIterator derives from PageIterator and adds the recognized characters on top of the same traversal.

    A PageIterator is obtained from a recognized page rather than constructed directly, and because it inherits DisposableBase it should be disposed (or wrapped in using) once the walk finishes. Begin resets the cursor to the first element, and the two Next overloads advance it, one stepping a single PageIteratorLevel and the other moving to the next element at one level while bounded by a parent level. IsAtBeginningOf and IsAtFinalOf test the cursor's position within the hierarchy so a loop knows where a block, paragraph, or line starts and ends.

    At each stop, the iterator describes the current region. BlockType returns a PolyBlockType identifying the region's role, body text, heading, image, table, or separator, and TryGetBoundingBox fills a Rect with the region's pixel coordinates at a requested PageIteratorLevel, returning false when none is available. TryGetBaseline reports the text baseline the same way, and GetProperties reads font and size details for the current element. Pass the PageIteratorLevel that matches the granularity a task needs, from Block down to Symbol.

    using DynamicTesseract;
    
    void WalkBlocks(PageIterator iterator)
    {
        iterator.Begin();
        do
        {
            if (iterator.TryGetBoundingBox(PageIteratorLevel.Block, out Rect box))
                Console.WriteLine($"{iterator.BlockType} at {box}");
        }
        while (iterator.Next(PageIteratorLevel.Block));
    }

    The OCR results how-to walks through reading recognized structure, the results objects example shows the result model, and the region of an image how-to covers working with page coordinates.

    Fields

    handle

    Declaration
    protected HandleRef handle
    Field Value
    Type Description
    System.Runtime.InteropServices.HandleRef

    page

    Declaration
    protected readonly Page page
    Field Value
    Type Description
    Page

    Properties

    BlockType

    Declaration
    public PolyBlockType BlockType { get; }
    Property Value
    Type Description
    PolyBlockType

    Methods

    Begin()

    Declaration
    public void Begin()

    Dispose(Boolean)

    Declaration
    public override void Dispose(bool disposing)
    Parameters
    Type Name Description
    System.Boolean disposing
    Overrides
    DisposableBase.Dispose(Boolean)

    Finalize()

    Declaration
    protected override void Finalize()
    Overrides
    DisposableBase.Finalize()

    GetProperties()

    Declaration
    public ElementProperties GetProperties()
    Returns
    Type Description
    ElementProperties

    IsAtBeginningOf(PageIteratorLevel)

    Declaration
    public bool IsAtBeginningOf(PageIteratorLevel level)
    Parameters
    Type Name Description
    PageIteratorLevel level
    Returns
    Type Description
    System.Boolean

    IsAtFinalOf(PageIteratorLevel, PageIteratorLevel)

    Declaration
    public bool IsAtFinalOf(PageIteratorLevel level, PageIteratorLevel element)
    Parameters
    Type Name Description
    PageIteratorLevel level
    PageIteratorLevel element
    Returns
    Type Description
    System.Boolean

    Next(PageIteratorLevel)

    Declaration
    public bool Next(PageIteratorLevel level)
    Parameters
    Type Name Description
    PageIteratorLevel level
    Returns
    Type Description
    System.Boolean

    Next(PageIteratorLevel, PageIteratorLevel)

    Declaration
    public bool Next(PageIteratorLevel level, PageIteratorLevel element)
    Parameters
    Type Name Description
    PageIteratorLevel level
    PageIteratorLevel element
    Returns
    Type Description
    System.Boolean

    TryGetBaseline(PageIteratorLevel, out Rect)

    Declaration
    public bool TryGetBaseline(PageIteratorLevel level, out Rect bounds)
    Parameters
    Type Name Description
    PageIteratorLevel level
    Rect bounds
    Returns
    Type Description
    System.Boolean

    TryGetBoundingBox(PageIteratorLevel, out Rect)

    Declaration
    public bool TryGetBoundingBox(PageIteratorLevel level, out Rect bounds)
    Parameters
    Type Name Description
    PageIteratorLevel level
    Rect bounds
    Returns
    Type Description
    System.Boolean

    Implements

    System.IDisposable

    Inherited members

    Dispose()
    VerifyNotDisposed()
    IsDisposed
    ☀
    ☾
    Downloads
    • Download with Nuget
    • Start for Free
    In This Article
    Back to top
    Install with Nuget
    IronOCR_for_dotnet_log2o
    Blue key in circleGet started for FREE
    No credit card required
    Test in a live environment

    Test in production without watermarks.
    Works wherever you need it to.

    Fully-functional product

    Get 30 days of fully functional product.
    Have it up and running in minutes.

    24/5 technical support

    Full access to our support engineering team during your product trial

    Grey key in circleGet started for FREE
    The trial form was submitted successfully.
    Calendar in circleBook Free Live Demo
    No contact, no card details, no commitments Book a 30-minute, personal demo.
    Here's what to expect:

    A live demo of our product and its key features

    Get project specific feature recommendations

    All your questions are answered to make sure you have all the information you need. (No commitment whatsoever.)

    Grey key in circleBook Free Live Demo
    Your booking has been completed Check your e-mail for confirmation
    Support Team Member 6 related to The C# PDF Library Support Team Member 14 related to The C# PDF Library Support Team Member 4 related to The C# PDF Library Support Team Member 2 related to The C# PDF Library
    Online 24/5
    Need help? Our sales team would be glad to help you.
    Try the Enterprise Trial
    ironpdf_for_dotnet_log2o
    Key in blue circle
    Get your free 30-day Trial Key instantly.
    bullet_checkedNo credit card or account creation required
    Key in blue circle
    Get your free 30-day Trial Key instantly.
    Blue key in circleNo credit card or account creation required
    Green Check in orange circle
    The trial form was submitted successfully.
    badge_greencheck_in_yellowcircle
    Thank you for starting a trial

    Please check your email for the trial license key.

    If you don’t receive an email, please start a live chat or email support@ironsoftware.com

    Install with NuGet
    View Licensing
    • Logo Aetna
    • Logo NASA
    • Logo GE
    • Logo Porsche
    • Logo USDA
    • Logo Qatar
    Join Millions of Engineers who’ve tried IronOCR