How to Read Specialized Documents

Chaknith Bin

Updated:July 28, 2025

Accurately reading specific documents such as standard text documents, license plates, passports, and photos with a general singular method is very hard. These challenges stem from the diverse formats, layouts, and content of each document type, as well as variations in image quality, distortion, and specialized content. Additionally, achieving contextual understanding and balancing performance and efficiency becomes more complex with a broader scope of document types.

IronOCR introduces specific methods for performing OCR on particular documents such as standard text documents, license plates, passports, and photos to achieve optimal accuracy and performance.

Get started with IronOCR

Start using IronOCR in your project today with a free trial.

First Step:

How to Read Specific Documents

Download a C# library to read license plates, passports, and photos
Prepare the image and PDF document for OCR
Set the ReadLicensePlate method to read a license plate
Set the ReadPassport method to retrieve information from a passport
Leverage the ReadPhoto and ReadScreenShot methods to read images that contain hard-to-read text

About The Package

The methods ReadLicensePlate, ReadPassport, ReadPhoto, and ReadScreenShot are extension methods to the base IronOCR package and require the IronOcr.Extensions.AdvancedScan package to be installed. Currently, this extension is only available on Windows.

The methods work with OCR engine configurations such as blacklist and whitelist. Multiple languages, including Chinese, Japanese, Korean, and LatinAlphabet, are supported in all methods except for the ReadPassport method. Please note that each language requires an additional language package, IronOcr.Languages.

Using advanced scan on .NET Framework requires the project to run on x64 architecture. Navigate to the project configuration and uncheck the "Prefer 32-bit" option to achieve this. Learn more in the following troubleshooting guide: "Advanced Scan on .NET Framework."

Read Document Example

The ReadDocument method is a robust document reading method that specializes in scanned documents or photos of paper documents containing a lot of text. The PageSegmentationMode configuration is very important in reading text documents with different layouts.

For example, the SingleBlock and SparseText types could retrieve much information from table layout. This is because SingleBlock assumes that the text stays as a block, whereas SparseText assumes that the text is scattered throughout the document.

:path=/static-assets/ocr/content-code-examples/how-to/read-specific-document-document.cs

using IronOcr;
using System;

// Instantiate OCR engine
var ocr = new IronTesseract();

// Configure OCR engine
ocr.Configuration.PageSegmentationMode = TesseractPageSegmentationMode.SingleBlock;

using var input = new OcrInput();

input.LoadPdf("Five.pdf");

// Perform OCR
OcrResult result = ocr.ReadDocument(input);

Console.WriteLine(result.Text);

Imports IronOcr
Imports System

' Instantiate OCR engine
Private ocr = New IronTesseract()

' Configure OCR engine
ocr.Configuration.PageSegmentationMode = TesseractPageSegmentationMode.SingleBlock

Dim input = New OcrInput()

input.LoadPdf("Five.pdf")

' Perform OCR
Dim result As OcrResult = ocr.ReadDocument(input)

Console.WriteLine(result.Text)

$vbLabelText $csharpLabel

Read License Plate Example

The ReadLicensePlate method is optimized for reading license plates from photos. The special information returned from this method is the Licenseplate property, which contains the information of the license plate location in the provided document.

:path=/static-assets/ocr/content-code-examples/how-to/read-specific-document-license-plate.cs

using IronOcr;
using IronSoftware.Drawing;
using System;

// Instantiate OCR engine
var ocr = new IronTesseract();

using var inputLicensePlate = new OcrInput();

inputLicensePlate.LoadImage("LicensePlate.jpeg");

// Perform OCR
OcrLicensePlateResult result = ocr.ReadLicensePlate(inputLicensePlate);

// Retrieve license plate coordinates
Rectangle rectangle = result.Licenseplate;

// Retrieve license plate value
string output = result.Text;

Imports IronOcr
Imports IronSoftware.Drawing
Imports System

' Instantiate OCR engine
Private ocr = New IronTesseract()

Private inputLicensePlate = New OcrInput()

inputLicensePlate.LoadImage("LicensePlate.jpeg")

' Perform OCR
Dim result As OcrLicensePlateResult = ocr.ReadLicensePlate(inputLicensePlate)

' Retrieve license plate coordinates
Dim rectangle As Rectangle = result.Licenseplate

' Retrieve license plate value
Dim output As String = result.Text

$vbLabelText $csharpLabel

Read Passport Example

The ReadPassport method is optimized for reading and extracts passport information from passport photos by scanning the machine-readable zone (MRZ) contents. An MRZ is a specially defined zone in official documents such as passports, ID cards, and visas. The MRZ typically contains essential personal information, such as the holder’s name, date of birth, nationality, and document number. Currently, this method only supports the English language.

:path=/static-assets/ocr/content-code-examples/how-to/read-specific-document-passport.cs

using IronOcr;
using System;

// Instantiate OCR engine
var ocr = new IronTesseract();

using var inputPassport = new OcrInput();

inputPassport.LoadImage("Passport.jpg");

// Perform OCR
OcrPassportResult result = ocr.ReadPassport(inputPassport);

// Output passport information
Console.WriteLine(result.PassportInfo.GivenNames);
Console.WriteLine(result.PassportInfo.Country);
Console.WriteLine(result.PassportInfo.PassportNumber);
Console.WriteLine(result.PassportInfo.Surname);
Console.WriteLine(result.PassportInfo.DateOfBirth);
Console.WriteLine(result.PassportInfo.DateOfExpiry);

Imports IronOcr
Imports System

' Instantiate OCR engine
Private ocr = New IronTesseract()

Private inputPassport = New OcrInput()

inputPassport.LoadImage("Passport.jpg")

' Perform OCR
Dim result As OcrPassportResult = ocr.ReadPassport(inputPassport)

' Output passport information
Console.WriteLine(result.PassportInfo.GivenNames)
Console.WriteLine(result.PassportInfo.Country)
Console.WriteLine(result.PassportInfo.PassportNumber)
Console.WriteLine(result.PassportInfo.Surname)
Console.WriteLine(result.PassportInfo.DateOfBirth)
Console.WriteLine(result.PassportInfo.DateOfExpiry)

$vbLabelText $csharpLabel

Result

Please make sure that the document only contains the passport image. Any header and footer text could confuse the method and result in an unexpected output.

Read Photo Example

The ReadPhoto method is optimized for reading images that contain hard-to-read text. This method returns the TextRegions property, which contains useful information about the detected text, such as Region, TextInRegion, and FrameNumber.

:path=/static-assets/ocr/content-code-examples/how-to/read-specific-document-photo.cs

using IronOcr;
using IronSoftware.Drawing;

// Instantiate OCR engine
var ocr = new IronTesseract();

using var inputPhoto = new OcrInput();
inputPhoto.LoadImageFrame("photo.tif", 2);

// Perform OCR
OcrPhotoResult result = ocr.ReadPhoto(inputPhoto);

// index number refer to region order in the page
int number = result.TextRegions[0].PageNumber;
string textinregion = result.TextRegions[0].TextInRegion;
Rectangle region = result.TextRegions[0].Region;

Imports IronOcr
Imports IronSoftware.Drawing

' Instantiate OCR engine
Private ocr = New IronTesseract()

Private inputPhoto = New OcrInput()
inputPhoto.LoadImageFrame("photo.tif", 2)

' Perform OCR
Dim result As OcrPhotoResult = ocr.ReadPhoto(inputPhoto)

' index number refer to region order in the page
Dim number As Integer = result.TextRegions(0).PageNumber
Dim textinregion As String = result.TextRegions(0).TextInRegion
Dim region As Rectangle = result.TextRegions(0).Region

$vbLabelText $csharpLabel

Read Screenshot Example

The ReadScreenShot method is optimized for reading screenshots that contain hard-to-read text. Similar to the ReadPhoto method, it also returns the TextRegions property.

:path=/static-assets/ocr/content-code-examples/how-to/read-specific-document-screenshot.cs
}

using IronOcr;
using System;
using System.Linq;

// Instantiate OCR engine
var ocr = new IronTesseract();

using var inputScreenshot = new OcrInput();
inputScreenshot.LoadImage("screenshot.png");

// Perform OCR
OcrPhotoResult result = ocr.ReadScreenShot(inputScreenshot);

// Output screenshoot information
Console.WriteLine(result.Text);
Console.WriteLine(result.TextRegions.First().Region.X);
Console.WriteLine(result.TextRegions.Last().Region.Width);
Console.WriteLine(result.Confidence);
}

Imports IronOcr
Imports System
Imports System.Linq

' Instantiate OCR engine
Private ocr = New IronTesseract()

Private inputScreenshot = New OcrInput()
inputScreenshot.LoadImage("screenshot.png")

' Perform OCR
Dim result As OcrPhotoResult = ocr.ReadScreenShot(inputScreenshot)

' Output screenshoot information
Console.WriteLine(result.Text)
Console.WriteLine(result.TextRegions.First().Region.X)
Console.WriteLine(result.TextRegions.Last().Region.Width)
Console.WriteLine(result.Confidence)
}

$vbLabelText $csharpLabel

Frequently Asked Questions

What is this OCR library?

IronOCR is a C# library used for performing Optical Character Recognition (OCR) on various document types such as text documents, license plates, passports, and photos.

How can I start using this library for reading documents?

To start using IronOCR, download the library from NuGet, prepare your images or PDF documents for OCR, and use specific methods like ReadLicensePlate, ReadPassport, ReadPhoto, or ReadScreenShot to perform OCR on your documents.

What additional package is required for using advanced scanning features?

The IronOcr.Extensions.AdvancedScan package is required for using advanced scanning features, and it is currently available only on Windows.

Which languages are supported by this OCR tool?

IronOCR supports multiple languages including Chinese, Japanese, Korean, and LatinAlphabet. However, the ReadPassport method currently only supports the English language.

How do I configure the library to read documents with different layouts?

You can configure IronOCR to read documents with different layouts by setting the PageSegmentationMode in the OCR configuration. Options like SingleBlock and SparseText can help in retrieving information from table layouts.

What is the method for reading license plates used for?

The ReadLicensePlate method is specifically optimized for reading license plates from photos, and it returns details such as the license plate text and its location.

How does the method for reading passports work?

The ReadPassport method extracts information from passport photos by scanning the machine-readable zone (MRZ), which contains essential personal information like name, date of birth, and document number.

What is the purpose of the method for reading photos?

The ReadPhoto method is designed to read images that contain hard-to-read text. It returns the TextRegions property, which includes information about detected text and its regions.

Can this tool read text from screenshots?

Yes, IronOCR can read text from screenshots using the ReadScreenShot method, which is optimized for processing text in screenshots and provides the TextRegions property.

What should I do if I encounter issues with advanced scan on .NET Framework?

If you encounter issues with advanced scan on .NET Framework, ensure your project is configured to run on x64 architecture by unchecking the 'Prefer 32-bit' option in project settings.

Chaknith Bin

Chat with engineering team now

Software Engineer

Chaknith works on IronXL and IronBarcode. He has deep expertise in C# and .NET, helping improve the software and support customers. His insights from user interactions contribute to better products, documentation, and overall experience.

On This Page

How to Read Specialized Documents

Get started with IronOCR

How to Read Specific Documents

About The Package

Read Document Example

Read License Plate Example

Read Passport Example

Result

Read Photo Example

Read Screenshot Example

Frequently Asked Questions

What is this OCR library?

How can I start using this library for reading documents?

What additional package is required for using advanced scanning features?

Which languages are supported by this OCR tool?

How do I configure the library to read documents with different layouts?

What is the method for reading license plates used for?

How does the method for reading passports work?

What is the purpose of the method for reading photos?

Can this tool read text from screenshots?

What should I do if I encounter issues with advanced scan on .NET Framework?

Ready to Get Started?

On This Page

How to Read Specialized Documents

Get started with IronOCR

How to Read Specific Documents

About The Package

Read Document Example

Read License Plate Example

Read Passport Example

Result

Read Photo Example

Read Screenshot Example

Frequently Asked Questions

What is this OCR library?

How can I start using this library for reading documents?

What additional package is required for using advanced scanning features?

Which languages are supported by this OCR tool?

How do I configure the library to read documents with different layouts?

What is the method for reading license plates used for?

How does the method for reading passports work?

What is the purpose of the method for reading photos?

Can this tool read text from screenshots?

What should I do if I encounter issues with advanced scan on .NET Framework?

Ready to Get Started?

Get your FREE

Next step: Start free 30-day Trial

Next step: Start free 30-day Trial

Trusted by Over 2 Million Engineers Worldwide