Reading Identity Documents with IronOCR

By Chaknith Bin

April 7, 2022

Updated June 22, 2025

Identity documents are, by design, very difficult for OCR engines to read because of the anti-copying and anti-fraud features (holograms, watermark images, variable digital noise, and so on) added to the card background.

This does not mean that reading them is impossible, but results will depend heavily on image quality. Image formats that add less digital noise, such as TIFF or PNG, are recommended over lossy formats such as JPEG, whose compression artifacts add further noise.

Please also try the following image optimization filters:

DeNoise(): Removes digital noise. This filter should only be used where noise is expected. It flattens alpha channels to white.
DeepCleanBackgroundNoise(): Performs heavy background noise removal. Only use this filter when extreme background noise is known to be present, because it can reduce OCR accuracy on clean documents and is very CPU-intensive.
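
As a rough illustration of how these filters might be applied, the sketch below loads a card image, runs the lighter DeNoise() filter, and reads the text. It assumes a recent IronOCR release in which OcrInput exposes LoadImage() (older releases load images differently, for example through AddImage()), and the file name "id-card.png" is a placeholder rather than anything from this article.

```csharp
using System;
using IronOcr;

var ocr = new IronTesseract();

using var input = new OcrInput();

// "id-card.png" is a hypothetical path; a lossless PNG or TIFF scan is preferred.
// Assumption: LoadImage() is available in the installed IronOCR version.
input.LoadImage("id-card.png");

// Try the lighter filter first.
input.DeNoise();

// Enable only if the card background is still extremely noisy:
// this filter is CPU-intensive and can hurt accuracy on clean images.
// input.DeepCleanBackgroundNoise();

OcrResult result = ocr.Read(input);
Console.WriteLine(result.Text);
```

Comparing the output with and without each filter on a representative sample is usually the quickest way to decide which one, if any, helps for a given type of card.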

You may also try crop rectangles, which restrict OCR to the region of the card that contains the text you need: see the Crop Rectangles Example.
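
A minimal sketch of the crop-rectangle idea follows. It assumes a recent IronOCR release in which LoadImage() accepts a Rectangle from IronSoftware.Drawing as a second argument (older releases used a CropRectangle type instead). The file name and pixel coordinates are made up for illustration and would need to be measured on your own card images.

```csharp
using System;
using IronOcr;
using IronSoftware.Drawing;

var ocr = new IronTesseract();

using var input = new OcrInput();

// Hypothetical region, in pixels, covering only the printed name and number,
// so that holograms and background patterns elsewhere on the card are skipped.
var textArea = new Rectangle(x: 120, y: 200, width: 600, height: 150);

// Assumption: this LoadImage(path, rectangle) overload exists in the installed version.
input.LoadImage("id-card.png", textArea);

OcrResult result = ocr.Read(input);
Console.WriteLine(result.Text);
```

Limiting the read to a small region also reduces processing time, since the engine no longer has to analyze the whole card.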

Chaknith Bin


Software Engineer

Chaknith works on IronXL and IronBarcode. He has deep expertise in C# and .NET, helping improve the software and support customers. His insights from user interactions contribute to better products, documentation, and overall experience.