Reading Identity Documents with IronOCR

Identity documents are, by design, very difficult for OCR engines to read due to anti-copying/fraud protection -- holograms, watermarking images, variable digital noise, etc.-- added to the backgrounds of the card.

This is not to say it is impossible. Results will likely be dependent on image quality. Image formats with less digital noise, such as TIFF or PNG, are recommended over lossy image formats, such as JPEG.

Please also try the following image optimization filters:

  • DeNoise(): Removes digital noise. This filter should only be used where noise is expected. It flattens Alpha channels to white.
  • DeepCleanBackgroundNoise(): Performs heavy background noise removal. Only use this filter in cases where extreme document background noise is known, as this filter will also risk reducing OCR accuracy of clean documents and is very CPU expensive.

You may also try crop rectangles: Crop Rectangles Example.

Curtis Chau
Technical Writer

Curtis Chau holds a Bachelor’s degree in Computer Science (Carleton University) and specializes in front-end development with expertise in Node.js, TypeScript, JavaScript, and React. Passionate about crafting intuitive and aesthetically pleasing user interfaces, Curtis enjoys working with modern frameworks and creating well-structured, visually appealing manuals.

...

Read More
Ready to Get Started?
Nuget Downloads 4,876,499 | Version: 2025.10 just released