Fixing Low Quality Scans & Images
The IronTesseract OCR class gives C# and .NET developers granular control when adding OCR (image and PDF to text) functionality to their applications, and lets them fine-tune performance for their own specific use case.
By adjusting its settings and testing against real-world examples, a good balance between speed and accuracy can be found. Settings include: CleanBackgroundNoise, EnhanceContrast, EnhanceResolution, Language, Strategy, RotateAndStraighten, ColorSpace, DetectWhiteTextOnDarkBackgrounds, InputImageType.
This allows low-quality scans to be read that free Tesseract would not be able to handle.