How to use OCR Language Packs in IronOCR

Kannapat Udonpant
December 10, 2023
In this tutorial, you'll learn how to extract text from multilingual PDF documents using IronOCR in C#. The video walks through setting up IronOCR and installing additional language packs, specifically English and Japanese. You'll see how to configure the OCR engine to support multiple languages and apply it to a sample PDF that contains both English and Japanese text. The tutorial demonstrates how to initialize the OCR engine, define the input file, and extract text using the Read method. The extracted content is then saved to a .txt file, with error handling in place for failed operations. The example shows how IronOCR can recognize multiple languages in a single scan, which is useful when processing multilingual forms, international documents, or PDFs from global sources in C#.
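
The steps described above can be sketched roughly as follows. This is a minimal sketch, not the exact code from the video: it assumes the IronOcr and IronOcr.Languages.Japanese NuGet packages are installed, and the file names sample.pdf and output.txt are hypothetical placeholders. The method for loading a PDF into OcrInput has changed between IronOCR releases (LoadPdf in recent versions, AddPdf in older ones), so check the API reference for the version you have installed.

```csharp
using System;
using System.IO;
using IronOcr;

class Program
{
    static void Main()
    {
        // Hypothetical file names, used for illustration only.
        const string inputPath = "sample.pdf";
        const string outputPath = "output.txt";

        try
        {
            // Initialize the OCR engine.
            var ocr = new IronTesseract();

            // English as the primary language, Japanese as a secondary one.
            // Japanese requires the IronOcr.Languages.Japanese package.
            ocr.Language = OcrLanguage.English;
            ocr.AddSecondaryLanguage(OcrLanguage.Japanese);

            // Define the input file. Older IronOCR releases use AddPdf
            // instead of LoadPdf.
            using var input = new OcrInput();
            input.LoadPdf(inputPath);

            // Run OCR and write the recognized text to a .txt file.
            OcrResult result = ocr.Read(input);
            File.WriteAllText(outputPath, result.Text);

            Console.WriteLine($"Text extracted to {outputPath}");
        }
        catch (Exception ex)
        {
            // Report failures such as a missing file or language pack.
            Console.Error.WriteLine($"OCR failed: {ex.Message}");
        }
    }
}
```

Catching the general Exception type keeps the sketch short; production code would likely handle file and OCR errors separately.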

Further Reading: Additional OCR Language Packs

Get started with IronOCR now.

Kannapat Udonpant
Software Engineer
Before becoming a Software Engineer, Kannapat completed an Environmental Resources PhD at Hokkaido University in Japan. While pursuing his degree, Kannapat also became a member of the Vehicle Robotics Laboratory, which is part of the Department of Bioproduction Engineering. In 2022, he leveraged his C# skills to join Iron Software's engineering team, where he focuses on IronPDF. Kannapat values his job because he learns directly from the developer who writes most of the code used in IronPDF. In addition to peer learning, Kannapat enjoys the social aspect of working at Iron Software. When he's not writing code or documentation, Kannapat can usually be found gaming on his PS5 or rewatching The Last of Us.