COMPARE TO OTHER COMPONENTS

Windows.Media.Ocr vs IronOCR: .NET OCR Library

Updated:July 19, 2026

Windows.Media.Ocr ships free with every Windows 10 and Windows 11 installation, which makes it attractive until you try to deploy the same application to a Linux server, a Docker container, or an AWS Lambda function — at which point the API simply does not exist. Platform lock-in is not an edge case with this library; it is the defining constraint. Every architectural decision downstream of choosing Windows.Media.Ocr is shaped by the requirement that the host OS must be a Windows 10 or Windows 11 desktop or consumer-grade server. No Linux, no macOS, no Docker, no Azure Functions on Linux, no AWS Lambda. For developers building internal Windows tools with zero plans to cross that boundary, the price of $0 is hard to argue with. For everyone else, the hidden cost is rewriting OCR into a different system entirely when deployment requirements evolve.

Understanding Windows.Media.Ocr

Windows.Media.Ocr is part of the Windows Runtime (WinRT) API surface introduced in Windows 8.1 and refined for Windows 10 and 11. It exposes an OcrEngine class in the Windows.Media.Ocr namespace that accepts a SoftwareBitmap — itself a WinRT type from Windows.Graphics.Imaging — and returns an OcrResult containing recognized text and line geometry.

The API is built around WinRT's async/await contract. Every operation flows through async Task calls backed by WinRT IAsyncOperation machinery: load a StorageFile, open a stream, create a BitmapDecoder, get a SoftwareBitmap, and only then invoke RecognizeAsync. From .NET 6 onward, consuming WinRT APIs requires a Windows-specific Target Framework Moniker (TFM) such as net8.0-windows10.0.19041.0. A project file without that TFM cannot compile code that references Windows.Media.Ocr at all — the types simply do not exist in the assembly graph.

Key architectural characteristics:

Windows 10/11 only — the WinRT API surface is not available on Windows Server without Desktop Experience in all configurations, and is completely absent on Linux and macOS
WinRT async model — all recognition flows through IAsyncOperation-backed async calls; no synchronous path exists
Language packs from the OS — OcrEngine.TryCreateFromLanguage and TryCreateFromUserProfileLanguages resolve available languages from the Windows language packs installed by the user or IT administrator on that specific machine; there is no bundled or portable language model
Image-only input — the API accepts SoftwareBitmap directly; no PDF input path exists at any layer of the API
No preprocessing pipeline — raw bitmap is passed to the recognizer; rotation correction, noise removal, contrast enhancement, and resolution scaling are the developer's responsibility using separate Windows Imaging APIs
No searchable PDF output — recognized text is returned as plain string data with line geometry; no export to PDF is provided
Windows-specific TFM required — project files must target a net*-windows* TFM, which prevents the same project from compiling cross-platform

The WinRT Async Stack

Every basic OCR operation with Windows.Media.Ocr requires navigating through multiple layers of WinRT API surface before recognition can begin:

// Windows.Media.Ocr: 6+ async steps before receiving any text
// Requires net8.0-windows10.0.19041.0 TFM — will not compile cross-platform

public async Task<string> ExtractTextAsync(string imagePath)
{
    // Step 1: WinRT file system access
    var file = await StorageFile.GetFileFromPathAsync(imagePath);

    // Step 2: Open WinRT stream
    using var stream = await file.OpenAsync(FileAccessMode.Read);

    // Step 3: Create bitmap decoder
    var decoder = await BitmapDecoder.CreateAsync(stream);

    // Step 4: Decode to SoftwareBitmap
    var bitmap = await decoder.GetSoftwareBitmapAsync();

    // Step 5: Check language availability — null if not installed on this machine
    var engine = OcrEngine.TryCreateFromLanguage(
        new Windows.Globalization.Language("en-US"));
    if (engine == null)
        throw new Exception("OCR engine not available for this language");

    // Step 6: Recognize
    var result = await engine.RecognizeAsync(bitmap);
    return result.Text;
}

// Windows.Media.Ocr: 6+ async steps before receiving any text
// Requires net8.0-windows10.0.19041.0 TFM — will not compile cross-platform

public async Task<string> ExtractTextAsync(string imagePath)
{
    // Step 1: WinRT file system access
    var file = await StorageFile.GetFileFromPathAsync(imagePath);

    // Step 2: Open WinRT stream
    using var stream = await file.OpenAsync(FileAccessMode.Read);

    // Step 3: Create bitmap decoder
    var decoder = await BitmapDecoder.CreateAsync(stream);

    // Step 4: Decode to SoftwareBitmap
    var bitmap = await decoder.GetSoftwareBitmapAsync();

    // Step 5: Check language availability — null if not installed on this machine
    var engine = OcrEngine.TryCreateFromLanguage(
        new Windows.Globalization.Language("en-US"));
    if (engine == null)
        throw new Exception("OCR engine not available for this language");

    // Step 6: Recognize
    var result = await engine.RecognizeAsync(bitmap);
    return result.Text;
}

Imports System.Threading.Tasks
Imports Windows.Media.Ocr
Imports Windows.Storage
Imports Windows.Graphics.Imaging
Imports Windows.Storage.Streams
Imports Windows.Globalization

Public Async Function ExtractTextAsync(imagePath As String) As Task(Of String)
    ' Step 1: WinRT file system access
    Dim file As StorageFile = Await StorageFile.GetFileFromPathAsync(imagePath)

    ' Step 2: Open WinRT stream
    Using stream As IRandomAccessStream = Await file.OpenAsync(FileAccessMode.Read)

        ' Step 3: Create bitmap decoder
        Dim decoder As BitmapDecoder = Await BitmapDecoder.CreateAsync(stream)

        ' Step 4: Decode to SoftwareBitmap
        Dim bitmap As SoftwareBitmap = Await decoder.GetSoftwareBitmapAsync()

        ' Step 5: Check language availability — Nothing if not installed on this machine
        Dim engine As OcrEngine = OcrEngine.TryCreateFromLanguage(New Language("en-US"))
        If engine Is Nothing Then
            Throw New Exception("OCR engine not available for this language")
        End If

        ' Step 6: Recognize
        Dim result As OcrResult = Await engine.RecognizeAsync(bitmap)
        Return result.Text
    End Using
End Function

$vbLabelText $csharpLabel

The null-check on engine is not optional. If the target language pack is not installed on the machine running the code, TryCreateFromLanguage returns null and recognition is impossible. There is no fallback; the application must surface an error to the user or fail silently.

Understanding IronOCR

IronOCR is a commercial .NET OCR library built on an optimized Tesseract 5 engine with a managed API layer that handles preprocessing, PDF reading, multi-language resolution, and structured data output. It installs as a single NuGet package with no external native binaries to deploy separately, no tessdata folders to manage, and no platform-specific TFMs required.

Key characteristics:

Cross-platform by design — runs on Windows, Linux, macOS, Docker, Azure App Service (Windows or Linux), AWS Lambda, and GCP Cloud Run without code changes
Automatic preprocessing — deskew, denoise, contrast enhancement, binarization, and resolution scaling apply automatically on poor-quality input, with explicit control available via OcrInput filter methods
Native PDF input — IronTesseract.Read accepts PDF paths directly; no conversion step, no external library
125+ bundled languages — language packs are NuGet packages that deploy with the application; no dependency on OS-installed language data
Searchable PDF output — OcrResult.SaveAsSearchablePdf creates a text-layer PDF from any scanned input
Structured result model — OcrResult exposes Pages, Paragraphs, Lines, Words, and per-word confidence scores and bounding boxes
Thread-safe — IronTesseract instances support parallel workloads without additional synchronization
Perpetual licensing — $999 Lite through $5,999 Unlimited, one-time purchase, process unlimited documents

Feature Comparison

Feature	Windows.Media.Ocr	IronOCR
Platform	Windows 10/11 only	Windows, Linux, macOS, Docker, cloud
Price	Free	$999–$5,999 perpetual
PDF input	No	Native
Language model	OS-installed packs	125+ bundled via NuGet
Preprocessing	None	Automatic + explicit filters
Searchable PDF output	No	Yes
API model	WinRT async	Standard .NET

Detailed Feature Comparison

Feature	Windows.Media.Ocr	IronOCR
Platform Support
Windows 10/11	Yes	Yes
Windows Server	Limited	Yes
Linux	No	Yes
macOS	No	Yes
Docker	No	Yes
Azure Functions (Linux)	No	Yes
AWS Lambda	No	Yes
Input Formats
JPEG / PNG / BMP	Yes (via WinRT pipeline)	Yes
PDF (scanned)	No	Yes
PDF (password-protected)	No	Yes
TIFF / multi-page	No	Yes
Stream / byte array	No (WinRT StorageFile only)	Yes
URL	No	Yes
Language Support
Language source	OS-installed language packs	125+ bundled NuGet packs
Install without OS admin	No	Yes (NuGet)
Multi-language simultaneous	No	Yes
Language portability across machines	No	Yes
Preprocessing
Deskew	No	Yes (`input.Deskew()`)
Denoise	No	Yes (`input.DeNoise()`)
Contrast enhancement	No	Yes (`input.Contrast()`)
Binarization	No	Yes (`input.Binarize()`)
Resolution scaling	No	Yes (`input.EnhanceResolution(300)`)
Output
Plain text	Yes	Yes
Searchable PDF	No	Yes
hOCR / HTML	No	Yes
Word-level bounding boxes	Partial (line geometry)	Yes
Per-word confidence scores	No	Yes
API Design
TFM restriction	`net-windows` required	None
Synchronous path	No	Yes
Barcode reading during OCR	No	Yes
Region-based OCR	No	Yes

Platform Lock-In vs Cross-Platform Deployment

The single most consequential difference between these two libraries is not accuracy, not preprocessing, and not PDF support — it is deployment topology. Windows.Media.Ocr does not exist outside Windows 10/11. That is not a configuration problem or a missing NuGet package; the WinRT runtime that backs the API is absent on every other operating system.

Windows.Media.Ocr Approach

The WinRT dependency manifests in the project file before a single line of code runs. The TargetFramework must specify a Windows platform version:


<PropertyGroup>
  <TargetFramework>net8.0-windows10.0.19041.0</TargetFramework>
</PropertyGroup>


<PropertyGroup>
  <TargetFramework>net8.0-windows10.0.19041.0</TargetFramework>
</PropertyGroup>

XML

With that TFM, the project cannot be used from a Linux container. A Docker image based on mcr.microsoft.com/dotnet/aspnet:8.0 — the standard Linux base image for ASP.NET deployments — has no WinRT runtime. Attempting to reference Windows.Media.Ocr types in a project targeting net8.0 (no Windows suffix) produces compile errors, not runtime errors. The lock-in is enforced at build time.

When OCR requirements arise in a microservices architecture where the OCR worker runs on Linux, or in a CI/CD pipeline that produces cross-platform Docker images, Windows.Media.Ocr is not an option to evaluate — it is eliminated before the first spike.

IronOCR Approach

IronOCR targets net6.0, net7.0, net8.0, and net9.0 without platform-specific TFMs. The same NuGet package and the same application binary run on Windows, Linux, and macOS. Deploying IronOCR to Docker requires one apt-get line for libgdiplus on the Linux base image, nothing else:

FROM mcr.microsoft.com/dotnet/aspnet:8.0
# Linux dependency for System.Drawing
RUN apt-get update && apt-get install -y libgdiplus

COPY --from=build /app/publish /app
WORKDIR /app
ENTRYPOINT ["dotnet", "YourApp.dll"]

The application code itself is unchanged between the Windows and Linux deployments:

// Same code — Windows, Linux, macOS, Docker, AWS Lambda
// No platform TFM, no WinRT, no conditional compilation
using IronOcr;

IronOcr.License.LicenseKey = "YOUR-LICENSE-KEY";
var text = new IronTesseract().Read("document.jpg").Text;

// Same code — Windows, Linux, macOS, Docker, AWS Lambda
// No platform TFM, no WinRT, no conditional compilation
using IronOcr;

IronOcr.License.LicenseKey = "YOUR-LICENSE-KEY";
var text = new IronTesseract().Read("document.jpg").Text;

Imports IronOcr

' Same code — Windows, Linux, macOS, Docker, AWS Lambda
' No platform TFM, no WinRT, no conditional compilation

IronOcr.License.LicenseKey = "YOUR-LICENSE-KEY"
Dim text As String = New IronTesseract().Read("document.jpg").Text

$vbLabelText $csharpLabel

IronOCR runs on AWS Lambda, Azure Functions on Linux, and Linux servers directly without code modification. The deployment target is a configuration concern, not an architectural constraint.

Language Support: OS Dependency vs Bundled Packs

Windows.Media.Ocr delegates language availability entirely to the host machine. The set of languages your application can recognize is determined by which language packs the user — or an IT administrator — has installed on that specific Windows installation. This creates a class of production failure that has nothing to do with your code.

Windows.Media.Ocr Approach

OcrEngine.TryCreateFromLanguage returns null when the requested language is not installed. TryCreateFromUserProfileLanguages returns null when no OCR-capable language pack exists at all. Both paths require null handling, and neither provides a graceful recovery path — there is no way to install a language from code or bundle one with the application:

// Windows.Media.Ocr: language availability is a runtime unknown
// Returns null if the language pack is not installed on this machine

var engine = OcrEngine.TryCreateFromLanguage(
    new Windows.Globalization.Language("fr-FR"));

if (engine == null)
{
    // French OCR is simply unavailable — no recovery path
    // User must go to Windows Settings > Language to install French
    throw new InvalidOperationException(
        "French OCR unavailable. Install the French language pack in Windows Settings.");
}

var result = await engine.RecognizeAsync(bitmap);

// Windows.Media.Ocr: language availability is a runtime unknown
// Returns null if the language pack is not installed on this machine

var engine = OcrEngine.TryCreateFromLanguage(
    new Windows.Globalization.Language("fr-FR"));

if (engine == null)
{
    // French OCR is simply unavailable — no recovery path
    // User must go to Windows Settings > Language to install French
    throw new InvalidOperationException(
        "French OCR unavailable. Install the French language pack in Windows Settings.");
}

var result = await engine.RecognizeAsync(bitmap);

Imports Windows.Globalization
Imports Windows.Media.Ocr

' Windows.Media.Ocr: language availability is a runtime unknown
' Returns Nothing if the language pack is not installed on this machine

Dim engine = OcrEngine.TryCreateFromLanguage(New Language("fr-FR"))

If engine Is Nothing Then
    ' French OCR is simply unavailable — no recovery path
    ' User must go to Windows Settings > Language to install French
    Throw New InvalidOperationException("French OCR unavailable. Install the French language pack in Windows Settings.")
End If

Dim result = Await engine.RecognizeAsync(bitmap)

$vbLabelText $csharpLabel

Deploying a multilingual document processing application with Windows.Media.Ocr requires coordinating Windows language pack installation across every machine in the deployment target. On a shared server or a user's machine managed by Group Policy, this is not within the developer's control.

IronOCR Approach

IronOCR ships language models as dedicated NuGet packages that deploy alongside the application binary. The language data travels with the build artifact, not with the OS configuration. Supporting 125+ languages is a dotnet add package operation:

dotnet add package IronOcr.Languages.French, IronOcr.Languages.German, IronOcr.Languages.Arabic, IronOcr.Languages.ChineseSimplified

// IronOCR: language availability is a deploy-time guarantee, not a runtime unknown
var ocr = new IronTesseract();
ocr.Language = OcrLanguage.French;
ocr.AddSecondaryLanguage(OcrLanguage.German);

// Works on any machine, any OS, zero OS configuration required
var result = ocr.Read("multilingual-document.jpg");
Console.WriteLine(result.Text);

// IronOCR: language availability is a deploy-time guarantee, not a runtime unknown
var ocr = new IronTesseract();
ocr.Language = OcrLanguage.French;
ocr.AddSecondaryLanguage(OcrLanguage.German);

// Works on any machine, any OS, zero OS configuration required
var result = ocr.Read("multilingual-document.jpg");
Console.WriteLine(result.Text);

Imports IronOcr

' IronOCR: language availability is a deploy-time guarantee, not a runtime unknown
Dim ocr As New IronTesseract()
ocr.Language = OcrLanguage.French
ocr.AddSecondaryLanguage(OcrLanguage.German)

' Works on any machine, any OS, zero OS configuration required
Dim result = ocr.Read("multilingual-document.jpg")
Console.WriteLine(result.Text)

$vbLabelText $csharpLabel

The full language catalog covers Latin, CJK, Arabic, Hebrew, Devanagari, Cyrillic, and specialized sets including mathematical notation. Every language pack version-pins with the IronOCR package version, so the language model on production matches the one tested locally.

Preprocessing Absence

Low-quality scans — slightly rotated pages, photocopied text with speckle noise, faded ink on off-white paper — produce poor OCR accuracy from any engine that receives them as-is. Preprocessing corrects those defects before recognition runs. Windows.Media.Ocr provides no preprocessing layer of any kind.

Windows.Media.Ocr Approach

The API accepts a SoftwareBitmap and returns text. What happens to image quality between those two points is not configurable. Developers who need to improve accuracy on sub-optimal input must implement preprocessing manually using Windows Imaging Component APIs before constructing the SoftwareBitmap. That is a separate codebase with its own maintenance burden, and it remains Windows-specific for the same reason the OCR API itself is:

// Windows.Media.Ocr: no preprocessing — what you pass is what gets recognized
// Skewed, noisy, or low-resolution images degrade accuracy with no remedy
// Manual preprocessing via separate Windows Imaging APIs is the only option

var bitmap = await decoder.GetSoftwareBitmapAsync();
// bitmap goes directly to recognition with no quality improvement
var result = await engine.RecognizeAsync(bitmap);

// Windows.Media.Ocr: no preprocessing — what you pass is what gets recognized
// Skewed, noisy, or low-resolution images degrade accuracy with no remedy
// Manual preprocessing via separate Windows Imaging APIs is the only option

var bitmap = await decoder.GetSoftwareBitmapAsync();
// bitmap goes directly to recognition with no quality improvement
var result = await engine.RecognizeAsync(bitmap);

Imports System.Threading.Tasks

' Windows.Media.Ocr: no preprocessing — what you pass is what gets recognized
' Skewed, noisy, or low-resolution images degrade accuracy with no remedy
' Manual preprocessing via separate Windows Imaging APIs is the only option

Dim bitmap = Await decoder.GetSoftwareBitmapAsync()
' bitmap goes directly to recognition with no quality improvement
Dim result = Await engine.RecognizeAsync(bitmap)

$vbLabelText $csharpLabel

For standard, clean document scans (a controlled scanning environment, consistent lighting, 300 DPI minimum, correct orientation), this limitation is manageable. For document processing pipelines that receive images from mobile phone cameras, flatbed scanners with auto-feed misalignment, faxed documents, or photocopied materials, it means either building a preprocessing layer from scratch or accepting accuracy degradation.

IronOCR Approach

IronOCR's OcrInput class provides a preprocessing pipeline with individual filter methods that apply in sequence. Image quality correction filters address the most common accuracy killers in production document processing:

// IronOCR: explicit preprocessing pipeline
// Each filter targets a specific quality defect
using var input = new OcrInput();
input.LoadImage("low-quality-scan.jpg");

input.Deskew();              // Correct page rotation up to ~40 degrees
input.DeNoise();             // Remove scanner speckle and compression artifacts
input.Contrast();            // Boost contrast on faded or washed-out text
input.Binarize();            // Convert to black/white with optimal threshold
input.EnhanceResolution(300); // Scale image to 300 DPI for recognition

var result = new IronTesseract().Read(input);
Console.WriteLine($"Confidence: {result.Confidence}%");

// IronOCR: explicit preprocessing pipeline
// Each filter targets a specific quality defect
using var input = new OcrInput();
input.LoadImage("low-quality-scan.jpg");

input.Deskew();              // Correct page rotation up to ~40 degrees
input.DeNoise();             // Remove scanner speckle and compression artifacts
input.Contrast();            // Boost contrast on faded or washed-out text
input.Binarize();            // Convert to black/white with optimal threshold
input.EnhanceResolution(300); // Scale image to 300 DPI for recognition

var result = new IronTesseract().Read(input);
Console.WriteLine($"Confidence: {result.Confidence}%");

Imports IronOcr

' IronOCR: explicit preprocessing pipeline
' Each filter targets a specific quality defect
Using input As New OcrInput()
    input.LoadImage("low-quality-scan.jpg")

    input.Deskew()              ' Correct page rotation up to ~40 degrees
    input.DeNoise()             ' Remove scanner speckle and compression artifacts
    input.Contrast()            ' Boost contrast on faded or washed-out text
    input.Binarize()            ' Convert to black/white with optimal threshold
    input.EnhanceResolution(300) ' Scale image to 300 DPI for recognition

    Dim result = New IronTesseract().Read(input)
    Console.WriteLine($"Confidence: {result.Confidence}%")
End Using

$vbLabelText $csharpLabel

For the common case, IronOCR applies automatic preprocessing when calling Read directly on a file path — the engine detects quality issues and corrects them without explicit filter configuration. The image filters tutorial covers the full filter set including Sharpen, Dilate, Erode, Invert, and ToGrayScale for specialized scenarios. Color correction filters and orientation correction extend the pipeline further for documents with non-standard color profiles or multi-angle rotation.

PDF Support Absence

PDF is the dominant document format in enterprise environments. Contracts, invoices, scanned archives, and government forms arrive as PDFs. Windows.Media.Ocr has no concept of PDF — it accepts only image data. OCR of a PDF document requires a separate PDF rendering library, page-by-page rasterization, and manual assembly of results.

Windows.Media.Ocr Approach

There is no PDF path in the API. To OCR a scanned PDF with Windows.Media.Ocr, the developer must: render each page to a SoftwareBitmap using a separate PDF rendering library (none of which are built into Windows), iterate pages, call RecognizeAsync per page, and concatenate results manually. That rendering library itself carries additional licensing and deployment considerations. The Windows.Media.Ocr code is the smaller part of the total implementation:

// Windows.Media.Ocr: no PDF support
// Requires external PDF renderer to rasterize pages before OCR
// Conceptual pattern — a PDF rendering library is not provided by Windows APIs

// Step 1: Use external PDF library to render page to bitmap (not shown)
// Step 2: Pass rendered bitmap to Windows OCR
// var bitmap = RenderPdfPageToBitmap(pdfPath, pageIndex); // external library required

var engine = OcrEngine.TryCreateFromUserProfileLanguages();
if (engine == null)
    throw new Exception("No OCR language available");

// Step 3: Recognize the rasterized page
// var result = await engine.RecognizeAsync(bitmap);
// Step 4: Collect and concatenate results across all pages manually

// Windows.Media.Ocr: no PDF support
// Requires external PDF renderer to rasterize pages before OCR
// Conceptual pattern — a PDF rendering library is not provided by Windows APIs

// Step 1: Use external PDF library to render page to bitmap (not shown)
// Step 2: Pass rendered bitmap to Windows OCR
// var bitmap = RenderPdfPageToBitmap(pdfPath, pageIndex); // external library required

var engine = OcrEngine.TryCreateFromUserProfileLanguages();
if (engine == null)
    throw new Exception("No OCR language available");

// Step 3: Recognize the rasterized page
// var result = await engine.RecognizeAsync(bitmap);
// Step 4: Collect and concatenate results across all pages manually

' Windows.Media.Ocr: no PDF support
' Requires external PDF renderer to rasterize pages before OCR
' Conceptual pattern — a PDF rendering library is not provided by Windows APIs

' Step 1: Use external PDF library to render page to bitmap (not shown)
' Step 2: Pass rendered bitmap to Windows OCR
' Dim bitmap = RenderPdfPageToBitmap(pdfPath, pageIndex) ' external library required

Dim engine = OcrEngine.TryCreateFromUserProfileLanguages()
If engine Is Nothing Then
    Throw New Exception("No OCR language available")
End If

' Step 3: Recognize the rasterized page
' Dim result = Await engine.RecognizeAsync(bitmap)
' Step 4: Collect and concatenate results across all pages manually

$vbLabelText $csharpLabel

The external PDF rendering step alone adds a dependency, a separate learning curve, and an additional failure surface to what started as a "free and built-in" solution.

IronOCR Approach

IronOCR reads PDFs natively. No external renderer, no rasterization step, no manual page assembly. The same IronTesseract.Read method that accepts image paths accepts PDF paths. PDF OCR in .NET is a single line:

// IronOCR: native PDF support — no external renderer needed
var text = new IronTesseract().Read("scanned-document.pdf").Text;

// Password-protected PDFs
using var input = new OcrInput();
input.LoadPdf("encrypted.pdf", Password: "secret");
var result = new IronTesseract().Read(input);
Console.WriteLine(result.Text);

// Searchable PDF output: make a scanned PDF text-searchable
var ocrResult = new IronTesseract().Read("scanned-archive.pdf");
ocrResult.SaveAsSearchablePdf("searchable-output.pdf");

// IronOCR: native PDF support — no external renderer needed
var text = new IronTesseract().Read("scanned-document.pdf").Text;

// Password-protected PDFs
using var input = new OcrInput();
input.LoadPdf("encrypted.pdf", Password: "secret");
var result = new IronTesseract().Read(input);
Console.WriteLine(result.Text);

// Searchable PDF output: make a scanned PDF text-searchable
var ocrResult = new IronTesseract().Read("scanned-archive.pdf");
ocrResult.SaveAsSearchablePdf("searchable-output.pdf");

Imports IronOcr

' IronOCR: native PDF support — no external renderer needed
Dim text As String = New IronTesseract().Read("scanned-document.pdf").Text

' Password-protected PDFs
Using input As New OcrInput()
    input.LoadPdf("encrypted.pdf", Password:="secret")
    Dim result = New IronTesseract().Read(input)
    Console.WriteLine(result.Text)
End Using

' Searchable PDF output: make a scanned PDF text-searchable
Dim ocrResult = New IronTesseract().Read("scanned-archive.pdf")
ocrResult.SaveAsSearchablePdf("searchable-output.pdf")

$vbLabelText $csharpLabel

The searchable PDF feature embeds a text layer over the original scanned image, producing a PDF that preserves visual fidelity while enabling full-text search and copy-paste. This is a common requirement for document management systems and compliance archives. Windows.Media.Ocr cannot produce this output at any layer of its API.

API Mapping Reference

Windows.Media.Ocr	IronOCR Equivalent
`OcrEngine.TryCreateFromLanguage(lang)`	`new IronTesseract()` with `ocr.Language = OcrLanguage.X`
`OcrEngine.TryCreateFromUserProfileLanguages()`	`new IronTesseract()` (default language auto-resolved)
`engine.RecognizeAsync(softwareBitmap)`	`ocr.Read("image.jpg")` or `ocr.Read(ocrInput)`
`OcrResult.Text`	`OcrResult.Text`
`OcrResult.Lines`	`OcrResult.Lines` (with extended metadata)
`OcrLine.Text`	`OcrResult.Lines[i].Text`
`OcrLine.Words`	`OcrResult.Words` (with bounding boxes + confidence)
`OcrWord.BoundingRect`	`OcrResult.Words[i].X`, `.Y`, `.Width`, `.Height`
`BitmapDecoder.CreateAsync(stream)`	`input.LoadImage(stream)` via `OcrInput`
`StorageFile.GetFileFromPathAsync(path)`	`ocr.Read("path")` directly
No equivalent (PDF not supported)	`ocr.Read("document.pdf")`
No equivalent (PDF not supported)	`input.LoadPdf("file.pdf", Password: "x")`
No equivalent (no searchable PDF)	`result.SaveAsSearchablePdf("output.pdf")`
No equivalent (no preprocessing)	`input.Deskew()`, `input.DeNoise()`, `input.Contrast()`
No equivalent (no multi-language)	`ocr.AddSecondaryLanguage(OcrLanguage.X)`
No equivalent (no confidence)	`result.Confidence`, `word.Confidence`

When Teams Consider Moving from Windows.Media.Ocr to IronOCR

The Application Outgrows Windows Desktop

The most common trigger is a requirement change that introduces a non-Windows deployment target. A desktop utility that starts life as an internal Windows tool gets promoted to a web service, a Docker-based microservice, or a cloud function. The moment that happens, Windows.Media.Ocr becomes a blocker. The OCR component requires a full rewrite because the API does not exist on the target platform — there is no port, no compatibility shim, and no conditional compilation flag that fixes the issue. Teams who planned ahead with IronOCR do not face this rewrite.

Language Requirements Exceed Installed Packs

Document processing pipelines often expand scope. A system built to process English invoices gets a requirement to handle French, German, Arabic, or Japanese documents. With Windows.Media.Ocr, supporting those languages requires coordinating OS language pack installation across every deployment target — developer machines, test VMs, production servers, and any containers in the mix. In environments managed by Group Policy or in cloud VMs with minimal OS footprints, that coordination is impractical. IronOCR's NuGet-based language packs deploy with the application and require no OS coordination.

PDF Processing Is Added to Scope

When the original requirement was "OCR images from a flatbed scanner," Windows.Media.Ocr works. When the requirement expands to "also process the backlog of scanned PDFs in our archive," a second library enters the stack. That library is an additional dependency, an additional licensing consideration, and an additional failure surface. Teams that need both image OCR and PDF OCR on a unified API find that IronOCR eliminates the two-library architecture from the start.

Accuracy Degrades on Real-World Input

Controlled scanning environments produce clean images. Real-world input — photos taken with mobile phones, lightly skewed flatbed scans, older fax-received documents, photocopied materials — produces accuracy degradation that has no remedy within Windows.Media.Ocr. When customer complaints about missed text start arriving, teams discover that the preprocessing step they skipped has become necessary. Retrofitting preprocessing using Windows Imaging APIs is a substantial development effort that keeps the solution Windows-only. IronOCR's preprocessing pipeline is already there.

The Server Deployment Question Arises

Windows.Media.Ocr documentation explicitly positions the API for client applications. Running it in a server context — an ASP.NET application processing user-uploaded documents, a Windows Service consuming a document queue — requires a Windows Server environment with Desktop Experience installed, which is a heavier and more expensive VM profile than a Linux container. When the infrastructure team asks whether the OCR worker can run on a Linux instance to reduce hosting cost, the answer with Windows.Media.Ocr is no.

Common Migration Considerations

Project File TFM Change

Windows.Media.Ocr requires a Windows-specific TFM in the project file (net8.0-windows10.0.19041.0 or similar). Removing that dependency to support cross-platform targets means removing the TFM suffix. IronOCR targets net6.0, net8.0, and net9.0 without Windows-specific suffixes. When migrating, confirm that no other WinRT API dependencies in the project require the Windows TFM — other Windows-platform features (shell integration, Windows notifications, etc.) may need to be abstracted behind platform checks.

Async to Sync Migration

Windows.Media.Ocr is entirely async — RecognizeAsync returns IAsyncOperation<OcrResult> which maps to Task<OcrResult> via WinRT interop. IronOCR provides both synchronous and asynchronous paths. The synchronous ocr.Read("file.jpg") directly replaces the multi-step await chain. For server applications where the OCR call sits inside a background service or a Task-based pipeline, the async path is available too. Either way, the transition from 6+ async steps to 1 call is straightforward:

// Before: Windows.Media.Ocr — 6+ await operations
var file = await StorageFile.GetFileFromPathAsync(imagePath);
using var stream = await file.OpenAsync(FileAccessMode.Read);
var decoder = await BitmapDecoder.CreateAsync(stream);
var bitmap = await decoder.GetSoftwareBitmapAsync();
var engine = OcrEngine.TryCreateFromUserProfileLanguages();
var winResult = await engine.RecognizeAsync(bitmap);
string text = winResult.Text;

// After: IronOCR — 1 call, same result, any platform
string text = new IronTesseract().Read(imagePath).Text;

// Before: Windows.Media.Ocr — 6+ await operations
var file = await StorageFile.GetFileFromPathAsync(imagePath);
using var stream = await file.OpenAsync(FileAccessMode.Read);
var decoder = await BitmapDecoder.CreateAsync(stream);
var bitmap = await decoder.GetSoftwareBitmapAsync();
var engine = OcrEngine.TryCreateFromUserProfileLanguages();
var winResult = await engine.RecognizeAsync(bitmap);
string text = winResult.Text;

// After: IronOCR — 1 call, same result, any platform
string text = new IronTesseract().Read(imagePath).Text;

Imports Windows.Storage
Imports Windows.Graphics.Imaging
Imports Windows.Media.Ocr

' Before: Windows.Media.Ocr — 6+ await operations
Dim file As StorageFile = Await StorageFile.GetFileFromPathAsync(imagePath)
Using stream = Await file.OpenAsync(FileAccessMode.Read)
    Dim decoder As BitmapDecoder = Await BitmapDecoder.CreateAsync(stream)
    Dim bitmap = Await decoder.GetSoftwareBitmapAsync()
    Dim engine = OcrEngine.TryCreateFromUserProfileLanguages()
    Dim winResult = Await engine.RecognizeAsync(bitmap)
    Dim text As String = winResult.Text
End Using

' After: IronOCR — 1 call, same result, any platform
Dim text As String = New IronTesseract().Read(imagePath).Text

$vbLabelText $csharpLabel

Language Pack Substitution

For each language previously resolved via OcrEngine.TryCreateFromLanguage(new Windows.Globalization.Language("fr-FR")), install the corresponding IronOCR language pack and set ocr.Language = OcrLanguage.French. The IronOCR language catalog lists all 125+ available packs. Language codes map straightforwardly from BCP-47 tags to the OcrLanguage enum.

Null Engine Handling Removal

Windows.Media.Ocr requires null-checking every engine creation call. IronOCR throws structured exceptions rather than returning null for configuration or initialization failures. Remove the null-check guard clauses and replace with standard exception handling where needed. The result is a cleaner call site with no "silently unavailable language" failure mode.

Additional IronOCR Capabilities

Beyond the features that directly replace Windows.Media.Ocr functionality, IronOCR covers capabilities that Windows.Media.Ocr has no equivalent for:

Scanned document processing — purpose-built handling for multi-page scanned archives including TIFF and multi-page PDF inputs
Table extraction — structured detection of tabular data within documents for invoice line items, report grids, and form matrices
Specialized document types — passport MRZ zones, MICR cheque lines, license plates, and handwritten text each have dedicated processing paths
Progress tracking — batch operations report progress via events, enabling progress bars and processing rate monitoring in application UIs

.NET Compatibility and Future Readiness

IronOCR supports .NET 6, .NET 7, .NET 8, and .NET 9 on standard TFMs without platform-specific suffixes, alongside .NET Framework 4.6.2 through 4.8 for legacy application support. The library receives regular updates tracking .NET release cadence, with .NET 10 support on the roadmap for 2026. Windows.Media.Ocr is available on any .NET version that supports WinRT interop from .NET 5 onward, but the Windows TFM requirement permanently limits its applicability to Windows-targeted projects. As .NET's cross-platform story matures — with more teams targeting Linux containers and cloud functions as first-class deployment targets — the TFM constraint of Windows.Media.Ocr becomes a more pronounced architectural liability rather than a minor caveat.

Conclusion

Windows.Media.Ocr occupies a specific and legitimate niche: a Windows 10/11 desktop application with zero cross-platform ambitions, basic image OCR needs, and a hard budget constraint of $0. Within that niche, it works. Outside that niche — the moment deployment targets a Linux container, a cloud function, a server with multiple language requirements, or a document pipeline that processes PDFs — the API does not exist on the target platform and the code must be replaced.

The deeper issue is that Windows.Media.Ocr's limitations are architectural, not incidental. Platform lock-in is not a configuration flag to turn off; it is baked into the WinRT runtime that the API depends on. Language availability is not a bundle to include at build time; it is delegated to OS administrators. PDF support is not a missing feature to add with a NuGet package; it is absent from the API surface entirely. Each limitation requires a separate system to compensate for it, and each compensating system reintroduces platform dependencies.

IronOCR addresses all four constraints — platform, language, preprocessing, and PDF — in a single package. The $999 entry price is not $0, and for a Windows-only desktop utility with controlled input and English-only documents, Windows.Media.Ocr remains a valid choice. For any project with broader requirements, the cost of building around Windows.Media.Ocr's constraints can exceed the IronOCR license cost in developer hours before the project reaches its first production deployment.

The practical test is straightforward: if the deployment target could ever be Linux, Docker, or a cloud function, and if input documents could ever be PDFs or arrive in languages beyond the default OS pack, Windows.Media.Ocr is the wrong foundation. Discovering that mid-project is considerably more expensive than choosing the right tool at the start. Evaluating the IronOCR feature set against your specific requirements before committing either direction is the most efficient way to make that determination.

Please noteTesseract and Windows Media OCR are registered trademarks of their respective owners. This site is not affiliated with, endorsed by, or sponsored by Google or Microsoft. All product names, logos, and brands are property of their respective owners. Comparisons are for informational purposes only and reflect publicly available information at the time of writing.

Frequently Asked Questions

What is Windows.Media.Ocr?

Windows.Media.Ocr is an OCR solution used by developers and enterprises to extract text from images and documents. It is one of several OCR options evaluated alongside IronOCR for .NET application development.

How does IronOCR compare to Windows.Media.Ocr for .NET developers?

IronOCR is a NuGet-native .NET OCR library using IronTesseract as its core engine. Compared to Windows.Media.Ocr, it offers simpler deployment (no SDK installers), flat-rate pricing, and a clean C# API without COM interop or cloud dependencies.

Is IronOCR easier to set up than Windows.Media.Ocr?

IronOCR installs via a single NuGet package. There are no SDK installers, license files to copy, COM components to register, or separate runtime binaries to manage. The entire OCR engine is bundled in the package.

What accuracy differences exist between Windows.Media.Ocr and IronOCR?

IronOCR achieves high recognition accuracy for standard business documents, invoices, receipts, and scanned forms. For highly degraded documents or uncommon scripts, accuracy varies by source quality. IronOCR includes image preprocessing filters to improve recognition on low-quality inputs.

Does IronOCR support PDF text extraction?

Yes. IronOCR extracts text from both native PDFs and scanned PDF images in a single call. It also supports multi-page TIFF files, images, and streams. For scanned PDFs, OCR is applied page-by-page with per-page result objects.

How does Windows.Media.Ocr licensing compare to IronOCR?

IronOCR uses a flat-rate perpetual license with no per-page or per-scan charges. Organizations processing high document volumes pay the same license cost regardless of volume. Details and volume pricing are on the IronOCR licensing page.

What languages does IronOCR support?

IronOCR supports 127 languages via separate NuGet language packs. Adding a language requires a single 'dotnet add package IronOcr.Languages.{Language}' command. No manual file placement or path configuration is needed.

How do I install IronOCR in a .NET project?

Install via NuGet: 'Install-Package IronOcr' in Package Manager Console or 'dotnet add package IronOcr' in the CLI. Additional language packs are installed the same way. No native SDK installer is required.

Is IronOCR suitable for Docker and containerized deployments, unlike Windows.Media.Ocr?

Yes. IronOCR works in Docker containers via its NuGet package. The license key is set via an environment variable. No license files, SDK paths, or volume mounts are required for the OCR engine itself.

Can I try IronOCR before purchasing, compared to Windows.Media.Ocr?

Yes. IronOCR trial mode processes documents and returns OCR results with a watermark overlay on output. You can verify accuracy on your own documents before purchasing a license.

Does IronOCR support barcode reading alongside text extraction?

IronOCR focuses on text extraction and OCR. For barcode reading, Iron Software provides IronBarcode as a companion library. Both are available individually or as part of the Iron Suite bundle.

Is it easy to migrate from Windows.Media.Ocr to IronOCR?

Migration from Windows.Media.Ocr to IronOCR typically involves replacing initialization sequences with IronTesseract instantiation, removing COM lifecycle management, and updating API calls. Most migrations reduce code complexity significantly.

Kannapat Udonpant

Chat with engineering team now

Software Engineer

Before becoming a Software Engineer, Kannapat completed a Environmental Resources PhD from Hokkaido University in Japan. While pursuing his degree, Kannapat also became a member of the Vehicle Robotics Laboratory, which is part of the Department of Bioproduction Engineering. In 2022, he leveraged his C# skills to join Iron Software's engineering ...

Updated July 19, 2026

xImage.OCR vs IronOCR: .NET OCR Library Comparison

XImage.OCR charges commercial licensing fees for an OCR library that delivers the same Tesseract engine you can access.

Updated July 19, 2026

Veryfi OCR API vs IronOCR: .NET OCR Library

Veryfi sends your bank account numbers, routing numbers, transaction amounts, and vendor relationships to a third-party.

Updated July 19, 2026

TesseractOCR.Net vs IronOCR: .NET OCR Library

TesseractOCR (the Sicos1977 fork) is a genuinely active, modern .NET wrapper — and that is exactly what makes its.

xImage.OCR vs IronOCR: .NET OCR Library Comparison

Veryfi OCR API vs IronOCR: .NET OCR...

Start Free 30 Day Trial

Windows.Media.Ocr vs IronOCR: .NET OCR Library

Understanding Windows.Media.Ocr

The WinRT Async Stack

Understanding IronOCR

Feature Comparison

Detailed Feature Comparison

Platform Lock-In vs Cross-Platform Deployment

Windows.Media.Ocr Approach

IronOCR Approach

Language Support: OS Dependency vs Bundled Packs

Windows.Media.Ocr Approach

IronOCR Approach

Preprocessing Absence

Windows.Media.Ocr Approach

IronOCR Approach

PDF Support Absence

Windows.Media.Ocr Approach

IronOCR Approach

API Mapping Reference

When Teams Consider Moving from Windows.Media.Ocr to IronOCR

The Application Outgrows Windows Desktop

Language Requirements Exceed Installed Packs

PDF Processing Is Added to Scope

Accuracy Degrades on Real-World Input

The Server Deployment Question Arises

Common Migration Considerations

Project File TFM Change

Async to Sync Migration

Language Pack Substitution

Null Engine Handling Removal

Additional IronOCR Capabilities

.NET Compatibility and Future Readiness

Conclusion

Frequently Asked Questions

What is Windows.Media.Ocr?

How does IronOCR compare to Windows.Media.Ocr for .NET developers?

Is IronOCR easier to set up than Windows.Media.Ocr?

What accuracy differences exist between Windows.Media.Ocr and IronOCR?

Does IronOCR support PDF text extraction?

How does Windows.Media.Ocr licensing compare to IronOCR?

What languages does IronOCR support?

How do I install IronOCR in a .NET project?

Is IronOCR suitable for Docker and containerized deployments, unlike Windows.Media.Ocr?

Can I try IronOCR before purchasing, compared to Windows.Media.Ocr?

Does IronOCR support barcode reading alongside text extraction?

Is it easy to migrate from Windows.Media.Ocr to IronOCR?

Related Articles

xImage.OCR vs IronOCR: .NET OCR Library Comparison

Veryfi OCR API vs IronOCR: .NET OCR Library

TesseractOCR.Net vs IronOCR: .NET OCR Library

Next step: Start free 30-day Trial

Thank You

Next step: Start free 30-day Trial

Want to deploy IronSuite to a live project for FREE?

What’s included?

Your license key has been delivered to your inbox

Your demo request is in.

Trusted by Millions of Engineers Worldwide

Iron Support Team