C# ଏବଂ .NET ରେ ଓଡଆ OCR |
ଏହ ଡକୟମେଣଟର ଅନୟ ସଂସକରଣ:
IronOCR ହେଉଛ ଏକ C# ସଫଟୱେର ଉପାଦାନ ଯାହାକ .NET କୋଡରମାନଙକ ଓଡଆ ସମେତ 126 ଭାଷାରେ ପରତଛବ ଏବଂ PDF ଡକୟମେଣଟର ପାଠ ପ to ବାକ ଅନମତ ଦେଇଥାଏ |
ଏହା ଟେସେରାକଟର ଏକ ଉନନତ ଫଙକ, ଯାହା କେବଳ .NET ଡେଭଲପରମାନଙକ ପାଇ ନରମତ ଏବଂ ଉଭୟ ଗତ ଏବଂ ସଠକତା ପାଇ ନୟମତ ଭାବରେ ଅନୟ ଟେସେରାକଟ ଇଞଜନଗଡକ ଅତକରମ କରଥାଏ |
IronOcr.Languages.Oriya ର ବଷୟବସତ |
ଏହ ପୟାକେଜ .NET ପାଇ 102 OCR ଭାଷା ଧାରଣ କରଥାଏ:
- ଓଡଆ
- ଓଡଆ ବେଷଟ |
- ଓଡଆ ଫାଷଟ |
- ଓଡଆ ବରଣଣମାଳା |
- ଓଡଆ ଆଲଫାବେଟ ବେଷଟ |
- ଓଡଆ ଆଲଫାବେଟ ଫାଷଟ |
ଡାଉନଲୋଡ କରନତ |
ଓଡଆ ଭାଷା ପୟାକ [ଓଡଆ]
* Download as ଜପ
* Install with as https://www.nuget.org/packages/IronOcr.Languages.Oriya/'> NuGet
ସଥାପନ
ଆମକ ପରଥମେ କରବାକ ହେବ ଆମର .NET ପରୋଜେକଟରେ ଆମର ଓଡଆ OCR ପୟାକେଜ ସଂସଥାପନ କରବା |
PM> Install-Package IronOcr.Languages.Oriya
କୋଡ ଉଦାହରଣ |
ଏହ C# କୋଡ ଉଦାହରଣ ଏକ ପରତଛବ କମବା PDF ଡକୟମେଣଟର ଓଡଆ ପାଠ ପ read େ |
// Install the IronOCR Oriya language package
// PM> Install-Package IronOcr.Languages.Oriya
using IronOcr;
var Ocr = new IronTesseract();
// Set the language to Oriya
Ocr.Language = OcrLanguage.Oriya;
// Use the OcrInput with the path to the image
using (var Input = new OcrInput(@"images\Oriya.png"))
{
// Perform OCR to get the result
var Result = Ocr.Read(Input);
// Get the text from the OCR result
var AllText = Result.Text;
}
// Install the IronOCR Oriya language package
// PM> Install-Package IronOcr.Languages.Oriya
using IronOcr;
var Ocr = new IronTesseract();
// Set the language to Oriya
Ocr.Language = OcrLanguage.Oriya;
// Use the OcrInput with the path to the image
using (var Input = new OcrInput(@"images\Oriya.png"))
{
// Perform OCR to get the result
var Result = Ocr.Read(Input);
// Get the text from the OCR result
var AllText = Result.Text;
}
' Install the IronOCR Oriya language package
' PM> Install-Package IronOcr.Languages.Oriya
Imports IronOcr
Private Ocr = New IronTesseract()
' Set the language to Oriya
Ocr.Language = OcrLanguage.Oriya
' Use the OcrInput with the path to the image
Using Input = New OcrInput("images\Oriya.png")
' Perform OCR to get the result
Dim Result = Ocr.Read(Input)
' Get the text from the OCR result
Dim AllText = Result.Text
End Using
IronOCR କାହକ ବାଛନତ?
ଆଇରନ OCR ଏକ ସହଜ-ସଂସଥାପତ, ସଂପରଣଣ ଏବଂ ଭଲ-ଡକୟମେଣଟ ହୋଇଥବା .NET ସଫଟୱେର ଲାଇବରେରୀ |
99.8% + OCR ସଠକତା ହାସଲ କରବାକ IronOCR ବାଛନତ, କ external ଣସ ବାହୟ ୱେବ ସେବା, ଚାଲଥବା ଶଳକ କମବା ଇଣଟରନେଟରେ ଗପତ ଡକୟମେଣଟ ପଠାଇ |
ଭାନଲା ଟେସେରାକଟ ଉପରେ C# ବକାଶକାରୀମାନେ କାହକ IronOCR ବାଛନତ:
- ଗୋଟଏ DLL କମବା NuGet ଭାବରେ ସଂସଥାପନ କରନତ |
- ବାକସ ବାହାରେ ଟେସେରାକଟ 5, 4 ଏବଂ 3 ଇଞଜନ ପାଇ ଅନତରଭକତ କରେ |
- ସଠକତା 99.8% ନୟମତ ଟେସେରାକଟକ ଯଥେଷଟ ଅଧକ କରଥାଏ |
- ଜୱଳନତ ଗତ ଏବଂ ମଲଟ ଥରେଡଂ |
- MVC, ୱେବ ଆପ, ଡେସକଟପ, କନସୋଲ ଏବଂ ସରଭର ଆପଲକେସନ ସସଙଗତ |
- କାମ କରବାକ କ Ex ଣସ Exes କମବା C ++ କୋଡ ନାହ |
- ସମପରଣଣ PDF OCR ସମରଥନ |
- OCR କ ପରାୟ ଯେକ Image ଣସ ପରତଛବ ଫାଇଲ କମବା PDF କରବା ପାଇ |
- ପରଣଣ .ନେଟ କୋର, ଷଟାଣଡାରଡ ଏବଂ ଫରେମ ୱରକ ସମରଥନ |
- ୱଣଡୋଜ, ମୟାକ, ଲନକସ, ଆଜର, ଡକର, ଲମବଡା, AWS ରେ ନୟୋଜତ କରନତ |
- ବାରକୋଡ ଏବଂ QR କୋଡ ପ Read ନତ |
- XHTML ପର OCR ରପତାନ କରନତ |
- ସନଧାନଯୋଗୟ PDF ଡକୟମେଣଟଗଡକ ପାଇ OCR ରପତାନ କରନତ |
- ମଲଟଥରେଡଂ ସମରଥନ |
- 126 ଆନତରଜାତୀୟ ଭାଷା ସମସତ NuGet କମବା OcrData ଫାଇଲ ମାଧୟମରେ ପରଚାଳତ |
- ପରତଛବ, ସଂଯୋଜକ, ପରସଂଖୟାନ ଏବଂ ଫଣଟ ବାହାର କରନତ | କେବଳ ପାଠ ନହେ |
- ବାଣଜୟକ ଏବଂ ମାଲକାନା ପରୟୋଗଗଡକ ମଧୟରେ ଟେସେରାକଟ OCR କ ପନ istr ବଣଟନ ପାଇ ବୟବହାର କରାଯାଇପାରବ |
ବାସତବ ବଶ images ପରତଛବ ଏବଂ ଅସମପରଣଣ ଡକୟମେଣଟ ଯେପରକ ଫଟୋଗରାଫ, କମବା କମ ରଜୋଲୟସନ ସକାନ ସହତ କାମ କରବାବେଳେ ଲ Iron ହ OCR ଉଜଜୱଳ ହୋଇଥାଏ ଯାହା ଡଜଟାଲ ଶବଦ କମବା ଅସମପରଣଣତା ଥାଇପାରେ |
.NET ପଲାଟଫରମ ପାଇ ଅନୟାନୟ ମାଗଣା OCR ଲାଇବରେରୀଗଡକ ଏହପର ଅନୟ .NET ଟେସେରାକଟ API ଏବଂ ୱେବ ସେବାଗଡକ ଏହ ବାସତବ ଦନଆ ବୟବହାର କଷେତରରେ ଏତେ ଭଲ ପରଦରଶନ କରେ ନାହ |
ଟେସେରାକଟ 5 ସହତ OCR - C# ରେ କୋଡଂ ଆରମଭ କରନତ |
ନମନରେ କୋଡ ନମନା ଦରଶାଏ ଯେ C# କମବା VB .NET ବୟବହାର କର ଏକ ପରତଛବର ପାଠ ପ to ବା କେତେ ସହଜ |
OneLiner
// Quick one-line OCR text extraction from an image
string Text = new IronTesseract().Read(@"img\Screenshot.png").Text;
// Quick one-line OCR text extraction from an image
string Text = new IronTesseract().Read(@"img\Screenshot.png").Text;
' Quick one-line OCR text extraction from an image
Dim Text As String = (New IronTesseract()).Read("img\Screenshot.png").Text
ବନୟାସଯୋଗୟ ହେଲୋ ୱାରଲଡ |
// PM> Install-Package IronOcr.Languages.Oriya
using IronOcr;
var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.Oriya;
using (var Input = new OcrInput())
{
// Add an image to the OCR input
Input.AddImage("images/sample.jpeg");
// You can add any number of images here
var Result = Ocr.Read(Input);
Console.WriteLine(Result.Text);
}
// PM> Install-Package IronOcr.Languages.Oriya
using IronOcr;
var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.Oriya;
using (var Input = new OcrInput())
{
// Add an image to the OCR input
Input.AddImage("images/sample.jpeg");
// You can add any number of images here
var Result = Ocr.Read(Input);
Console.WriteLine(Result.Text);
}
Imports IronOcr
Dim Ocr = New IronTesseract()
Ocr.Language = OcrLanguage.Oriya
Using Input = New OcrInput()
' Add an image to the OCR input
Input.AddImage("images/sample.jpeg")
' You can add any number of images here
Dim Result = Ocr.Read(Input)
Console.WriteLine(Result.Text)
End Using
C# PDF OCR |
ସମାନ ପଦଧତ ଯେକ PDF ଣସ PDF ଡକୟମେଣଟର ପାଠ ବାହାର କରବା ପାଇ ସମାନ ଭାବରେ ବୟବହତ ହୋଇପାରେ |
using IronOcr;
var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.Oriya;
using (var input = new OcrInput())
{
input.AddPdf("example.pdf", "password");
// Specific PDF pages can also be selected for OCR
var Result = Ocr.Read(input);
Console.WriteLine(Result.Text);
Console.WriteLine($"{Result.Pages.Count} Pages");
// Outputs page count for each page in the PDF
}
using IronOcr;
var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.Oriya;
using (var input = new OcrInput())
{
input.AddPdf("example.pdf", "password");
// Specific PDF pages can also be selected for OCR
var Result = Ocr.Read(input);
Console.WriteLine(Result.Text);
Console.WriteLine($"{Result.Pages.Count} Pages");
// Outputs page count for each page in the PDF
}
Imports IronOcr
Private Ocr = New IronTesseract()
Ocr.Language = OcrLanguage.Oriya
Using input = New OcrInput()
input.AddPdf("example.pdf", "password")
' Specific PDF pages can also be selected for OCR
Dim Result = Ocr.Read(input)
Console.WriteLine(Result.Text)
Console.WriteLine($"{Result.Pages.Count} Pages")
' Outputs page count for each page in the PDF
End Using
ମଲଟପେଜ TIFF ପାଇ OCR |
ଏକାଧକ ପଷଠା ଡକୟମେଣଟ ସହତ OCR ପଠନ TIFF ଫାଇଲ ଫରମାଟ | ସନଧାନଯୋଗୟ ପାଠୟ ସହତ TIFF କ ସଧାସଳଖ ଏକ PDF ଫାଇଲରେ ରପାନତର କରାଯାଇପାରେ |
using IronOcr;
var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.Oriya;
using (var Input = new OcrInput())
{
Input.AddMultiFrameTiff("multi-frame.tiff");
var Result = Ocr.Read(Input);
Console.WriteLine(Result.Text);
}
using IronOcr;
var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.Oriya;
using (var Input = new OcrInput())
{
Input.AddMultiFrameTiff("multi-frame.tiff");
var Result = Ocr.Read(Input);
Console.WriteLine(Result.Text);
}
Imports IronOcr
Private Ocr = New IronTesseract()
Ocr.Language = OcrLanguage.Oriya
Using Input = New OcrInput()
Input.AddMultiFrameTiff("multi-frame.tiff")
Dim Result = Ocr.Read(Input)
Console.WriteLine(Result.Text)
End Using
ବାରକୋଡ ଏବଂ QR |
ଆଇରନ OCR ର ଏକ ଅନନୟ ବ feature ଶଷଟୟ ହେଉଛ ଏହା ଟେକସଟ ପାଇ ସକାନ କରଥବାବେଳେ ଡକୟମେଣଟର ବାରକୋଡ ଏବଂ QR କୋଡ ପ read ପାରେ | OcrResult.OcrBarcode ଶରେଣୀର ଉଦାହରଣଗଡକ ପରତୟେକ ସକାନ ହୋଇଥବା ବାରକୋଡ ବଷୟରେ ଜଣା ହୋଇଥାଏ |
using IronOcr;
var Ocr = new IronTesseract();
Ocr.Configuration.ReadBarCodes = true;
using (var input = new OcrInput())
{
input.AddImage("img/Barcode.png");
var Result = Ocr.Read(input);
foreach (var Barcode in Result.Barcodes)
{
Console.WriteLine(Barcode.Value);
// Type and Position properties are also available for each barcode
}
}
using IronOcr;
var Ocr = new IronTesseract();
Ocr.Configuration.ReadBarCodes = true;
using (var input = new OcrInput())
{
input.AddImage("img/Barcode.png");
var Result = Ocr.Read(input);
foreach (var Barcode in Result.Barcodes)
{
Console.WriteLine(Barcode.Value);
// Type and Position properties are also available for each barcode
}
}
Imports IronOcr
Private Ocr = New IronTesseract()
Ocr.Configuration.ReadBarCodes = True
Using input = New OcrInput()
input.AddImage("img/Barcode.png")
Dim Result = Ocr.Read(input)
For Each Barcode In Result.Barcodes
Console.WriteLine(Barcode.Value)
' Type and Position properties are also available for each barcode
Next Barcode
End Using
ପରତଛବଗଡକର ନରଦଦଷଟ କଷେତର ଉପରେ OCR |
ସମସତ ଆଇରନ OCR ର ସକାନଂ ଏବଂ ପ reading ବା ପଦଧତଗଡକ କେଉ ପଷଠାର କେଉ ଅଂଶର ଆମେ ପାଠ ପ to ବାକ ଚାହଛ ତାହା ନରଦଦଷଟ କରବାର କଷମତା ଦେଇଥାଏ | ଯେତେବେଳେ ଆମେ ମାନକ ଫରମଗଡକ ଦେଖ ଏହା ଅତୟନତ ଉପଯୋଗୀ ଏବଂ ଏକ ଭୟଙକର ସମୟ ବଞଚାଇ ପାରବ ଏବଂ ଦକଷତା ବଦଧ କରପାରବ |
ଫସଲ ଅଞଚଳ ବୟବହାର କରବାକ, ଆମକ System.Drawing ଏକ ସଷଟମ ରେଫରେନସ ଯୋଡଵାକ ପଡବ ଯାହା System.Drawing.Rectangle ବସତ ବୟବହାର କରପାରବା |
using IronOcr;
var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.Oriya;
using (var Input = new OcrInput())
{
var ContentArea = new System.Drawing.Rectangle()
{
X = 215, Y = 1250, Height = 280, Width = 1335
// Measurements are in pixels
};
Input.Add("document.png", ContentArea);
var Result = Ocr.Read(Input);
Console.WriteLine(Result.Text);
}
using IronOcr;
var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.Oriya;
using (var Input = new OcrInput())
{
var ContentArea = new System.Drawing.Rectangle()
{
X = 215, Y = 1250, Height = 280, Width = 1335
// Measurements are in pixels
};
Input.Add("document.png", ContentArea);
var Result = Ocr.Read(Input);
Console.WriteLine(Result.Text);
}
Imports IronOcr
Private Ocr = New IronTesseract()
Ocr.Language = OcrLanguage.Oriya
Using Input = New OcrInput()
Dim ContentArea = New System.Drawing.Rectangle() With {
.X = 215,
.Y = 1250,
.Height = 280,
.Width = 1335
}
Input.Add("document.png", ContentArea)
Dim Result = Ocr.Read(Input)
Console.WriteLine(Result.Text)
End Using
ନମନ ଗଣବତତା ସକାନ ପାଇ OCR |
ଆଇରନ OCR OcrInput ଶରେଣୀ ସକାନ ଗଣବତତା ପରବଧାନକ ନରଦଦେଶ କରେ ଏବଂ ଏକ ସଂପରଣଣ ସମାଧାନ ଦେଇଥାଏ ଯାହା ସଡଲା ଏବଂ ଅସନଦର ସକାନ ବମୋକତ କରପାରେ ଏବଂ ଡଜଟାଲ ଶବଦ କମବା ଅସମପରଣଣତାକ ଠକ କରପାରେ |
using IronOcr;
var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.Oriya;
using (var Input = new OcrInput(@"img\Potter.LowQuality.tiff"))
{
Input.DeNoise(); // Corrects digital noise and poor scanning
Input.Deskew(); // Corrects rotation and perspective
var Result = Ocr.Read(Input);
Console.WriteLine(Result.Text);
}
using IronOcr;
var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.Oriya;
using (var Input = new OcrInput(@"img\Potter.LowQuality.tiff"))
{
Input.DeNoise(); // Corrects digital noise and poor scanning
Input.Deskew(); // Corrects rotation and perspective
var Result = Ocr.Read(Input);
Console.WriteLine(Result.Text);
}
Imports IronOcr
Private Ocr = New IronTesseract()
Ocr.Language = OcrLanguage.Oriya
Using Input = New OcrInput("img\Potter.LowQuality.tiff")
Input.DeNoise() ' Corrects digital noise and poor scanning
Input.Deskew() ' Corrects rotation and perspective
Dim Result = Ocr.Read(Input)
Console.WriteLine(Result.Text)
End Using
ଏକ ସନଧାନଯୋଗୟ PDF ଭାବରେ OCR ଫଳାଫଳ ରପତାନ କରନତ |
କପ ଯୋଗୟ ପାଠୟ ଷଟରଙଗ ସହତ PDF କ ପରତଛବ | ସରଚଚ ଇଞଜନ ଏବଂ ଡାଟାବେସ ଦୱାରା ସଚୀବଦଧ ହୋଇପାରବ |
using IronOcr;
var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.Oriya;
using (var Input = new OcrInput())
{
Input.Title = "Quarterly Report";
Input.AddImage("image1.jpeg");
Input.AddImage("image2.png");
Input.AddImage("image3.gif");
var Result = Ocr.Read(Input);
Result.SaveAsSearchablePdf("searchable.pdf");
}
using IronOcr;
var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.Oriya;
using (var Input = new OcrInput())
{
Input.Title = "Quarterly Report";
Input.AddImage("image1.jpeg");
Input.AddImage("image2.png");
Input.AddImage("image3.gif");
var Result = Ocr.Read(Input);
Result.SaveAsSearchablePdf("searchable.pdf");
}
Imports IronOcr
Private Ocr = New IronTesseract()
Ocr.Language = OcrLanguage.Oriya
Using Input = New OcrInput()
Input.Title = "Quarterly Report"
Input.AddImage("image1.jpeg")
Input.AddImage("image2.png")
Input.AddImage("image3.gif")
Dim Result = Ocr.Read(Input)
Result.SaveAsSearchablePdf("searchable.pdf")
End Using
ସନଧାନଯୋଗୟ PDF ରପାନତର ପାଇ TIFF |
ଏକ TIFF ଡକୟମେଣଟ (କମବା ପରତଛବ ଫାଇଲଗଡକର କ group ଣସ ଗୋଷଠୀ) କ ଏକ ସନଧାନଯୋଗୟ PDF କ ସଧାସଳଖ ରପାନତର କରନତ ଯାହା ଇଣଟରନେଟ, ୱେବସାଇଟ ଏବଂ ଗଗଲ ସରଚଚ ଇଞଜନ ଦୱାରା ସଚୀବଦଧ ହୋଇପାରବ |
using IronOcr;
var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.Oriya;
using (var Input = new OcrInput())
{
Input.AddMultiFrameTiff("example.tiff");
var Result = Ocr.Read(Input).SaveAsSearchablePdf("searchable.pdf");
}
using IronOcr;
var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.Oriya;
using (var Input = new OcrInput())
{
Input.AddMultiFrameTiff("example.tiff");
var Result = Ocr.Read(Input).SaveAsSearchablePdf("searchable.pdf");
}
Imports IronOcr
Private Ocr = New IronTesseract()
Ocr.Language = OcrLanguage.Oriya
Using Input = New OcrInput()
Input.AddMultiFrameTiff("example.tiff")
Dim Result = Ocr.Read(Input).SaveAsSearchablePdf("searchable.pdf")
End Using
OCR ଫଳାଫଳକ HTML ଭାବରେ ରପତାନ କରନତ |
XHTML ରପାନତର ପାଇ OCR ପରତଛବ |
using IronOcr;
var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.Oriya;
using (var Input = new OcrInput())
{
Input.Title = "Html Title";
Input.AddImage("image1.jpeg");
var Result = Ocr.Read(Input);
Result.SaveAsHocrFile("results.html");
}
using IronOcr;
var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.Oriya;
using (var Input = new OcrInput())
{
Input.Title = "Html Title";
Input.AddImage("image1.jpeg");
var Result = Ocr.Read(Input);
Result.SaveAsHocrFile("results.html");
}
Imports IronOcr
Private Ocr = New IronTesseract()
Ocr.Language = OcrLanguage.Oriya
Using Input = New OcrInput()
Input.Title = "Html Title"
Input.AddImage("image1.jpeg")
Dim Result = Ocr.Read(Input)
Result.SaveAsHocrFile("results.html")
End Using
OCR ପରତଛବ ଉନନତ ଫଲଟର |
OCR ଆବଶୟକତା ବାଢନକ OcrInput ବସତଗଡକ ପାଇ ଦଆଏ ସମାଧାନୀୟ ଫଲଟର ଉପଲବଧ |
ପରତଛବ ଉନନତ ସଂକେତ ଉଦାହରଣ |
ଉପରତ, ବାହାର ମାଇନାମରୀ କାର ଟୟାରୋଉ ନଦେଇଲା |
using IronOcr;
var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.Oriya;
using (var Input = new OcrInput(@"LowQuality.jpeg"))
{
Input.DeNoise(); // Corrects digital noise and poor scanning
Input.Deskew(); // Corrects rotation and perspective
var Result = Ocr.Read(Input);
Console.WriteLine(Result.Text);
}
using IronOcr;
var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.Oriya;
using (var Input = new OcrInput(@"LowQuality.jpeg"))
{
Input.DeNoise(); // Corrects digital noise and poor scanning
Input.Deskew(); // Corrects rotation and perspective
var Result = Ocr.Read(Input);
Console.WriteLine(Result.Text);
}
Imports IronOcr
Private Ocr = New IronTesseract()
Ocr.Language = OcrLanguage.Oriya
Using Input = New OcrInput("LowQuality.jpeg")
Input.DeNoise() ' Corrects digital noise and poor scanning
Input.Deskew() ' Corrects rotation and perspective
Dim Result = Ocr.Read(Input)
Console.WriteLine(Result.Text)
End Using
OCR ପରତଛବ ଫଲଟରଗଡକର ତାଲକା |
OCR କାରଯୟଦକଷତା ବଦଧ କରବାକ ଇନପଟ ଫଲଟର ଯାହା IronOCR ରେ ନରମତ:
- OcrInput.Rotate(double degrees) - Rotates the image a number of degrees clockwise. For counterclockwise, use a negative number.
- OcrInput.Binarize() - This image filter turns each pixel black or white. Useful where there is low-contrast text on a background and OCR performance improves.
- OcrInput.ToGrayScale() - Converts each pixel to grayscale. Might not increase OCR accuracy but can improve speed.
- OcrInput.Contrast() - Automatically increases contrast. Improves OCR speed and accuracy especially on low contrast scans.
- OcrInput.DeNoise() - Removes digital noise. Should be used wherever noise is expected.
- OcrInput.Invert(/csharp/ocr/languages/strong> - Inverts each color (e.g., white becomes black; black becomes white/).
- OcrInput.Dilate() - Enhanced morphology, adds pixels around the boundary of objects in the image. Opposite of Erode.
- OcrInput.Erode() - Enhanced morphology, removes pixels on object boundaries. Opposite of Dilate.
- OcrInput.Deskew() - Rotates image so that it is upright and orthogonally aligned. Essential for accuracy, the 5-degree tolerance of Tesseract is improved.
- OcrInput.DeepCleanBackgroundNoise() - Removes heavy background noise. Use only where known, as it can lower OCR accuracy on clean documents and is CPU intensive.
- OcrInput.EnhanceResolution - Increases resolution of low-quality images. Automatically handles low res inputs using MinimumDPI and TargetDPI properties.
Clean Background Noise is a setting that is time-sensitive but allows the library to automatically clean digital noise, paper shreds, and other artifacts, which other libraries cannot.
Enhance Contrast automatically increases image text contrast and improves OCR accuracy, thereby generally increasing performance and speed.
Enhance Resolution detects low-resolution images (<275 dpi) and auto-upscales them for sharp text. This operation is timely but often faster overall for OCR ops.
Language settings support multiple languages, useful for translating foreign documents.
126 ଭାଷା ପୟାକ |
IronOCR supports 126 ଆନତରଜାତୀୟ ମାଗଣା languages distributed as DLLs; downloadable from this website or NuGet.
Included languages span German, French, English, Chinese, Japanese, and many more. Specialist packages exist for MRZ, MICR, license plates, and others. Any Tesseract ".traineddata" files you have can be used, including your custom-made ones.
Language Example |
Using another OCR language:
using IronOcr;
// PM> Install IronOcr.Languages.Arabic
var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.Arabic;
using (var input = new OcrInput())
{
input.AddImage("img/arabic.gif");
// Add image noise filters if needed
// In this case, the input might be of very poor quality
// IronTesseract can read what conventional ones cannot
var Result = Ocr.Read(input);
// Saving Arabic text - cannot print Arabic easily in console on Windows
Result.SaveAsTextFile("arabic.txt");
}
using IronOcr;
// PM> Install IronOcr.Languages.Arabic
var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.Arabic;
using (var input = new OcrInput())
{
input.AddImage("img/arabic.gif");
// Add image noise filters if needed
// In this case, the input might be of very poor quality
// IronTesseract can read what conventional ones cannot
var Result = Ocr.Read(input);
// Saving Arabic text - cannot print Arabic easily in console on Windows
Result.SaveAsTextFile("arabic.txt");
}
Imports IronOcr
' PM> Install IronOcr.Languages.Arabic
Private Ocr = New IronTesseract()
Ocr.Language = OcrLanguage.Arabic
Using input = New OcrInput()
input.AddImage("img/arabic.gif")
' Add image noise filters if needed
' In this case, the input might be of very poor quality
' IronTesseract can read what conventional ones cannot
Dim Result = Ocr.Read(input)
' Saving Arabic text - cannot print Arabic easily in console on Windows
Result.SaveAsTextFile("arabic.txt")
End Using
Multiple Language Example |
Using multiple languages in one go is possible, aiding in receiving metadata in multiple languages from runtime OCR:
using IronOcr;
// PM> Install IronOcr.Languages.ChineseSimplified
var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.ChineseSimplified;
Ocr.AddSecondaryLanguage(OcrLanguage.Oriya);
// Any number of additional languages can be added
using (var input = new OcrInput())
{
input.Add("multi-language.pdf");
var Result = Ocr.Read(input);
Result.SaveAsTextFile("results.txt");
}
using IronOcr;
// PM> Install IronOcr.Languages.ChineseSimplified
var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.ChineseSimplified;
Ocr.AddSecondaryLanguage(OcrLanguage.Oriya);
// Any number of additional languages can be added
using (var input = new OcrInput())
{
input.Add("multi-language.pdf");
var Result = Ocr.Read(input);
Result.SaveAsTextFile("results.txt");
}
Imports IronOcr
' PM> Install IronOcr.Languages.ChineseSimplified
Private Ocr = New IronTesseract()
Ocr.Language = OcrLanguage.ChineseSimplified
Ocr.AddSecondaryLanguage(OcrLanguage.Oriya)
' Any number of additional languages can be added
Using input = New OcrInput()
input.Add("multi-language.pdf")
Dim Result = Ocr.Read(input)
Result.SaveAsTextFile("results.txt")
End Using
ାସାଧାରଣ OCR ଫଳାଫଳ ଅବଜେକଟସ |
IronOCR provides robust OCR result object for each operation. Developers can access text but the result DOM is far more extensive.
using IronOcr;
var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.Oriya;
Ocr.Configuration.EngineMode = TesseractEngineMode.TesseractAndLstm;
Ocr.Configuration.ReadBarCodes = true; // Important!
using (var Input = new OcrInput(@"images\sample.tiff"))
{
OcrResult Result = Ocr.Read(Input);
var Pages = Result.Pages;
var Words = Pages[0].Words;
var Barcodes = Result.Barcodes;
// Explore a large, detailed API here:
// - Pages, Blocks, Paragraphs, Lines, Words, Characters |
// - Image exports, Font Interactions, Statistical data |
}
using IronOcr;
var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.Oriya;
Ocr.Configuration.EngineMode = TesseractEngineMode.TesseractAndLstm;
Ocr.Configuration.ReadBarCodes = true; // Important!
using (var Input = new OcrInput(@"images\sample.tiff"))
{
OcrResult Result = Ocr.Read(Input);
var Pages = Result.Pages;
var Words = Pages[0].Words;
var Barcodes = Result.Barcodes;
// Explore a large, detailed API here:
// - Pages, Blocks, Paragraphs, Lines, Words, Characters |
// - Image exports, Font Interactions, Statistical data |
}
Imports IronOcr
Private Ocr = New IronTesseract()
Ocr.Language = OcrLanguage.Oriya
Ocr.Configuration.EngineMode = TesseractEngineMode.TesseractAndLstm
Ocr.Configuration.ReadBarCodes = True ' Important!
Using Input = New OcrInput("images\sample.tiff")
Dim Result As OcrResult = Ocr.Read(Input)
Dim Pages = Result.Pages
Dim Words = Pages(0).Words
Dim Barcodes = Result.Barcodes
' Explore a large, detailed API here:
' - Pages, Blocks, Paragraphs, Lines, Words, Characters |
' - Image exports, Font Interactions, Statistical data |
End Using
ପରଦରଶନ
IronOCR performs efficiently out-of-the-box without extra transformation steps.
Blazing fast: IronOCR.2020 is 10 times faster and 250% less error-prone than previous builds.
ଅଧକ ସଖନତ
For more on OCR in C#, VB, F#, or any .NET language, please read our community tutorials, or see how best to make use of its nuances in real-world example applications using this library.
A full object reference is also available for .NET developers.

