C# ଏବଂ .NET ରେ ଓଡଆ OCR |

ଏହ ଡକୟମେଣଟର ଅନୟ ସଂସକରଣ:

IronOCR ହେଉଛ ଏକ C# ସଫଟୱେର ଉପାଦାନ ଯାହାକ .NET କୋଡରମାନଙକ ଓଡଆ ସମେତ 126 ଭାଷାରେ ପରତଛବ ଏବଂ PDF ଡକୟମେଣଟର ପାଠ ପ to ବାକ ଅନମତ ଦେଇଥାଏ |

ଏହା ଟେସେରାକଟର ଏକ ଉନନତ ଫଙକ, ଯାହା କେବଳ .NET ଡେଭଲପରମାନଙକ ପାଇ ନରମତ ଏବଂ ଉଭୟ ଗତ ଏବଂ ସଠକତା ପାଇ ନୟମତ ଭାବରେ ଅନୟ ଟେସେରାକଟ ଇଞଜନଗଡକ ଅତକରମ କରଥାଏ |

IronOcr.Languages.Oriya ର ବଷୟବସତ |

ଏହ ପୟାକେଜ .NET ପାଇ 102 OCR ଭାଷା ଧାରଣ କରଥାଏ:

  • ଓଡଆ
  • ଓଡଆ ବେଷଟ |
  • ଓଡଆ ଫାଷଟ |
  • ଓଡଆ ବରଣଣମାଳା |
  • ଓଡଆ ଆଲଫାବେଟ ବେଷଟ |
  • ଓଡଆ ଆଲଫାବେଟ ଫାଷଟ |

ଡାଉନଲୋଡ କରନତ |

ଓଡଆ ଭାଷା ପୟାକ [ଓଡଆ]
* Download as ଜପ
* Install with as
https://www.nuget.org/packages/IronOcr.Languages.Oriya/'> NuGet

ସଥାପନ

ଆମକ ପରଥମେ କରବାକ ହେବ ଆମର .NET ପରୋଜେକଟରେ ଆମର ଓଡଆ OCR ପୟାକେଜ ସଂସଥାପନ କରବା |

PM> Install-Package IronOCR.Languages.Oriya

କୋଡ ଉଦାହରଣ |

ଏହ C# କୋଡ ଉଦାହରଣ ଏକ ପରତଛବ କମବା PDF ଡକୟମେଣଟର ଓଡଆ ପାଠ ପ read େ |

// Install the IronOCR Oriya language package
// PM> Install-Package IronOcr.Languages.Oriya
using IronOcr;

var Ocr = new IronTesseract();
// Set the language to Oriya
Ocr.Language = OcrLanguage.Oriya;

// Use the OcrInput with the path to the image
using (var Input = new OcrInput(@"images\Oriya.png"))
{
    // Perform OCR to get the result
    var Result = Ocr.Read(Input);

    // Get the text from the OCR result
    var AllText = Result.Text;
}
// Install the IronOCR Oriya language package
// PM> Install-Package IronOcr.Languages.Oriya
using IronOcr;

var Ocr = new IronTesseract();
// Set the language to Oriya
Ocr.Language = OcrLanguage.Oriya;

// Use the OcrInput with the path to the image
using (var Input = new OcrInput(@"images\Oriya.png"))
{
    // Perform OCR to get the result
    var Result = Ocr.Read(Input);

    // Get the text from the OCR result
    var AllText = Result.Text;
}
' Install the IronOCR Oriya language package
' PM> Install-Package IronOcr.Languages.Oriya
Imports IronOcr

Private Ocr = New IronTesseract()
' Set the language to Oriya
Ocr.Language = OcrLanguage.Oriya

' Use the OcrInput with the path to the image
Using Input = New OcrInput("images\Oriya.png")
	' Perform OCR to get the result
	Dim Result = Ocr.Read(Input)

	' Get the text from the OCR result
	Dim AllText = Result.Text
End Using
$vbLabelText   $csharpLabel

IronOCR କାହକ ବାଛନତ?

ଆଇରନ OCR ଏକ ସହଜ-ସଂସଥାପତ, ସଂପରଣଣ ଏବଂ ଭଲ-ଡକୟମେଣଟ ହୋଇଥବା .NET ସଫଟୱେର ଲାଇବରେରୀ |

99.8% + OCR ସଠକତା ହାସଲ କରବାକ IronOCR ବାଛନତ, କ external ଣସ ବାହୟ ୱେବ ସେବା, ଚାଲଥବା ଶଳକ କମବା ଇଣଟରନେଟରେ ଗପତ ଡକୟମେଣଟ ପଠାଇ |

ଭାନଲା ଟେସେରାକଟ ଉପରେ C# ବକାଶକାରୀମାନେ କାହକ IronOCR ବାଛନତ:

  • ଗୋଟଏ DLL କମବା NuGet ଭାବରେ ସଂସଥାପନ କରନତ |
  • ବାକସ ବାହାରେ ଟେସେରାକଟ 5, 4 ଏବଂ 3 ଇଞଜନ ପାଇ ଅନତରଭକତ କରେ |
  • ସଠକତା 99.8% ନୟମତ ଟେସେରାକଟକ ଯଥେଷଟ ଅଧକ କରଥାଏ |
  • ଜୱଳନତ ଗତ ଏବଂ ମଲଟ ଥରେଡଂ |
  • MVC, ୱେବ ଆପ, ଡେସକଟପ, କନସୋଲ ଏବଂ ସରଭର ଆପଲକେସନ ସସଙଗତ |
  • କାମ କରବାକ କ Ex ଣସ Exes କମବା C ++ କୋଡ ନାହ |
  • ସମପରଣଣ PDF OCR ସମରଥନ |
  • OCR କ ପରାୟ ଯେକ Image ଣସ ପରତଛବ ଫାଇଲ କମବା PDF କରବା ପାଇ |
  • ପରଣଣ .ନେଟ କୋର, ଷଟାଣଡାରଡ ଏବଂ ଫରେମ ୱରକ ସମରଥନ |
  • ୱଣଡୋଜ, ମୟାକ, ଲନକସ, ଆଜର, ଡକର, ଲମବଡା, AWS ରେ ନୟୋଜତ କରନତ |
  • ବାରକୋଡ ଏବଂ QR କୋଡ ପ Read ନତ |
  • XHTML ପର OCR ରପତାନ କରନତ |
  • ସନଧାନଯୋଗୟ PDF ଡକୟମେଣଟଗଡକ ପାଇ OCR ରପତାନ କରନତ |
  • ମଲଟଥରେଡଂ ସମରଥନ |
  • 126 ଆନତରଜାତୀୟ ଭାଷା ସମସତ NuGet କମବା OcrData ଫାଇଲ ମାଧୟମରେ ପରଚାଳତ |
  • ପରତଛବ, ସଂଯୋଜକ, ପରସଂଖୟାନ ଏବଂ ଫଣଟ ବାହାର କରନତ | କେବଳ ପାଠ ନହେ |
  • ବାଣଜୟକ ଏବଂ ମାଲକାନା ପରୟୋଗଗଡକ ମଧୟରେ ଟେସେରାକଟ OCR କ ପନ istr ବଣଟନ ପାଇ ବୟବହାର କରାଯାଇପାରବ |

ବାସତବ ବଶ images ପରତଛବ ଏବଂ ଅସମପରଣଣ ଡକୟମେଣଟ ଯେପରକ ଫଟୋଗରାଫ, କମବା କମ ରଜୋଲୟସନ ସକାନ ସହତ କାମ କରବାବେଳେ ଲ Iron ହ OCR ଉଜଜୱଳ ହୋଇଥାଏ ଯାହା ଡଜଟାଲ ଶବଦ କମବା ଅସମପରଣଣତା ଥାଇପାରେ |

.NET ପଲାଟଫରମ ପାଇ ଅନୟାନୟ ମାଗଣା OCR ଲାଇବରେରୀଗଡକ ଏହପର ଅନୟ .net ଟେସେରାକଟ API ଏବଂ ୱେବ ସେବାଗଡକ ଏହ ବାସତବ ଦନଆ ବୟବହାର କଷେତରରେ ଏତେ ଭଲ ପରଦରଶନ କରେ ନାହ |

ଟେସେରାକଟ 5 ସହତ OCR - C# ରେ କୋଡଂ ଆରମଭ କରନତ |

ନମନରେ କୋଡ ନମନା ଦରଶାଏ ଯେ C# କମବା VB .NET ବୟବହାର କର ଏକ ପରତଛବର ପାଠ ପ to ବା କେତେ ସହଜ |

OneLiner

// Quick one-line OCR text extraction from an image
string Text = new IronTesseract().Read(@"img\Screenshot.png").Text;
// Quick one-line OCR text extraction from an image
string Text = new IronTesseract().Read(@"img\Screenshot.png").Text;
' Quick one-line OCR text extraction from an image
Dim Text As String = (New IronTesseract()).Read("img\Screenshot.png").Text
$vbLabelText   $csharpLabel

ବନୟାସଯୋଗୟ ହେଲୋ ୱାରଲଡ |

// PM> Install-Package IronOCR.Languages.Oriya
using IronOcr;

var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.Oriya;

using (var Input = new OcrInput())
{
    // Add an image to the OCR input
    Input.AddImage("images/sample.jpeg");
    // You can add any number of images here

    var Result = Ocr.Read(Input);
    Console.WriteLine(Result.Text);
}
// PM> Install-Package IronOCR.Languages.Oriya
using IronOcr;

var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.Oriya;

using (var Input = new OcrInput())
{
    // Add an image to the OCR input
    Input.AddImage("images/sample.jpeg");
    // You can add any number of images here

    var Result = Ocr.Read(Input);
    Console.WriteLine(Result.Text);
}
' PM> Install-Package IronOCR.Languages.Oriya
Imports IronOcr

Private Ocr = New IronTesseract()
Ocr.Language = OcrLanguage.Oriya

Using Input = New OcrInput()
	' Add an image to the OCR input
	Input.AddImage("images/sample.jpeg")
	' You can add any number of images here

	Dim Result = Ocr.Read(Input)
	Console.WriteLine(Result.Text)
End Using
$vbLabelText   $csharpLabel

C# PDF OCR |

ସମାନ ପଦଧତ ଯେକ PDF ଣସ PDF ଡକୟମେଣଟର ପାଠ ବାହାର କରବା ପାଇ ସମାନ ଭାବରେ ବୟବହତ ହୋଇପାରେ |

using IronOcr;

var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.Oriya;

using (var input = new OcrInput())
{
    input.AddPdf("example.pdf", "password");
    // Specific PDF pages can also be selected for OCR

    var Result = Ocr.Read(input);

    Console.WriteLine(Result.Text);
    Console.WriteLine($"{Result.Pages.Count} Pages");
    // Outputs page count for each page in the PDF
}
using IronOcr;

var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.Oriya;

using (var input = new OcrInput())
{
    input.AddPdf("example.pdf", "password");
    // Specific PDF pages can also be selected for OCR

    var Result = Ocr.Read(input);

    Console.WriteLine(Result.Text);
    Console.WriteLine($"{Result.Pages.Count} Pages");
    // Outputs page count for each page in the PDF
}
Imports IronOcr

Private Ocr = New IronTesseract()
Ocr.Language = OcrLanguage.Oriya

Using input = New OcrInput()
	input.AddPdf("example.pdf", "password")
	' Specific PDF pages can also be selected for OCR

	Dim Result = Ocr.Read(input)

	Console.WriteLine(Result.Text)
	Console.WriteLine($"{Result.Pages.Count} Pages")
	' Outputs page count for each page in the PDF
End Using
$vbLabelText   $csharpLabel

ମଲଟପେଜ TIFF ପାଇ OCR |

ଏକାଧକ ପଷଠା ଡକୟମେଣଟ ସହତ OCR ପଠନ TIFF ଫାଇଲ ଫରମାଟ | ସନଧାନଯୋଗୟ ପାଠୟ ସହତ TIFF କ ସଧାସଳଖ ଏକ PDF ଫାଇଲରେ ରପାନତର କରାଯାଇପାରେ |

using IronOcr;

var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.Oriya;

using (var Input = new OcrInput())
{
    Input.AddMultiFrameTiff("multi-frame.tiff");
    var Result = Ocr.Read(Input);
    Console.WriteLine(Result.Text);
}
using IronOcr;

var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.Oriya;

using (var Input = new OcrInput())
{
    Input.AddMultiFrameTiff("multi-frame.tiff");
    var Result = Ocr.Read(Input);
    Console.WriteLine(Result.Text);
}
Imports IronOcr

Private Ocr = New IronTesseract()
Ocr.Language = OcrLanguage.Oriya

Using Input = New OcrInput()
	Input.AddMultiFrameTiff("multi-frame.tiff")
	Dim Result = Ocr.Read(Input)
	Console.WriteLine(Result.Text)
End Using
$vbLabelText   $csharpLabel

ବାରକୋଡ ଏବଂ QR |

ଆଇରନ OCR ର ଏକ ଅନନୟ ବ feature ଶଷଟୟ ହେଉଛ ଏହା ଟେକସଟ ପାଇ ସକାନ କରଥବାବେଳେ ଡକୟମେଣଟର ବାରକୋଡ ଏବଂ QR କୋଡ ପ read ପାରେ | OcrResult.OcrBarcode ଶରେଣୀର ଉଦାହରଣଗଡକ ପରତୟେକ ସକାନ ହୋଇଥବା ବାରକୋଡ ବଷୟରେ ଜଣା ହୋଇଥାଏ |

using IronOcr;

var Ocr = new IronTesseract();
Ocr.Configuration.ReadBarCodes = true;

using (var input = new OcrInput())
{
    input.AddImage("img/Barcode.png");
    var Result = Ocr.Read(input);

    foreach (var Barcode in Result.Barcodes)
    {
        Console.WriteLine(Barcode.Value);
        // Type and Position properties are also available for each barcode
    }
}
using IronOcr;

var Ocr = new IronTesseract();
Ocr.Configuration.ReadBarCodes = true;

using (var input = new OcrInput())
{
    input.AddImage("img/Barcode.png");
    var Result = Ocr.Read(input);

    foreach (var Barcode in Result.Barcodes)
    {
        Console.WriteLine(Barcode.Value);
        // Type and Position properties are also available for each barcode
    }
}
Imports IronOcr

Private Ocr = New IronTesseract()
Ocr.Configuration.ReadBarCodes = True

Using input = New OcrInput()
	input.AddImage("img/Barcode.png")
	Dim Result = Ocr.Read(input)

	For Each Barcode In Result.Barcodes
		Console.WriteLine(Barcode.Value)
		' Type and Position properties are also available for each barcode
	Next Barcode
End Using
$vbLabelText   $csharpLabel

ପରତଛବଗଡକର ନରଦଦଷଟ କଷେତର ଉପରେ OCR |

ସମସତ ଆଇରନ OCR ର ସକାନଂ ଏବଂ ପ reading ବା ପଦଧତଗଡକ କେଉ ପଷଠାର କେଉ ଅଂଶର ଆମେ ପାଠ ପ to ବାକ ଚାହଛ ତାହା ନରଦଦଷଟ କରବାର କଷମତା ଦେଇଥାଏ | ଯେତେବେଳେ ଆମେ ମାନକ ଫରମଗଡକ ଦେଖ ଏହା ଅତୟନତ ଉପଯୋଗୀ ଏବଂ ଏକ ଭୟଙକର ସମୟ ବଞଚାଇ ପାରବ ଏବଂ ଦକଷତା ବଦଧ କରପାରବ |

ଫସଲ ଅଞଚଳ ବୟବହାର କରବାକ, ଆମକ System.Drawing ଏକ ସଷଟମ ରେଫରେନସ ଯୋଡଵାକ ପଡବ ଯାହା System.Drawing.Rectangle ବସତ ବୟବହାର କରପାରବା |

using IronOcr;

var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.Oriya;

using (var Input = new OcrInput())
{
    var ContentArea = new System.Drawing.Rectangle()
    {
        X = 215, Y = 1250, Height = 280, Width = 1335
        // Measurements are in pixels
    };

    Input.Add("document.png", ContentArea);

    var Result = Ocr.Read(Input);
    Console.WriteLine(Result.Text);
}
using IronOcr;

var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.Oriya;

using (var Input = new OcrInput())
{
    var ContentArea = new System.Drawing.Rectangle()
    {
        X = 215, Y = 1250, Height = 280, Width = 1335
        // Measurements are in pixels
    };

    Input.Add("document.png", ContentArea);

    var Result = Ocr.Read(Input);
    Console.WriteLine(Result.Text);
}
Imports IronOcr

Private Ocr = New IronTesseract()
Ocr.Language = OcrLanguage.Oriya

Using Input = New OcrInput()
	Dim ContentArea = New System.Drawing.Rectangle() With {
		.X = 215,
		.Y = 1250,
		.Height = 280,
		.Width = 1335
	}

	Input.Add("document.png", ContentArea)

	Dim Result = Ocr.Read(Input)
	Console.WriteLine(Result.Text)
End Using
$vbLabelText   $csharpLabel

ନମନ ଗଣବତତା ସକାନ ପାଇ OCR |

ଆଇରନ OCR OcrInput ଶରେଣୀ ସକାନ ଗଣବତତା ପରବଧାନକ ନରଦଦେଶ କରେ ଏବଂ ଏକ ସଂପରଣଣ ସମାଧାନ ଦେଇଥାଏ ଯାହା ସଡଲା ଏବଂ ଅସନଦର ସକାନ ବମୋକତ କରପାରେ ଏବଂ ଡଜଟାଲ ଶବଦ କମବା ଅସମପରଣଣତାକ ଠକ କରପାରେ |

using IronOcr;

var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.Oriya;

using (var Input = new OcrInput(@"img\Potter.LowQuality.tiff"))
{
    Input.DeNoise();   // Corrects digital noise and poor scanning
    Input.Deskew();    // Corrects rotation and perspective
    var Result = Ocr.Read(Input);
    Console.WriteLine(Result.Text);
}
using IronOcr;

var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.Oriya;

using (var Input = new OcrInput(@"img\Potter.LowQuality.tiff"))
{
    Input.DeNoise();   // Corrects digital noise and poor scanning
    Input.Deskew();    // Corrects rotation and perspective
    var Result = Ocr.Read(Input);
    Console.WriteLine(Result.Text);
}
Imports IronOcr

Private Ocr = New IronTesseract()
Ocr.Language = OcrLanguage.Oriya

Using Input = New OcrInput("img\Potter.LowQuality.tiff")
	Input.DeNoise() ' Corrects digital noise and poor scanning
	Input.Deskew() ' Corrects rotation and perspective
	Dim Result = Ocr.Read(Input)
	Console.WriteLine(Result.Text)
End Using
$vbLabelText   $csharpLabel

ଏକ ସନଧାନଯୋଗୟ PDF ଭାବରେ OCR ଫଳାଫଳ ରପତାନ କରନତ |

କପ ଯୋଗୟ ପାଠୟ ଷଟରଙଗ ସହତ PDF କ ପରତଛବ | ସରଚଚ ଇଞଜନ ଏବଂ ଡାଟାବେସ ଦୱାରା ସଚୀବଦଧ ହୋଇପାରବ |

using IronOcr;

var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.Oriya;

using (var Input = new OcrInput())
{
    Input.Title = "Quarterly Report";
    Input.AddImage("image1.jpeg");
    Input.AddImage("image2.png");
    Input.AddImage("image3.gif");

    var Result = Ocr.Read(Input);
    Result.SaveAsSearchablePdf("searchable.pdf");
}
using IronOcr;

var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.Oriya;

using (var Input = new OcrInput())
{
    Input.Title = "Quarterly Report";
    Input.AddImage("image1.jpeg");
    Input.AddImage("image2.png");
    Input.AddImage("image3.gif");

    var Result = Ocr.Read(Input);
    Result.SaveAsSearchablePdf("searchable.pdf");
}
Imports IronOcr

Private Ocr = New IronTesseract()
Ocr.Language = OcrLanguage.Oriya

Using Input = New OcrInput()
	Input.Title = "Quarterly Report"
	Input.AddImage("image1.jpeg")
	Input.AddImage("image2.png")
	Input.AddImage("image3.gif")

	Dim Result = Ocr.Read(Input)
	Result.SaveAsSearchablePdf("searchable.pdf")
End Using
$vbLabelText   $csharpLabel

ସନଧାନଯୋଗୟ PDF ରପାନତର ପାଇ TIFF |

ଏକ TIFF ଡକୟମେଣଟ (କମବା ପରତଛବ ଫାଇଲଗଡକର କ group ଣସ ଗୋଷଠୀ) କ ଏକ ସନଧାନଯୋଗୟ PDF କ ସଧାସଳଖ ରପାନତର କରନତ ଯାହା ଇଣଟରନେଟ, ୱେବସାଇଟ ଏବଂ ଗଗଲ ସରଚଚ ଇଞଜନ ଦୱାରା ସଚୀବଦଧ ହୋଇପାରବ |

using IronOcr;

var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.Oriya;

using (var Input = new OcrInput())
{
    Input.AddMultiFrameTiff("example.tiff");
    var Result = Ocr.Read(Input).SaveAsSearchablePdf("searchable.pdf");
}
using IronOcr;

var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.Oriya;

using (var Input = new OcrInput())
{
    Input.AddMultiFrameTiff("example.tiff");
    var Result = Ocr.Read(Input).SaveAsSearchablePdf("searchable.pdf");
}
Imports IronOcr

Private Ocr = New IronTesseract()
Ocr.Language = OcrLanguage.Oriya

Using Input = New OcrInput()
	Input.AddMultiFrameTiff("example.tiff")
	Dim Result = Ocr.Read(Input).SaveAsSearchablePdf("searchable.pdf")
End Using
$vbLabelText   $csharpLabel

OCR ଫଳାଫଳକ HTML ଭାବରେ ରପତାନ କରନତ |

XHTML ରପାନତର ପାଇ OCR ପରତଛବ |

using IronOcr;

var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.Oriya;

using (var Input = new OcrInput())
{
    Input.Title = "Html Title";
    Input.AddImage("image1.jpeg");
    var Result = Ocr.Read(Input);
    Result.SaveAsHocrFile("results.html");
}
using IronOcr;

var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.Oriya;

using (var Input = new OcrInput())
{
    Input.Title = "Html Title";
    Input.AddImage("image1.jpeg");
    var Result = Ocr.Read(Input);
    Result.SaveAsHocrFile("results.html");
}
Imports IronOcr

Private Ocr = New IronTesseract()
Ocr.Language = OcrLanguage.Oriya

Using Input = New OcrInput()
	Input.Title = "Html Title"
	Input.AddImage("image1.jpeg")
	Dim Result = Ocr.Read(Input)
	Result.SaveAsHocrFile("results.html")
End Using
$vbLabelText   $csharpLabel

OCR ପରତଛବ ଉନନତ ଫଲଟର |

OCR ଆବଶୟକତା ➗ ବାଢନକ OcrInput ବସତଗଡକ ପାଇ ଦଆଏ ସମାଧାନୀୟ ଫଲଟର ଉପଲବଧ |

ପରତଛବ ଉନନତ ସଂକେତ ଉଦାହରଣ |

ଉପରତ, ବାହାର ମାଇନାମରୀ କାର ଟୟାରୋଉ ନଦେଇଲା |

using IronOcr;

var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.Oriya;

using (var Input = new OcrInput(@"LowQuality.jpeg"))
{
    Input.DeNoise();   // Corrects digital noise and poor scanning
    Input.Deskew();    // Corrects rotation and perspective
    var Result = Ocr.Read(Input);
    Console.WriteLine(Result.Text);
}
using IronOcr;

var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.Oriya;

using (var Input = new OcrInput(@"LowQuality.jpeg"))
{
    Input.DeNoise();   // Corrects digital noise and poor scanning
    Input.Deskew();    // Corrects rotation and perspective
    var Result = Ocr.Read(Input);
    Console.WriteLine(Result.Text);
}
Imports IronOcr

Private Ocr = New IronTesseract()
Ocr.Language = OcrLanguage.Oriya

Using Input = New OcrInput("LowQuality.jpeg")
	Input.DeNoise() ' Corrects digital noise and poor scanning
	Input.Deskew() ' Corrects rotation and perspective
	Dim Result = Ocr.Read(Input)
	Console.WriteLine(Result.Text)
End Using
$vbLabelText   $csharpLabel

OCR ପରତଛବ ଫଲଟରଗଡକର ତାଲକା |

OCR କାରଯୟଦକଷତା ବଦଧ କରବାକ ଇନପଟ ଫଲଟର ଯାହା IronOCR ରେ ନରମତ:

  • OcrInput.Rotate(double degrees) - Rotates the image a number of degrees clockwise. For counterclockwise, use a negative number.
  • OcrInput.Binarize() - This image filter turns each pixel black or white. Useful where there is low-contrast text on a background and OCR performance improves.
  • OcrInput.ToGrayScale() - Converts each pixel to grayscale. Might not increase OCR accuracy but can improve speed.
  • OcrInput.Contrast() - Automatically increases contrast. Improves OCR speed and accuracy especially on low contrast scans.
  • OcrInput.DeNoise() - Removes digital noise. Should be used wherever noise is expected.
  • OcrInput.Invert(/csharp/ocr/languages/strong> - Inverts each color (e.g., white becomes black; black becomes white/).
  • OcrInput.Dilate() - Enhanced morphology, adds pixels around the boundary of objects in the image. Opposite of Erode.
  • OcrInput.Erode() - Enhanced morphology, removes pixels on object boundaries. Opposite of Dilate.
  • OcrInput.Deskew() - Rotates image so that it is upright and orthogonally aligned. Essential for accuracy, the 5-degree tolerance of Tesseract is improved.
  • OcrInput.DeepCleanBackgroundNoise() - Removes heavy background noise. Use only where known, as it can lower OCR accuracy on clean documents and is CPU intensive.
  • OcrInput.EnhanceResolution - Increases resolution of low-quality images. Automatically handles low res inputs using MinimumDPI and TargetDPI properties.

Clean Background Noise is a setting that is time-sensitive but allows the library to automatically clean digital noise, paper shreds, and other artifacts, which other libraries cannot.

Enhance Contrast automatically increases image text contrast and improves OCR accuracy, thereby generally increasing performance and speed.

Enhance Resolution detects low-resolution images (<275 dpi) and auto-upscales them for sharp text. This operation is timely but often faster overall for OCR ops.

Language settings support multiple languages, useful for translating foreign documents.

126 ଭାଷା ପୟାକ |

IronOCR supports 126 ଆନତରଜାତୀୟ ମାଗଣା languages distributed as DLLs; downloadable from this website or NuGet.

Included languages span German, French, English, Chinese, Japanese, and many more. Specialist packages exist for MRZ, MICR, license plates, and others. Any Tesseract ".traineddata" files you have can be used, including your custom-made ones.

Language Example |

Using another OCR language:

using IronOcr;
// PM> Install IronOcr.Languages.Arabic

var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.Arabic;

using (var input = new OcrInput())
{
    input.AddImage("img/arabic.gif");
    // Add image noise filters if needed
    // In this case, the input might be of very poor quality
    // IronTesseract can read what conventional ones cannot

    var Result = Ocr.Read(input);

    // Saving Arabic text - cannot print Arabic easily in console on Windows
    Result.SaveAsTextFile("arabic.txt");
}
using IronOcr;
// PM> Install IronOcr.Languages.Arabic

var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.Arabic;

using (var input = new OcrInput())
{
    input.AddImage("img/arabic.gif");
    // Add image noise filters if needed
    // In this case, the input might be of very poor quality
    // IronTesseract can read what conventional ones cannot

    var Result = Ocr.Read(input);

    // Saving Arabic text - cannot print Arabic easily in console on Windows
    Result.SaveAsTextFile("arabic.txt");
}
Imports IronOcr
' PM> Install IronOcr.Languages.Arabic

Private Ocr = New IronTesseract()
Ocr.Language = OcrLanguage.Arabic

Using input = New OcrInput()
	input.AddImage("img/arabic.gif")
	' Add image noise filters if needed
	' In this case, the input might be of very poor quality
	' IronTesseract can read what conventional ones cannot

	Dim Result = Ocr.Read(input)

	' Saving Arabic text - cannot print Arabic easily in console on Windows
	Result.SaveAsTextFile("arabic.txt")
End Using
$vbLabelText   $csharpLabel

Multiple Language Example |

Using multiple languages in one go is possible, aiding in receiving metadata in multiple languages from runtime OCR:

using IronOcr;
// PM> Install IronOcr.Languages.ChineseSimplified

var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.ChineseSimplified;
Ocr.AddSecondaryLanguage(OcrLanguage.Oriya);

// Any number of additional languages can be added

using (var input = new OcrInput())
{
    input.Add("multi-language.pdf");
    var Result = Ocr.Read(input);
    Result.SaveAsTextFile("results.txt");
}
using IronOcr;
// PM> Install IronOcr.Languages.ChineseSimplified

var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.ChineseSimplified;
Ocr.AddSecondaryLanguage(OcrLanguage.Oriya);

// Any number of additional languages can be added

using (var input = new OcrInput())
{
    input.Add("multi-language.pdf");
    var Result = Ocr.Read(input);
    Result.SaveAsTextFile("results.txt");
}
Imports IronOcr
' PM> Install IronOcr.Languages.ChineseSimplified

Private Ocr = New IronTesseract()
Ocr.Language = OcrLanguage.ChineseSimplified
Ocr.AddSecondaryLanguage(OcrLanguage.Oriya)

' Any number of additional languages can be added

Using input = New OcrInput()
	input.Add("multi-language.pdf")
	Dim Result = Ocr.Read(input)
	Result.SaveAsTextFile("results.txt")
End Using
$vbLabelText   $csharpLabel

ାସାଧାରଣ OCR ଫଳାଫଳ ଅବଜେକଟସ |

IronOCR provides robust OCR result object for each operation. Developers can access text but the result DOM is far more extensive.

using IronOcr;

var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.Oriya;
Ocr.Configuration.EngineMode = TesseractEngineMode.TesseractAndLstm;
Ocr.Configuration.ReadBarCodes = true; // Important!

using (var Input = new OcrInput(@"images\sample.tiff"))
{
    OcrResult Result = Ocr.Read(Input);
    var Pages = Result.Pages;
    var Words = Pages[0].Words;
    var Barcodes = Result.Barcodes;
    // Explore a large, detailed API here:
    // - Pages, Blocks, Paragraphs, Lines, Words, Characters |
    // - Image exports, Font Interactions, Statistical data |
}
using IronOcr;

var Ocr = new IronTesseract();
Ocr.Language = OcrLanguage.Oriya;
Ocr.Configuration.EngineMode = TesseractEngineMode.TesseractAndLstm;
Ocr.Configuration.ReadBarCodes = true; // Important!

using (var Input = new OcrInput(@"images\sample.tiff"))
{
    OcrResult Result = Ocr.Read(Input);
    var Pages = Result.Pages;
    var Words = Pages[0].Words;
    var Barcodes = Result.Barcodes;
    // Explore a large, detailed API here:
    // - Pages, Blocks, Paragraphs, Lines, Words, Characters |
    // - Image exports, Font Interactions, Statistical data |
}
Imports IronOcr

Private Ocr = New IronTesseract()
Ocr.Language = OcrLanguage.Oriya
Ocr.Configuration.EngineMode = TesseractEngineMode.TesseractAndLstm
Ocr.Configuration.ReadBarCodes = True ' Important!

Using Input = New OcrInput("images\sample.tiff")
	Dim Result As OcrResult = Ocr.Read(Input)
	Dim Pages = Result.Pages
	Dim Words = Pages(0).Words
	Dim Barcodes = Result.Barcodes
	' Explore a large, detailed API here:
	' - Pages, Blocks, Paragraphs, Lines, Words, Characters |
	' - Image exports, Font Interactions, Statistical data |
End Using
$vbLabelText   $csharpLabel

ପରଦରଶନ

IronOCR performs efficiently out-of-the-box without extra transformation steps.

Blazing fast: IronOcr.2020 is 10 times faster and 250% less error-prone than previous builds.

ଅଧକ ସଖନତ

For more on OCR in C#, VB, F#, or any .NET language, please read our community tutorials, or see how best to make use of its nuances in real-world example applications using this library.

A full object reference is also available for .NET developers.