How to Use a Document Scanner SDK in a .NET MAUI Application
With the rise of mobile technology, document-scanning apps such as Scanbot SDK, and Native SDKs have become indispensable tools for both individuals and businesses. In this tutorial, we'll explore how to create a document scanner app using the latest version of .NET Multi-platform App UI (MAUI) and IronOCR, a powerful OCR (Optical Character Recognition) library for .NET. .NET MAUI simplifies the creation of cross-platform mobile apps, ensuring seamless deployment on devices such as Android. By the end of this guide, you'll be able to develop your own document scanner SDK app that can extract text from images and scanned files with ease.
How to Use a Document Scanner SDK in a .NET MAUI Application
- Install the IronOCR C# Library to use the Document Scanner SDK.
- Design a .NET MAUI Form with necessary controls.
- Capture a photo using the MediaPicker.CapturePhotoAsync method.
- Convert the captured photo to a Stream.
- Pass the stream to the OcrInput LoadImage method.
- Perform OCR using the IronTesseract Read method.
- Display the document text using the OcrResult Text property.
IronOCR - The C# OCR Library
IronOCR is a cutting-edge Optical Character Recognition (OCR) software developed by Iron Software, LLC, designed to accurately and efficiently convert images and scanned documents into editable text. OCR technology has revolutionized how businesses handle document processing, making it easier to extract valuable information from various sources such as scanned documents, PDFs, and images.
IronOCR stands out among OCR solutions due to its advanced features, robust performance, and ease of integration. Whether you're a developer looking to incorporate OCR features into your applications or a business seeking to streamline document management processes, IronOCR offers a comprehensive solution.
Key Features of IronOCR
- High Accuracy: IronOCR employs state-of-the-art algorithms and machine learning techniques to achieve exceptional accuracy in text recognition. It can accurately extract text from complex documents, including images with low resolution or poor-quality scans.
- Multi-Language Support: IronOCR supports text recognition in over 125 languages, making it suitable for businesses operating in diverse linguistic environments.
- Image Preprocessing: IronOCR provides various image preprocessing capabilities, such as noise reduction, contrast adjustment, and deskewing, to enhance accuracy. These techniques improve OCR results, especially with distorted or imperfect images.
- Support for Various File Formats: IronOCR supports a wide range of file formats, including TIFF, JPEG, PNG, and PDF, ensuring compatibility with different document sources.
- Customization Options: Developers can customize IronOCR's behavior to meet specific requirements, offering flexibility in recognition parameters and workflow integration.
- Fast and Scalable: Optimized for performance, IronOCR rapidly extracts text from large volumes of documents. Its scalable architecture ensures seamless operation, regardless of document volume.
- Integration with .NET Applications: IronOCR integrates seamlessly with .NET applications, providing an easy-to-use API for incorporating OCR functionality. This simplifies development and speeds time-to-market for OCR-enabled applications.
- Document Classification and Data Extraction: Beyond basic text recognition, IronOCR offers advanced features for document classification and data extraction, identifying specific data fields like names, addresses, or invoice numbers.
Prerequisites
- Basic knowledge of C# programming.
- Visual Studio 2022 installed with the .NET MAUI workload.
- IronOCR package library installed via NuGet Package Manager.
1. Setting Up Your .NET MAUI Project
- Open Visual Studio 2022 and create a new .NET MAUI App project.
- Choose a suitable project name and configure your project settings.
- Ensure you have the necessary Android and iOS SDKs installed for target-platform device development.
2. Installing IronOCR Library
- Right-click on your Solution in Visual Studio.
- Select "Manage NuGet Packages for Solution" and in the Browse tab, search for "IronOCR".
- Install the IronOCR library to your project.
3. Designing the UI
Let's start by designing the layout of our MainPage.xaml. We'll create a simple layout with an image control to display the captured photo, a Capture button to take photos, and a Label to display the extracted text.
Here's the XAML code for MainPage.xaml:
<?xml version="1.0" encoding="utf-8" ?>
<ContentPage xmlns="http://schemas.microsoft.com/dotnet/2021/maui"
xmlns:x="http://schemas.microsoft.com/winfx/2009/xaml"
xmlns:d="http://schemas.microsoft.com/dotnet/2021/maui/design"
x:Class="DocumentScanner.MainPage">
<ScrollView>
<VerticalStackLayout Padding="30,0" Spacing="25">
<Image Source="dotnet_bot.png"
HeightRequest="185"
Aspect="AspectFit"
SemanticProperties.Description="dot net bot in a race car number eight" />
<Label Text="Welcome to .NET MAUI Document Scanner SDK"
Style="{StaticResource Headline}"
SemanticProperties.HeadingLevel="Level1" />
<Label Text="Using IronOCR"
Style="{StaticResource SubHeadline}"
SemanticProperties.HeadingLevel="Level2"
SemanticProperties.Description="Welcome to .NET MAUI Document Scanner SDK" />
<!-- Camera preview -->
<Image x:Name="cameraPreview" />
<!-- Capture button -->
<Button Text="Capture" Clicked="OnCaptureClicked" />
<!-- Text display area -->
<Label x:Name="textLabel" Text="Recognized Text:"/>
</VerticalStackLayout>
</ScrollView>
</ContentPage>
<?xml version="1.0" encoding="utf-8" ?>
<ContentPage xmlns="http://schemas.microsoft.com/dotnet/2021/maui"
xmlns:x="http://schemas.microsoft.com/winfx/2009/xaml"
xmlns:d="http://schemas.microsoft.com/dotnet/2021/maui/design"
x:Class="DocumentScanner.MainPage">
<ScrollView>
<VerticalStackLayout Padding="30,0" Spacing="25">
<Image Source="dotnet_bot.png"
HeightRequest="185"
Aspect="AspectFit"
SemanticProperties.Description="dot net bot in a race car number eight" />
<Label Text="Welcome to .NET MAUI Document Scanner SDK"
Style="{StaticResource Headline}"
SemanticProperties.HeadingLevel="Level1" />
<Label Text="Using IronOCR"
Style="{StaticResource SubHeadline}"
SemanticProperties.HeadingLevel="Level2"
SemanticProperties.Description="Welcome to .NET MAUI Document Scanner SDK" />
<!-- Camera preview -->
<Image x:Name="cameraPreview" />
<!-- Capture button -->
<Button Text="Capture" Clicked="OnCaptureClicked" />
<!-- Text display area -->
<Label x:Name="textLabel" Text="Recognized Text:"/>
</VerticalStackLayout>
</ScrollView>
</ContentPage>
In this layout:
- We use a VerticalStackLayout to stack the controls vertically.
- The Image control named cameraPreview is used to display the captured photo.
- The Button control triggers the OnCaptureClicked event handler when clicked.
- The Label control named textLabel is used to display the extracted text.
Output
4. Implementing Document Scanning Functionality
To integrate text extraction functionality into our .NET MAUI Document Scanning app, we will follow these steps:
- Utilize the Camera API: Leverage the camera API provided by .NET MAUI to capture image files directly within your application.
- Pass Image to IronOCR: Once an image is captured, pass it to IronOCR for text extraction, utilizing its robust functionality.
- Display Extracted Text: Display the extracted text in the designated area on your app's user interface for user viewing.
Here's the corresponding code snippet implementing these steps:
using IronOcr;
namespace DocumentScanner
{
public partial class MainPage : ContentPage
{
public MainPage()
{
InitializeComponent();
}
private async void OnCaptureClicked(object sender, EventArgs e)
{
License.LicenseKey = "YOUR-LICENSE-KEY-HERE";
try
{
// Request camera permissions
var status = await Permissions.RequestAsync<Permissions.Camera>();
if (status == PermissionStatus.Granted)
{
// Take photo
var photo = await MediaPicker.CapturePhotoAsync();
if (photo != null)
{
// Display captured photo in Image
cameraPreview.Source = ImageSource.FromStream(() => photo.OpenReadAsync().Result);
using (var stream = await photo.OpenReadAsync())
{
// Use a stream from the captured photo for OCR
var ocr = new IronTesseract();
using var ocrInput = new OcrInput();
ocrInput.LoadImage(stream);
var ocrResult = ocr.Read(ocrInput);
if (string.IsNullOrEmpty(ocrResult.Text))
{
await DisplayAlert("Error", "No Text Detected!", "OK");
}
else
{
await DisplayAlert("Text Detected!", ocrResult.Text, "OK");
// Display extracted text
textLabel.Text = ocrResult.Text;
}
}
}
}
else
{
// Camera permission denied
await DisplayAlert("Permission Denied", "Camera permission is required to capture photos.", "OK");
}
}
catch (Exception ex)
{
// Handle exception
await DisplayAlert("Error", ex.Message, "OK");
}
}
}
}
using IronOcr;
namespace DocumentScanner
{
public partial class MainPage : ContentPage
{
public MainPage()
{
InitializeComponent();
}
private async void OnCaptureClicked(object sender, EventArgs e)
{
License.LicenseKey = "YOUR-LICENSE-KEY-HERE";
try
{
// Request camera permissions
var status = await Permissions.RequestAsync<Permissions.Camera>();
if (status == PermissionStatus.Granted)
{
// Take photo
var photo = await MediaPicker.CapturePhotoAsync();
if (photo != null)
{
// Display captured photo in Image
cameraPreview.Source = ImageSource.FromStream(() => photo.OpenReadAsync().Result);
using (var stream = await photo.OpenReadAsync())
{
// Use a stream from the captured photo for OCR
var ocr = new IronTesseract();
using var ocrInput = new OcrInput();
ocrInput.LoadImage(stream);
var ocrResult = ocr.Read(ocrInput);
if (string.IsNullOrEmpty(ocrResult.Text))
{
await DisplayAlert("Error", "No Text Detected!", "OK");
}
else
{
await DisplayAlert("Text Detected!", ocrResult.Text, "OK");
// Display extracted text
textLabel.Text = ocrResult.Text;
}
}
}
}
else
{
// Camera permission denied
await DisplayAlert("Permission Denied", "Camera permission is required to capture photos.", "OK");
}
}
catch (Exception ex)
{
// Handle exception
await DisplayAlert("Error", ex.Message, "OK");
}
}
}
}
Imports IronOcr
Namespace DocumentScanner
Partial Public Class MainPage
Inherits ContentPage
Public Sub New()
InitializeComponent()
End Sub
Private Async Sub OnCaptureClicked(ByVal sender As Object, ByVal e As EventArgs)
License.LicenseKey = "YOUR-LICENSE-KEY-HERE"
Try
' Request camera permissions
Dim status = Await Permissions.RequestAsync(Of Permissions.Camera)()
If status = PermissionStatus.Granted Then
' Take photo
Dim photo = Await MediaPicker.CapturePhotoAsync()
If photo IsNot Nothing Then
' Display captured photo in Image
cameraPreview.Source = ImageSource.FromStream(Function() photo.OpenReadAsync().Result)
Using stream = Await photo.OpenReadAsync()
' Use a stream from the captured photo for OCR
Dim ocr = New IronTesseract()
Dim ocrInput As New OcrInput()
ocrInput.LoadImage(stream)
Dim ocrResult = ocr.Read(ocrInput)
If String.IsNullOrEmpty(ocrResult.Text) Then
Await DisplayAlert("Error", "No Text Detected!", "OK")
Else
Await DisplayAlert("Text Detected!", ocrResult.Text, "OK")
' Display extracted text
textLabel.Text = ocrResult.Text
End If
End Using
End If
Else
' Camera permission denied
Await DisplayAlert("Permission Denied", "Camera permission is required to capture photos.", "OK")
End If
Catch ex As Exception
' Handle exception
Await DisplayAlert("Error", ex.Message, "OK")
End Try
End Sub
End Class
End Namespace
Code Explanation
Let's break down the code step by step:
- In the MainPage.xaml.cs file, the OnCaptureClicked method is defined to handle the Capture button's click event.
- The IronOCR license key is set up, necessary to use the IronOCR library. Replace
"YOUR-LICENSE-KEY-HERE"
with your actual license key. - Camera permissions are requested using Permissions.RequestAsync
() to ensure that the app can access the device's camera. - MediaPicker.CapturePhotoAsync() is called to take a photo using the camera. If successful, the photo is displayed in the cameraPreview Image control.
- A stream from the captured photo is opened and used as input for IronOCR, creating an IronTesseract instance, loading the image stream into an OcrInput object, and calling the Read method to perform OCR.
- The extracted text is displayed in the textLabel control if successful. If no text is detected, an error message is shown using DisplayAlert.
For further exploration of IronOCR and additional code examples, visit this code examples page.
5. Testing the Document Scanner App
- Run the app on various platforms (Android, iOS, and Windows) to ensure cross-platform compatibility.
- Test different scenarios, such as scanning documents with various fonts, sizes, and orientations.
- Verify that the extracted text is accurate and displayed correctly on the UI.
Output - Scanned Document without Text
Output - Scanned Document with Text
Conclusion
By following this tutorial, you've learned how to use the IronOCR document scanner SDK within .NET MAUI. Document scanning apps have numerous practical applications, from digitizing paper documents to extracting stored information from receipts and invoices. Using the powerful capabilities of IronOCR and the flexibility of .NET MAUI, you can build feature-rich document scanner apps that cater to various use cases. Experiment with different functionalities, explore additional libraries, and continue honing your skills to create even more impressive apps.
For more detailed information on IronOCR capabilities, please visit this documentation page.
IronOCR provides a free trial to test its complete functionality in commercial mode. Its perpetual lite license starts from $749. Download the library from the download page and give it a try.
Frequently Asked Questions
What is the purpose of this tutorial?
The tutorial teaches how to create a document scanner app using .NET MAUI and IronOCR, a C# OCR library.
What is IronOCR?
IronOCR is an Optical Character Recognition (OCR) software developed by Iron Software, LLC, that converts images and scanned documents into editable text.
How do I install the necessary library for OCR in a .NET MAUI project?
You install IronOCR by managing NuGet Packages in Visual Studio, searching for 'IronOCR', and adding it to your project.
What are the prerequisites for creating a document scanner app?
Basic knowledge of C# programming, Visual Studio 2022 with .NET MAUI workload, and IronOCR package installed via NuGet Package Manager are required.
What is the primary function of the OCR library in this tutorial?
IronOCR is used to perform OCR on images captured within the app, extracting text from them.
Can the OCR software handle documents in multiple languages?
Yes, IronOCR supports text recognition in over 125 languages.
What are some key features of the OCR library?
Key features include high accuracy, multi-language support, image preprocessing, support for various file formats, customization options, fast performance, integration with .NET applications, and document classification.
How does .NET MAUI simplify cross-platform app development?
.NET MAUI enables developers to create cross-platform mobile apps with a single codebase, ensuring seamless deployment on Android, iOS, and Windows devices.
What steps are involved in integrating the OCR library into the document scanning app?
Steps include capturing an image using the camera API, passing the image to IronOCR for text extraction, and displaying the extracted text in the app's UI.
What kind of support does the OCR library offer for file formats?
IronOCR supports a wide range of file formats, including TIFF, JPEG, PNG, and PDF.