How to Use a Document Scanner SDK in a .NET MAUI Application
With the rise of mobile technology, document-scanning apps such as Scanbot SDK, and Native SDKs have become indispensable tools for both individuals and businesses. In this tutorial, we'll explore how to create a document scanner app using the latest version of .NET Multi-platform App UI (MAUI) and IronOCR, a powerful OCR (Optical Character Recognition) library for .NET. .NET MAUI simplifies the creation of cross-platform mobile apps, ensuring seamless deployment on devices such as Android. By the end of this guide, you'll be able to develop your own document scanner SDK app that can extract text from images and scanned files with ease.
How to Use a Document Scanner SDK in a .NET MAUI Application
- Install the IronOCR C# Library to use the Document Scanner SDK.
- Design a .NET MAUI Form with necessary controls.
- Capture a photo using the MediaPicker.CapturePhotoAsync method.
- Convert the captured photo to a Stream.
- Pass the stream to the OcrInput LoadImage method.
- Perform OCR using the IronTesseract Read method.
- Display the document text using the OcrResult Text property.
IronOCR - The C# OCR Library
IronOCR is a cutting-edge Optical Character Recognition (OCR) software developed by Iron Software, LLC, designed to accurately and efficiently convert images and scanned documents into editable text. OCR technology has revolutionized how businesses handle document processing, making it easier to extract valuable information from various sources such as scanned documents, PDFs, and images.
IronOCR stands out among OCR solutions due to its advanced features, robust performance, and ease of integration. Whether you're a developer looking to incorporate OCR features into your applications or a business seeking to streamline document management processes, IronOCR offers a comprehensive solution.
Key Features of IronOCR
- High Accuracy: IronOCR employs state-of-the-art algorithms and machine learning techniques to achieve exceptional accuracy in text recognition. It can accurately extract text from complex documents, including images with low resolution or poor-quality scans.
- Multi-Language Support: IronOCR supports text recognition in over 125 languages, making it suitable for businesses operating in diverse linguistic environments.
- Image Preprocessing: IronOCR provides various image preprocessing capabilities, such as noise reduction, contrast adjustment, and deskewing, to enhance accuracy. These techniques improve OCR results, especially with distorted or imperfect images.
- Support for Various File Formats: IronOCR supports a wide range of file formats, including TIFF, JPEG, PNG, and PDF, ensuring compatibility with different document sources.
- Customization Options: Developers can customize IronOCR's behavior to meet specific requirements, offering flexibility in recognition parameters and workflow integration.
- Fast and Scalable: Optimized for performance, IronOCR rapidly extracts text from large volumes of documents. Its scalable architecture ensures seamless operation, regardless of document volume.
- Integration with .NET Applications: IronOCR integrates seamlessly with .NET applications, providing an easy-to-use API for incorporating OCR functionality. This simplifies development and speeds time-to-market for OCR-enabled applications.
- Document Classification and Data Extraction: Beyond basic text recognition, IronOCR offers advanced features for document classification and data extraction, identifying specific data fields like names, addresses, or invoice numbers.
Prerequisites
- Basic knowledge of C# programming.
- Visual Studio 2022 installed with the .NET MAUI workload.
- IronOCR package library installed via NuGet Package Manager.
1. Setting Up Your .NET MAUI Project
- Open Visual Studio 2022 and create a new .NET MAUI App project.
- Choose a suitable project name and configure your project settings.
- Ensure you have the necessary Android and iOS SDKs installed for target-platform device development.
2. Installing IronOCR Library
- Right-click on your Solution in Visual Studio.
- Select "Manage NuGet Packages for Solution" and in the Browse tab, search for "IronOCR".
- Install the IronOCR library to your project.
3. Designing the UI
Let's start by designing the layout of our MainPage.xaml. We'll create a simple layout with an image control to display the captured photo, a Capture button to take photos, and a Label to display the extracted text.
Here's the XAML code for MainPage.xaml:
<?xml version="1.0" encoding="utf-8" ?>
<ContentPage xmlns="http://schemas.microsoft.com/dotnet/2021/maui"
xmlns:x="http://schemas.microsoft.com/winfx/2009/xaml"
xmlns:d="http://schemas.microsoft.com/dotnet/2021/maui/design"
x:Class="DocumentScanner.MainPage">
<ScrollView>
<VerticalStackLayout Padding="30,0" Spacing="25">
<Image Source="dotnet_bot.png"
HeightRequest="185"
Aspect="AspectFit"
SemanticProperties.Description="dot net bot in a race car number eight" />
<Label Text="Welcome to .NET MAUI Document Scanner SDK"
Style="{StaticResource Headline}"
SemanticProperties.HeadingLevel="Level1" />
<Label Text="Using IronOCR"
Style="{StaticResource SubHeadline}"
SemanticProperties.HeadingLevel="Level2"
SemanticProperties.Description="Welcome to .NET MAUI Document Scanner SDK" />
<!-- Camera preview -->
<Image x:Name="cameraPreview" />
<!-- Capture button -->
<Button Text="Capture" Clicked="OnCaptureClicked" />
<!-- Text display area -->
<Label x:Name="textLabel" Text="Recognized Text:"/>
</VerticalStackLayout>
</ScrollView>
</ContentPage>
<?xml version="1.0" encoding="utf-8" ?>
<ContentPage xmlns="http://schemas.microsoft.com/dotnet/2021/maui"
xmlns:x="http://schemas.microsoft.com/winfx/2009/xaml"
xmlns:d="http://schemas.microsoft.com/dotnet/2021/maui/design"
x:Class="DocumentScanner.MainPage">
<ScrollView>
<VerticalStackLayout Padding="30,0" Spacing="25">
<Image Source="dotnet_bot.png"
HeightRequest="185"
Aspect="AspectFit"
SemanticProperties.Description="dot net bot in a race car number eight" />
<Label Text="Welcome to .NET MAUI Document Scanner SDK"
Style="{StaticResource Headline}"
SemanticProperties.HeadingLevel="Level1" />
<Label Text="Using IronOCR"
Style="{StaticResource SubHeadline}"
SemanticProperties.HeadingLevel="Level2"
SemanticProperties.Description="Welcome to .NET MAUI Document Scanner SDK" />
<!-- Camera preview -->
<Image x:Name="cameraPreview" />
<!-- Capture button -->
<Button Text="Capture" Clicked="OnCaptureClicked" />
<!-- Text display area -->
<Label x:Name="textLabel" Text="Recognized Text:"/>
</VerticalStackLayout>
</ScrollView>
</ContentPage>
In this layout:
- We use a VerticalStackLayout to stack the controls vertically.
- The Image control named cameraPreview is used to display the captured photo.
- The Button control triggers the OnCaptureClicked event handler when clicked.
- The Label control named textLabel is used to display the extracted text.
Output
4. Implementing Document Scanning Functionality
To integrate text extraction functionality into our .NET MAUI Document Scanning app, we will follow these steps:
- Utilize the Camera API: Leverage the camera API provided by .NET MAUI to capture image files directly within your application.
- Pass Image to IronOCR: Once an image is captured, pass it to IronOCR for text extraction, utilizing its robust functionality.
- Display Extracted Text: Display the extracted text in the designated area on your app's user interface for user viewing.
Here's the corresponding code snippet implementing these steps:
using IronOcr;
namespace DocumentScanner
{
public partial class MainPage : ContentPage
{
public MainPage()
{
InitializeComponent();
}
private async void OnCaptureClicked(object sender, EventArgs e)
{
License.LicenseKey = "YOUR-LICENSE-KEY-HERE";
try
{
// Request camera permissions
var status = await Permissions.RequestAsync<Permissions.Camera>();
if (status == PermissionStatus.Granted)
{
// Take photo
var photo = await MediaPicker.CapturePhotoAsync();
if (photo != null)
{
// Display captured photo in Image
cameraPreview.Source = ImageSource.FromStream(() => photo.OpenReadAsync().Result);
using (var stream = await photo.OpenReadAsync())
{
// Use a stream from the captured photo for OCR
var ocr = new IronTesseract();
using var ocrInput = new OcrInput();
ocrInput.LoadImage(stream);
var ocrResult = ocr.Read(ocrInput);
if (string.IsNullOrEmpty(ocrResult.Text))
{
await DisplayAlert("Error", "No Text Detected!", "OK");
}
else
{
await DisplayAlert("Text Detected!", ocrResult.Text, "OK");
// Display extracted text
textLabel.Text = ocrResult.Text;
}
}
}
}
else
{
// Camera permission denied
await DisplayAlert("Permission Denied", "Camera permission is required to capture photos.", "OK");
}
}
catch (Exception ex)
{
// Handle exception
await DisplayAlert("Error", ex.Message, "OK");
}
}
}
}
using IronOcr;
namespace DocumentScanner
{
public partial class MainPage : ContentPage
{
public MainPage()
{
InitializeComponent();
}
private async void OnCaptureClicked(object sender, EventArgs e)
{
License.LicenseKey = "YOUR-LICENSE-KEY-HERE";
try
{
// Request camera permissions
var status = await Permissions.RequestAsync<Permissions.Camera>();
if (status == PermissionStatus.Granted)
{
// Take photo
var photo = await MediaPicker.CapturePhotoAsync();
if (photo != null)
{
// Display captured photo in Image
cameraPreview.Source = ImageSource.FromStream(() => photo.OpenReadAsync().Result);
using (var stream = await photo.OpenReadAsync())
{
// Use a stream from the captured photo for OCR
var ocr = new IronTesseract();
using var ocrInput = new OcrInput();
ocrInput.LoadImage(stream);
var ocrResult = ocr.Read(ocrInput);
if (string.IsNullOrEmpty(ocrResult.Text))
{
await DisplayAlert("Error", "No Text Detected!", "OK");
}
else
{
await DisplayAlert("Text Detected!", ocrResult.Text, "OK");
// Display extracted text
textLabel.Text = ocrResult.Text;
}
}
}
}
else
{
// Camera permission denied
await DisplayAlert("Permission Denied", "Camera permission is required to capture photos.", "OK");
}
}
catch (Exception ex)
{
// Handle exception
await DisplayAlert("Error", ex.Message, "OK");
}
}
}
}
Imports IronOcr
Namespace DocumentScanner
Partial Public Class MainPage
Inherits ContentPage
Public Sub New()
InitializeComponent()
End Sub
Private Async Sub OnCaptureClicked(ByVal sender As Object, ByVal e As EventArgs)
License.LicenseKey = "YOUR-LICENSE-KEY-HERE"
Try
' Request camera permissions
Dim status = Await Permissions.RequestAsync(Of Permissions.Camera)()
If status = PermissionStatus.Granted Then
' Take photo
Dim photo = Await MediaPicker.CapturePhotoAsync()
If photo IsNot Nothing Then
' Display captured photo in Image
cameraPreview.Source = ImageSource.FromStream(Function() photo.OpenReadAsync().Result)
Using stream = Await photo.OpenReadAsync()
' Use a stream from the captured photo for OCR
Dim ocr = New IronTesseract()
Dim ocrInput As New OcrInput()
ocrInput.LoadImage(stream)
Dim ocrResult = ocr.Read(ocrInput)
If String.IsNullOrEmpty(ocrResult.Text) Then
Await DisplayAlert("Error", "No Text Detected!", "OK")
Else
Await DisplayAlert("Text Detected!", ocrResult.Text, "OK")
' Display extracted text
textLabel.Text = ocrResult.Text
End If
End Using
End If
Else
' Camera permission denied
Await DisplayAlert("Permission Denied", "Camera permission is required to capture photos.", "OK")
End If
Catch ex As Exception
' Handle exception
Await DisplayAlert("Error", ex.Message, "OK")
End Try
End Sub
End Class
End Namespace
Code Explanation
Let's break down the code step by step:
- In the MainPage.xaml.cs file, the OnCaptureClicked method is defined to handle the Capture button's click event.
- The IronOCR license key is set up, necessary to use the IronOCR library. Replace
"YOUR-LICENSE-KEY-HERE"
with your actual license key. - Camera permissions are requested using Permissions.RequestAsync
() to ensure that the app can access the device's camera. - MediaPicker.CapturePhotoAsync() is called to take a photo using the camera. If successful, the photo is displayed in the cameraPreview Image control.
- A stream from the captured photo is opened and used as input for IronOCR, creating an IronTesseract instance, loading the image stream into an OcrInput object, and calling the Read method to perform OCR.
- The extracted text is displayed in the textLabel control if successful. If no text is detected, an error message is shown using DisplayAlert.
For further exploration of IronOCR and additional code examples, visit this code examples page.
5. Testing the Document Scanner App
- Run the app on various platforms (Android, iOS, and Windows) to ensure cross-platform compatibility.
- Test different scenarios, such as scanning documents with various fonts, sizes, and orientations.
- Verify that the extracted text is accurate and displayed correctly on the UI.
Output - Scanned Document without Text
Output - Scanned Document with Text
Conclusion
By following this tutorial, you've learned how to use the IronOCR document scanner SDK within .NET MAUI. Document scanning apps have numerous practical applications, from digitizing paper documents to extracting stored information from receipts and invoices. Using the powerful capabilities of IronOCR and the flexibility of .NET MAUI, you can build feature-rich document scanner apps that cater to various use cases. Experiment with different functionalities, explore additional libraries, and continue honing your skills to create even more impressive apps.
For more detailed information on IronOCR capabilities, please visit this documentation page.
IronOCR provides a free trial to test its complete functionality in commercial mode. Its perpetual lite license starts from $749. Download the library from the download page and give it a try.
Frequently Asked Questions
How can I create a document scanner app using .NET MAUI?
You can create a document scanner app using .NET MAUI by leveraging IronOCR for Optical Character Recognition. Begin by installing IronOCR via the NuGet Package Manager in Visual Studio, then use .NET MAUI to design your app's UI, and implement the scanning functionality using IronTesseract's Read method.
What are the benefits of using IronOCR for a document scanner app?
IronOCR provides high accuracy in text recognition, multi-language support, and compatibility with various file formats. It also offers image preprocessing, fast performance, and seamless integration with .NET applications, making it a robust choice for a document scanner app.
How do I install IronOCR in a .NET MAUI project?
To install IronOCR in a .NET MAUI project, open Visual Studio and use the NuGet Package Manager to search for 'IronOCR'. Add the package to your project to start using its OCR functionalities.
What steps are involved in capturing and processing images in a document scanner app?
The process involves using the MediaPicker to capture images, converting them to a stream format, and then using IronOCR's IronTesseract to perform text extraction. The extracted text can be displayed in the app's user interface.
What file formats are supported by IronOCR for OCR processing?
IronOCR supports a wide range of file formats including TIFF, JPEG, PNG, and PDF, allowing for versatile document scanning and text extraction capabilities.
Can IronOCR support OCR in multiple languages?
Yes, IronOCR supports OCR in over 125 languages, making it suitable for applications requiring text recognition in various linguistic contexts.
How does .NET MAUI facilitate cross-platform development?
.NET MAUI enables developers to build cross-platform mobile applications with a single codebase, allowing for seamless deployment across Android, iOS, and Windows devices.
What are the prerequisites for developing a document scanner app with .NET MAUI?
The prerequisites include basic knowledge of C# programming, Visual Studio 2022 with the .NET MAUI workload, and the IronOCR library installed from NuGet.
How can I test the compatibility of my document scanner app across platforms?
You can test your document scanner app across platforms by deploying it on Android, iOS, and Windows devices to ensure functionality and accuracy in text extraction, leveraging .NET MAUI's cross-platform capabilities.