Saltar al pie de página
USO DE IRONBARCODE

Cómo Usar un SDK de Escáner de Documentos en una Aplicación .NET MAUI

With the rise of mobile technology, document-scanning apps such as Scanbot SDK, and Native SDKs have become indispensable tools for both individuals and businesses. In this tutorial, we'll explore how to create a document scanner app using the latest version of .NET Multi-platform App UI (MAUI) and IronOCR, a powerful OCR (Optical Character Recognition) library for .NET. .NET MAUI simplifies the creation of cross-platform mobile apps, ensuring seamless deployment on devices such as Android. By the end of this guide, you'll be able to develop your own document scanner SDK app that can extract text from images and scanned files with ease.

How to Use a Document Scanner SDK in a .NET MAUI Application

  1. Install the IronOCR C# Library to use the Document Scanner SDK.
  2. Design a .NET MAUI Form with necessary controls.
  3. Capture a photo using the MediaPicker.CapturePhotoAsync method.
  4. Convert the captured photo to a Stream.
  5. Pass the stream to the OcrInput LoadImage method.
  6. Perform OCR using the IronTesseract Read method.
  7. Display the document text using the OcrResult Text property.

IronOCR - The C# OCR Library

IronOCR is a cutting-edge Optical Character Recognition (OCR) software developed by Iron Software, LLC, designed to accurately and efficiently convert images and scanned documents into editable text. OCR technology has revolutionized how businesses handle document processing, making it easier to extract valuable information from various sources such as scanned documents, PDFs, and images.

IronOCR stands out among OCR solutions due to its advanced features, robust performance, and ease of integration. Whether you're a developer looking to incorporate OCR features into your applications or a business seeking to streamline document management processes, IronOCR offers a comprehensive solution.

Key Features of IronOCR

  1. High Accuracy: IronOCR employs state-of-the-art algorithms and machine learning techniques to achieve exceptional accuracy in text recognition. It can accurately extract text from complex documents, including images with low resolution or poor-quality scans.
  2. Multi-Language Support: IronOCR supports text recognition in over 125 languages, making it suitable for businesses operating in diverse linguistic environments.
  3. Image Preprocessing: IronOCR provides various image preprocessing capabilities, such as noise reduction, contrast adjustment, and deskewing, to enhance accuracy. These techniques improve OCR results, especially with distorted or imperfect images.
  4. Support for Various File Formats: IronOCR supports a wide range of file formats, including TIFF, JPEG, PNG, and PDF, ensuring compatibility with different document sources.
  5. Customization Options: Developers can customize IronOCR's behavior to meet specific requirements, offering flexibility in recognition parameters and workflow integration.
  6. Fast and Scalable: Optimized for performance, IronOCR rapidly extracts text from large volumes of documents. Its scalable architecture ensures seamless operation, regardless of document volume.
  7. Integration with .NET Applications: IronOCR integrates seamlessly with .NET applications, providing an easy-to-use API for incorporating OCR functionality. This simplifies development and speeds time-to-market for OCR-enabled applications.
  8. Document Classification and Data Extraction: Beyond basic text recognition, IronOCR offers advanced features for document classification and data extraction, identifying specific data fields like names, addresses, or invoice numbers.

Prerequisites

  • Basic knowledge of C# programming.
  • Visual Studio 2022 installed with the .NET MAUI workload.
  • IronOCR package library installed via NuGet Package Manager.

1. Setting Up Your .NET MAUI Project

  • Open Visual Studio 2022 and create a new .NET MAUI App project.

How to Use a Document Scanner SDK in a .NET MAUI Application: Figure 1 - .NET MAUI App Project

  • Choose a suitable project name and configure your project settings.

How to Use a Document Scanner SDK in a .NET MAUI Application: Figure 2 - Project Configuration

  • Ensure you have the necessary Android and iOS SDKs installed for target-platform device development.

2. Installing IronOCR Library

  • Right-click on your Solution in Visual Studio.
    • Select "Manage NuGet Packages for Solution" and in the Browse tab, search for "IronOCR".

How to Use a Document Scanner SDK in a .NET MAUI Application: Figure 3 - IronOCR NuGet Package

  • Install the IronOCR library to your project.

3. Designing the UI

Let's start by designing the layout of our MainPage.xaml. We'll create a simple layout with an image control to display the captured photo, a Capture button to take photos, and a Label to display the extracted text.

Here's the XAML code for MainPage.xaml:

<?xml version="1.0" encoding="utf-8" ?>
<ContentPage xmlns="http://schemas.microsoft.com/dotnet/2021/maui"
             xmlns:x="http://schemas.microsoft.com/winfx/2009/xaml"
             xmlns:d="http://schemas.microsoft.com/dotnet/2021/maui/design"
             x:Class="DocumentScanner.MainPage">
    <ScrollView>
        <VerticalStackLayout Padding="30,0" Spacing="25">
            <Image Source="dotnet_bot.png"
                   HeightRequest="185"
                   Aspect="AspectFit"
                   SemanticProperties.Description="dot net bot in a race car number eight" />
            <Label Text="Welcome to .NET MAUI Document Scanner SDK"
                   Style="{StaticResource Headline}"
                   SemanticProperties.HeadingLevel="Level1" />
            <Label Text="Using IronOCR"
                   Style="{StaticResource SubHeadline}"
                   SemanticProperties.HeadingLevel="Level2"
                   SemanticProperties.Description="Welcome to .NET MAUI Document Scanner SDK" />
            <!-- Camera preview -->
            <Image x:Name="cameraPreview" />
            <!-- Capture button -->
            <Button Text="Capture" Clicked="OnCaptureClicked" />
            <!-- Text display area -->
            <Label x:Name="textLabel" Text="Recognized Text:"/>
        </VerticalStackLayout>
    </ScrollView>
</ContentPage>
<?xml version="1.0" encoding="utf-8" ?>
<ContentPage xmlns="http://schemas.microsoft.com/dotnet/2021/maui"
             xmlns:x="http://schemas.microsoft.com/winfx/2009/xaml"
             xmlns:d="http://schemas.microsoft.com/dotnet/2021/maui/design"
             x:Class="DocumentScanner.MainPage">
    <ScrollView>
        <VerticalStackLayout Padding="30,0" Spacing="25">
            <Image Source="dotnet_bot.png"
                   HeightRequest="185"
                   Aspect="AspectFit"
                   SemanticProperties.Description="dot net bot in a race car number eight" />
            <Label Text="Welcome to .NET MAUI Document Scanner SDK"
                   Style="{StaticResource Headline}"
                   SemanticProperties.HeadingLevel="Level1" />
            <Label Text="Using IronOCR"
                   Style="{StaticResource SubHeadline}"
                   SemanticProperties.HeadingLevel="Level2"
                   SemanticProperties.Description="Welcome to .NET MAUI Document Scanner SDK" />
            <!-- Camera preview -->
            <Image x:Name="cameraPreview" />
            <!-- Capture button -->
            <Button Text="Capture" Clicked="OnCaptureClicked" />
            <!-- Text display area -->
            <Label x:Name="textLabel" Text="Recognized Text:"/>
        </VerticalStackLayout>
    </ScrollView>
</ContentPage>
XML

In this layout:

  • We use a VerticalStackLayout to stack the controls vertically.
  • The Image control named cameraPreview is used to display the captured photo.
  • The Button control triggers the OnCaptureClicked event handler when clicked.
  • The Label control named textLabel is used to display the extracted text.

Output

How to Use a Document Scanner SDK in a .NET MAUI Application: Figure 4 - MainPage.xaml Output

4. Implementing Document Scanning Functionality

To integrate text extraction functionality into our .NET MAUI Document Scanning app, we will follow these steps:

  1. Utilize the Camera API: Leverage the camera API provided by .NET MAUI to capture image files directly within your application.
  2. Pass Image to IronOCR: Once an image is captured, pass it to IronOCR for text extraction, utilizing its robust functionality.
  3. Display Extracted Text: Display the extracted text in the designated area on your app's user interface for user viewing.

Here's the corresponding code snippet implementing these steps:

using IronOcr;

namespace DocumentScanner
{
    public partial class MainPage : ContentPage
    {
        public MainPage()
        {
            InitializeComponent();
        }

        private async void OnCaptureClicked(object sender, EventArgs e)
        {
            License.LicenseKey = "YOUR-LICENSE-KEY-HERE";
            try
            {
                // Request camera permissions
                var status = await Permissions.RequestAsync<Permissions.Camera>();
                if (status == PermissionStatus.Granted)
                {
                    // Take photo
                    var photo = await MediaPicker.CapturePhotoAsync();
                    if (photo != null)
                    {
                        // Display captured photo in Image
                        cameraPreview.Source = ImageSource.FromStream(() => photo.OpenReadAsync().Result);
                        using (var stream = await photo.OpenReadAsync())
                        {
                            // Use a stream from the captured photo for OCR
                            var ocr = new IronTesseract();
                            using var ocrInput = new OcrInput();
                            ocrInput.LoadImage(stream);
                            var ocrResult = ocr.Read(ocrInput);
                            if (string.IsNullOrEmpty(ocrResult.Text))
                            {
                                await DisplayAlert("Error", "No Text Detected!", "OK");
                            }
                            else
                            {
                                await DisplayAlert("Text Detected!", ocrResult.Text, "OK");
                                // Display extracted text
                                textLabel.Text = ocrResult.Text;
                            }
                        }
                    }
                }
                else
                {
                    // Camera permission denied
                    await DisplayAlert("Permission Denied", "Camera permission is required to capture photos.", "OK");
                }
            }
            catch (Exception ex)
            {
                // Handle exception
                await DisplayAlert("Error", ex.Message, "OK");
            }
        }
    }
}
using IronOcr;

namespace DocumentScanner
{
    public partial class MainPage : ContentPage
    {
        public MainPage()
        {
            InitializeComponent();
        }

        private async void OnCaptureClicked(object sender, EventArgs e)
        {
            License.LicenseKey = "YOUR-LICENSE-KEY-HERE";
            try
            {
                // Request camera permissions
                var status = await Permissions.RequestAsync<Permissions.Camera>();
                if (status == PermissionStatus.Granted)
                {
                    // Take photo
                    var photo = await MediaPicker.CapturePhotoAsync();
                    if (photo != null)
                    {
                        // Display captured photo in Image
                        cameraPreview.Source = ImageSource.FromStream(() => photo.OpenReadAsync().Result);
                        using (var stream = await photo.OpenReadAsync())
                        {
                            // Use a stream from the captured photo for OCR
                            var ocr = new IronTesseract();
                            using var ocrInput = new OcrInput();
                            ocrInput.LoadImage(stream);
                            var ocrResult = ocr.Read(ocrInput);
                            if (string.IsNullOrEmpty(ocrResult.Text))
                            {
                                await DisplayAlert("Error", "No Text Detected!", "OK");
                            }
                            else
                            {
                                await DisplayAlert("Text Detected!", ocrResult.Text, "OK");
                                // Display extracted text
                                textLabel.Text = ocrResult.Text;
                            }
                        }
                    }
                }
                else
                {
                    // Camera permission denied
                    await DisplayAlert("Permission Denied", "Camera permission is required to capture photos.", "OK");
                }
            }
            catch (Exception ex)
            {
                // Handle exception
                await DisplayAlert("Error", ex.Message, "OK");
            }
        }
    }
}
Imports IronOcr

Namespace DocumentScanner
	Partial Public Class MainPage
		Inherits ContentPage

		Public Sub New()
			InitializeComponent()
		End Sub

		Private Async Sub OnCaptureClicked(ByVal sender As Object, ByVal e As EventArgs)
			License.LicenseKey = "YOUR-LICENSE-KEY-HERE"
			Try
				' Request camera permissions
				Dim status = Await Permissions.RequestAsync(Of Permissions.Camera)()
				If status = PermissionStatus.Granted Then
					' Take photo
					Dim photo = Await MediaPicker.CapturePhotoAsync()
					If photo IsNot Nothing Then
						' Display captured photo in Image
						cameraPreview.Source = ImageSource.FromStream(Function() photo.OpenReadAsync().Result)
						Using stream = Await photo.OpenReadAsync()
							' Use a stream from the captured photo for OCR
							Dim ocr = New IronTesseract()
							Dim ocrInput As New OcrInput()
							ocrInput.LoadImage(stream)
							Dim ocrResult = ocr.Read(ocrInput)
							If String.IsNullOrEmpty(ocrResult.Text) Then
								Await DisplayAlert("Error", "No Text Detected!", "OK")
							Else
								Await DisplayAlert("Text Detected!", ocrResult.Text, "OK")
								' Display extracted text
								textLabel.Text = ocrResult.Text
							End If
						End Using
					End If
				Else
					' Camera permission denied
					Await DisplayAlert("Permission Denied", "Camera permission is required to capture photos.", "OK")
				End If
			Catch ex As Exception
				' Handle exception
				Await DisplayAlert("Error", ex.Message, "OK")
			End Try
		End Sub
	End Class
End Namespace
$vbLabelText   $csharpLabel

Code Explanation

Let's break down the code step by step:

  • In the MainPage.xaml.cs file, the OnCaptureClicked method is defined to handle the Capture button's click event.
  • The IronOCR license key is set up, necessary to use the IronOCR library. Replace "YOUR-LICENSE-KEY-HERE" with your actual license key.
  • Camera permissions are requested using Permissions.RequestAsync() to ensure that the app can access the device's camera.
  • MediaPicker.CapturePhotoAsync() is called to take a photo using the camera. If successful, the photo is displayed in the cameraPreview Image control.
  • A stream from the captured photo is opened and used as input for IronOCR, creating an IronTesseract instance, loading the image stream into an OcrInput object, and calling the Read method to perform OCR.
  • The extracted text is displayed in the textLabel control if successful. If no text is detected, an error message is shown using DisplayAlert.

For further exploration of IronOCR and additional code examples, visit this code examples page.

5. Testing the Document Scanner App

  • Run the app on various platforms (Android, iOS, and Windows) to ensure cross-platform compatibility.
  • Test different scenarios, such as scanning documents with various fonts, sizes, and orientations.
  • Verify that the extracted text is accurate and displayed correctly on the UI.

Output - Scanned Document without Text

How to Use a Document Scanner SDK in a .NET MAUI Application: Figure 5 - Scanned PDF Creation Output

Output - Scanned Document with Text

How to Use a Document Scanner SDK in a .NET MAUI Application: Figure 6 - Scanned Documentation

Conclusion

By following this tutorial, you've learned how to use the IronOCR document scanner SDK within .NET MAUI. Document scanning apps have numerous practical applications, from digitizing paper documents to extracting stored information from receipts and invoices. Using the powerful capabilities of IronOCR and the flexibility of .NET MAUI, you can build feature-rich document scanner apps that cater to various use cases. Experiment with different functionalities, explore additional libraries, and continue honing your skills to create even more impressive apps.

For more detailed information on IronOCR capabilities, please visit this documentation page.

IronOCR provides a free trial to test its complete functionality in commercial mode. Its perpetual lite license starts from $799. Download the library from the download page and give it a try.

Preguntas Frecuentes

¿Cómo puedo crear una aplicación de escáner de documentos usando .NET MAUI?

Puedes crear una aplicación de escáner de documentos usando .NET MAUI aprovechando IronOCR para el Reconocimiento Óptico de Caracteres. Comienza instalando IronOCR a través del Gestor de Paquetes NuGet en Visual Studio, luego usa .NET MAUI para diseñar la interfaz de usuario de tu aplicación e implementa la funcionalidad de escaneo usando el método Read de IronTesseract.

¿Cuáles son los beneficios de usar IronOCR para una aplicación de escáner de documentos?

IronOCR ofrece alta precisión en el reconocimiento de texto, soporte multilingüe y compatibilidad con varios formatos de archivos. También ofrece preprocesamiento de imágenes, rendimiento rápido e integración perfecta con aplicaciones .NET, lo que lo convierte en una opción robusta para una aplicación de escáner de documentos.

¿Cómo instalo IronOCR en un proyecto .NET MAUI?

Para instalar IronOCR en un proyecto .NET MAUI, abre Visual Studio y usa el Gestor de Paquetes NuGet para buscar 'IronOCR'. Añade el paquete a tu proyecto para comenzar a usar sus funcionalidades de OCR.

¿Qué pasos están involucrados en la captura y procesamiento de imágenes en una aplicación de escáner de documentos?

El proceso involucra usar el MediaPicker para capturar imágenes, convertirlas a un formato de flujo y luego usar el IronTesseract de IronOCR para realizar la extracción de texto. El texto extraído puede mostrarse en la interfaz de usuario de la aplicación.

¿Qué formatos de archivo son compatibles con IronOCR para el procesamiento de OCR?

IronOCR es compatible con una amplia gama de formatos de archivo, incluidos TIFF, JPEG, PNG y PDF, lo que permite capacidades versátiles de escaneo de documentos y extracción de texto.

¿Puede IronOCR soportar OCR en múltiples idiomas?

Sí, IronOCR soporta OCR en más de 125 idiomas, lo que lo hace adecuado para aplicaciones que requieren reconocimiento de texto en diversos contextos lingüísticos.

¿Cómo facilita .NET MAUI el desarrollo multiplataforma?

.NET MAUI permite a los desarrolladores construir aplicaciones móviles multiplataforma con una única base de código, permitiendo un despliegue sin problemas en dispositivos Android, iOS y Windows.

¿Cuáles son los requisitos previos para desarrollar una aplicación de escáner de documentos con .NET MAUI?

Los requisitos previos incluyen conocimiento básico de programación en C#, Visual Studio 2022 con la carga de trabajo .NET MAUI y la biblioteca IronOCR instalada desde NuGet.

¿Cómo puedo probar la compatibilidad de mi aplicación de escáner de documentos en todas las plataformas?

Puedes probar tu aplicación de escáner de documentos en todas las plataformas al desplegarla en dispositivos Android, iOS y Windows para asegurar la funcionalidad y precisión en la extracción de texto, aprovechando las capacidades multiplataforma de .NET MAUI.

Jordi Bardia
Ingeniero de Software
Jordi es más competente en Python, C# y C++. Cuando no está aprovechando sus habilidades en Iron Software, está programando juegos. Compartiendo responsabilidades para pruebas de productos, desarrollo de productos e investigación, Jordi agrega un valor inmenso a la mejora continua del producto. La experiencia variada lo mantiene ...
Leer más