如何读取多帧/页 GIF 和 TIFF 文件

查克尼特·宾

2023年十月24日

更新 2024年十二月10日

Translated

View the article in English

TIFF（标签图像文件格式）是一种用于高质量图像的流行格式。它支持无损压缩，适用于需要保持原始质量的图像，例如扫描文档或专业摄影。

GIF（图形交换格式）是一种主要用于简单、网络友好的图片和动画的格式。 GIF支持无损压缩和有损压缩。它以能够在单个文件中包含动画而闻名，因此在网站和消息应用程序中经常看到的短循环动画中非常受欢迎。

IronOCR具有读取单帧和多帧/多页GIF和TIFF的能力。只需使用我们的一种方法导入图像文件，该方法将完成其余操作。

开始使用IronOCR

立即在您的项目中开始使用IronOCR，并享受免费试用。

第一步：

如何读取多帧/页 GIF 和 TIFF 文件

下载一个用于读取多帧GIF和TIFF的C#库
利用 OcrImageInput 类导入单帧/多帧的 TIFF
使用 Read 方法进行 OCR
使用相同的类导入GIF图像
通过指定裁剪区域定义读取区域

读取单帧/多帧TIFF示例

要执行OCR，请首先实例化IronTesseract类。使用 'using' 语句创建 OcrImageInput 对象。此构造函数支持单帧和多帧的 TIFF 和 TIF 格式。最后，应用Read方法对导入的TIFF文件执行OCR。

:path=/static-assets/ocr/content-code-examples/how-to/input-tiff-gif-read-tiff.cs

using IronOcr;

// Instantiate IronTesseract
IronTesseract ocrTesseract = new IronTesseract();

// Import TIFF/TIF
using var imageInput = new OcrImageInput("Potter.tiff");
// Perform OCR
OcrResult ocrResult = ocrTesseract.Read(imageInput);

Imports IronOcr

' Instantiate IronTesseract
Private ocrTesseract As New IronTesseract()

' Import TIFF/TIF
Private imageInput = New OcrImageInput("Potter.tiff")
' Perform OCR
Private ocrResult As OcrResult = ocrTesseract.Read(imageInput)

$vbLabelText $csharpLabel

读取 GIF 示例

同样，在构建OcrImageInput类时只需指定GIF文件路径即可。构造函数将处理导入图像所需的所有步骤。

:path=/static-assets/ocr/content-code-examples/how-to/input-tiff-gif-read-gif.cs

using IronOcr;

// Instantiate IronTesseract
IronTesseract ocrTesseract = new IronTesseract();

// Import GIF
using var imageInput = new OcrImageInput("Potter.gif");
// Perform OCR
OcrResult ocrResult = ocrTesseract.Read(imageInput);

Imports IronOcr

' Instantiate IronTesseract
Private ocrTesseract As New IronTesseract()

' Import GIF
Private imageInput = New OcrImageInput("Potter.gif")
' Perform OCR
Private ocrResult As OcrResult = ocrTesseract.Read(imageInput)

$vbLabelText $csharpLabel

指定扫描区域

在构建 OcrImageInput 类时，您可以包含一个 CropRectangle 对象，以便为 OCR 定义图像文档中的特定区域。这可以显著提高性能，尤其是对于大型图像文件。

:path=/static-assets/ocr/content-code-examples/how-to/input-images-read-specific-region.cs

using IronOcr;
using IronSoftware.Drawing;
using System;

// Instantiate IronTesseract
IronTesseract ocrTesseract = new IronTesseract();

// Specify crop region
Rectangle scanRegion = new Rectangle(800, 200, 900, 400);

// Add image
using var imageInput = new OcrImageInput("Potter.tiff", ContentArea: scanRegion);
// Perform OCR
OcrResult ocrResult = ocrTesseract.Read(imageInput);

// Output the result to console
Console.WriteLine(ocrResult.Text);

Imports IronOcr
Imports IronSoftware.Drawing
Imports System

' Instantiate IronTesseract
Private ocrTesseract As New IronTesseract()

' Specify crop region
Private scanRegion As New Rectangle(800, 200, 900, 400)

' Add image
Private imageInput = New OcrImageInput("Potter.tiff", ContentArea:= scanRegion)
' Perform OCR
Private ocrResult As OcrResult = ocrTesseract.Read(imageInput)

' Output the result to console
Console.WriteLine(ocrResult.Text)

$vbLabelText $csharpLabel

OCR结果

查克尼特·宾

立即与工程团队聊天

软件工程师

Chaknith 是开发者中的福尔摩斯。他第一次意识到自己可能在软件工程方面有前途，是在他出于乐趣做代码挑战的时候。他的重点是 IronXL 和 IronBarcode，但他为能帮助客户解决每一款产品的问题而感到自豪。Chaknith 利用他从直接与客户交谈中获得的知识，帮助进一步改进产品。他的轶事反馈不仅仅局限于 Jira 票据，还支持产品开发、文档编写和市场营销，从而提升客户的整体体验。当他不在办公室时，他可能会在学习机器学习、编程或徒步旅行。