跳過到頁腳內容
使用 IRONWORD

如何使用 C# 閱讀 Word 文件

In today's era, Microsoft Word documents are synonymous with office work and professional and personal communication. Hence, programmatically manipulating Word documents is critical for developers looking to automate tasks to display to users in their applications. Although there are many available libraries, not all are as robust as the others. However, one such contender that stands out from the rest is IronWord. IronWord is a reliable and robust C# Word DOCX library that is easy to use and understand, and simplifies working with Word documents.

This article will explore how we can quickly utilize IronWord to read Word documents with short examples.

How To Read a Word File Using C#

  1. Create a Console Application in Visual Studio

  2. Install IronWord C# DOCX library

  3. Create a new Word document using the WordDocument class

  4. Add text to the Word document

  5. Loop through each paragraph using the Paragraph class

  6. Display the contents

IronWord: The C# DOCX Library

IronWord: The C# DOCX Library

IronWord is a flexible and scalable library with a straightforward and easy-to-use API that eliminates the hassle of integrating Word documents into applications. Whether you want to add and integrate a simple text word document to your application or create complex tables and reports to display to users, IronWord has everything covered.

Here are its most notable features:

  1. Document Manipulation: IronWord allows developers to manipulate Word documents with ease. Whether users want to insert paragraphs of text or more complex structures such as tables and images, IronWord can do it all.

  2. Cross-Platform Support and Compatibility: IronWord is designed to be flexible and supports developers on multiple platforms. It supports various .NET Core (8, 7, 6, 5, and 3.1+), .NET Standard (2.0+), .NET Framework (4.6.2+), and even Azure. Furthermore, developers can utilize IronWord on different platforms and systems, including but not limited to Windows, Linux, macOS, and Android. It covers the most common platforms and allows developers to build cross-platform applications quickly.

  3. Independence from Microsoft Office: One common trouble when integrating Word documents in .NET applications is that popular libraries such as Interop require a licensed installation of Microsoft Office. However, IronWord alleviates that problem by being independent of that restriction. Developers can fully leverage the power of Word documents without being restricted by the license and installation of Microsoft Word.

  4. Formatting Options: IronWord offers extensive support for formatting and styling, allowing developers to make the document unique. Developers can apply fonts to their text, colors, alignments, and other complex formatting, such as table styles.

  5. Ease of Use and Extensive Support: Along with an easy-to-understand API and straightforward method calls, IronWord also provides extension API references and code examples to help developers determine the most optimal way to utilize IronWord.

Creating a New Console Project in Visual Studio

Before we dive into the example, let's start by creating a blank console project in Visual Studio.

Click console app when creating a new project

We then provide a name and a save location for the project.

Specify the name and location path for the project.

Next, select the .NET framework that you will be using. In this example, I will be using .NET 8.0.

Select the desired .NET framework and click on Create.

After creating and setting up the new console project, let's install our C# word library, IronWord.

Installing IronWord

There are two ways to install IronWord.

1. Installing via NuGet Package Manager

To install it through the NuGet Package Manager, click on tools and then Manage NuGet Packages for Solution. We then search for IronWord in the search bar and install **IronWord**.

Install IronWord using the Manage NuGet Package for Solution by searching IronWord in the search bar of NuGet Package Manager, then select the project and click on the Install button.

2. Installing via NuGet Package Manager Console

An alternative method is to install it via the NuGet Package Manager Console. To do so, run the following command in the console:

Install-Package IronWord

License Key

Please note that IronWord doesn't work without a licensing key; you can obtain one here as part of a free trial.

After obtaining a trial key, ensure this variable is set in your project.

// Replace the license key variable with the trial key you obtained
IronWord.License.LicenseKey = "REPLACE-WITH-YOUR-KEY";
// Replace the license key variable with the trial key you obtained
IronWord.License.LicenseKey = "REPLACE-WITH-YOUR-KEY";
' Replace the license key variable with the trial key you obtained
IronWord.License.LicenseKey = "REPLACE-WITH-YOUR-KEY"
$vbLabelText   $csharpLabel

Reading Word Documents in C#

After installing the prerequisites, we will demonstrate the simplicity of reading a Microsoft Word document using IronWord in the following code.

First, we import the necessary namespaces. Using IronWord, we create a new document and add sample text. We then access the paragraphs and text using the WordDocument object to print out the text within the Word document.

using IronWord;
using IronWord.Models;

#region Licensing
// Set the IronWord license key before using the IronWord functionalities
IronWord.License.LicenseKey = "YOUR-KEY-HERE";
#endregion

// Create text run with sample text
Text textRunExample = new Text("Sample text");

// Create a paragraph and add the text run to it
Paragraph paragraphExample = new Paragraph();
paragraphExample.AddChild(textRunExample);

// Create a new Word document with the paragraph
WordDocument doc = new WordDocument(paragraphExample);

// Export the document as a DOCX file
doc.SaveAs("document.docx");

// Access paragraphs and text runs within the document
foreach (Paragraph paragraph in doc.Paragraphs)
{
    foreach (Text textRun in paragraph.Texts)
    {
        // Access text content
        string content = textRun.Text;
        // Display the content to the console
        Console.WriteLine(content);
    }
}
using IronWord;
using IronWord.Models;

#region Licensing
// Set the IronWord license key before using the IronWord functionalities
IronWord.License.LicenseKey = "YOUR-KEY-HERE";
#endregion

// Create text run with sample text
Text textRunExample = new Text("Sample text");

// Create a paragraph and add the text run to it
Paragraph paragraphExample = new Paragraph();
paragraphExample.AddChild(textRunExample);

// Create a new Word document with the paragraph
WordDocument doc = new WordDocument(paragraphExample);

// Export the document as a DOCX file
doc.SaveAs("document.docx");

// Access paragraphs and text runs within the document
foreach (Paragraph paragraph in doc.Paragraphs)
{
    foreach (Text textRun in paragraph.Texts)
    {
        // Access text content
        string content = textRun.Text;
        // Display the content to the console
        Console.WriteLine(content);
    }
}
Imports IronWord
Imports IronWord.Models

#Region "Licensing"
' Set the IronWord license key before using the IronWord functionalities
IronWord.License.LicenseKey = "YOUR-KEY-HERE"
'#End Region

' Create text run with sample text
Dim textRunExample As New Text("Sample text")

' Create a paragraph and add the text run to it
Dim paragraphExample As New Paragraph()
paragraphExample.AddChild(textRunExample)

' Create a new Word document with the paragraph
Dim doc As New WordDocument(paragraphExample)

' Export the document as a DOCX file
doc.SaveAs("document.docx")

' Access paragraphs and text runs within the document
For Each paragraph As Paragraph In doc.Paragraphs
	For Each textRun As Text In paragraph.Texts
		' Access text content
		Dim content As String = textRun.Text
		' Display the content to the console
		Console.WriteLine(content)
	Next textRun
Next paragraph
$vbLabelText   $csharpLabel

Let's explore the methods and parameters to read word files from the above code.

  1. We first create a Text object and assign the string "Sample text" to it.

  2. We then instantiate a Paragraph object and add the "textRunExample" to it.

  3. We also instantiate a WordDocument object, name it WordDocument doc, and pass the paragraphExample to it to create a new Word document containing the paragraph.

  4. The code saves the Word document with the file name "document.docx" for later use.

  5. To access the paragraphs in the Word document we just created, we access the "Paragraphs" property of the WordDocument object. The "Paragraphs" property is a list. Hence, we call a foreach loop to iterate through it.

  6. To obtain the text within the paragraph, we access the "Texts" property of Paragraphs. This also returns a list of Text.

  7. Finally, we assign the Text to a string variable named "content" and print it out to the console.

Console output

Console output displaying the text read from the Word Document doc.

Reading an existing Word document in C#

In the previous example, we programmatically created a new Word document and read its content. We can follow similar steps to read an existing Word document by making a few code changes.

Input Document

Input Example: Long form Word document

using IronWord;
using IronWord.Models;

#region Licensing
// Set the IronWord license key before using the IronWord functionalities
IronWord.License.LicenseKey = "YOUR-KEY-HERE";
#endregion

// Load an existing Word document
WordDocument doc = new WordDocument("existing_document.docx");

// Access paragraphs and text runs within the document
foreach (Paragraph paragraph in doc.Paragraphs)
{
    foreach (Text textRun in paragraph.Texts)
    {
        // Access text content
        string content = textRun.Text;
        // Display the content to the console
        Console.WriteLine(content);
    }
}
using IronWord;
using IronWord.Models;

#region Licensing
// Set the IronWord license key before using the IronWord functionalities
IronWord.License.LicenseKey = "YOUR-KEY-HERE";
#endregion

// Load an existing Word document
WordDocument doc = new WordDocument("existing_document.docx");

// Access paragraphs and text runs within the document
foreach (Paragraph paragraph in doc.Paragraphs)
{
    foreach (Text textRun in paragraph.Texts)
    {
        // Access text content
        string content = textRun.Text;
        // Display the content to the console
        Console.WriteLine(content);
    }
}
Imports IronWord
Imports IronWord.Models

#Region "Licensing"
' Set the IronWord license key before using the IronWord functionalities
IronWord.License.LicenseKey = "YOUR-KEY-HERE"
'#End Region

' Load an existing Word document
Dim doc As New WordDocument("existing_document.docx")

' Access paragraphs and text runs within the document
For Each paragraph As Paragraph In doc.Paragraphs
	For Each textRun As Text In paragraph.Texts
		' Access text content
		Dim content As String = textRun.Text
		' Display the content to the console
		Console.WriteLine(content)
	Next textRun
Next paragraph
$vbLabelText   $csharpLabel

The main difference between the two examples is the parameter passed into the WordDocument object. Instead of creating a new document, we load an existing Word file into it. The rest is the same as the other example.

Console output

Console Output

Conclusion

IronWord licensing information

Throughout the examples, we demonstrated how straightforward it is to use the IronWord library to manipulate and read Word documents programmatically in C#. The flexibility and scalability of the IronWord library make it a valuable tool that allows developers to use IronWord in practical, real-life examples, such as filling templates, generating reports, and batch-processing documents. Understanding how Word integrates with applications is valuable as it provides developers with more solutions to their problems.

Furthermore, developers can test IronWord's extensive features for a period of time before purchasing, as it offers a free trial license. Along with ease of use, IronWord also comes with thorough documentation and 24/5 support for developers, alleviating the constant frustration developers might face in the face of production. We also offer various tutorials and a range of code examples for reference to help you get started with IronWord.

After testing out various aspects of IronWord using a trial-license, you can purchase our Lite developer license that starts at $599 and goes upwards from Lite to Professional. Please refer to our licensing page for more information.

常見問題解答

我怎麼能使用 C# 讀取 Word 文檔?

您可以使用 IronWord 在 C# 中讀取 Word 文檔。只需使用 WordDocument 類加載文檔,然後遍歷段落和文本運行以訪問和顯示文本內容。

什麼是可靠的 C# 庫用于讀取 Word 文檔?

IronWord 是一個可靠的 C# 庫,專為讀取和操作 Word 文檔而設計。它提供了一個簡單的 API,可以簡化應用程序中文檔功能的集成。

我需要安裝 Microsoft Office 來用 C# 讀取 Word 文檔嗎?

不,IronWord 不需要安裝 Microsoft Office。它獨立運行,使您可以在不需要 Office 授權版的情況下操作 Word 文檔。

我如何在 Visual Studio 中安裝一個 C# 庫來讀取 Word 文檔?

您可以通過在 Visual Studio 的 NuGet 包管理器中搜索 'IronWord' 並選擇 'Install',或使用 NuGet 包管理器控制台並使用命令 Install-Package IronWord 來安裝 IronWord。

IronWord 支持哪些平台用于文檔操作?

IronWord 支持多種平台,包括 .NET Core (8, 7, 6, 5, 和 3.1+),.NET Standard (2.0+),.NET Framework (4.6.2+),以及 Azure,並且與 Windows, Linux, macOS 和 Android 相兼容。

我可以使用 C# 庫操作現有的 Word 文檔嗎?

是的,IronWord 允許您以編程方式讀取和修改新建和現有的 Word 文檔,提供對文檔內容和結構的完全控制。

IronWord 有免費試用版嗎?

是的,IronWord 提供免費試用版許可。開發者可以使用此試用版來探索其功能並在購買完整許可證之前評估其適用性。

如何使用 C# 將 Word 文檔轉換為其他格式?

您可以使用 IronWord 將 Word 文檔轉換為多種格式,通過利用其 API 將文檔導出為例如 PDF, HTML 和其他格式,儘管具體的轉換可能會有所不同。

在 C# 應用程序中 IronWord 的一些常見用例是什麼?

IronWord 常用於填寫模板、生成報告、批量處理文檔和將 Word 文檔功能集成到 .NET 應用程序中的任務。

IronWord 用戶有哪些支援選項?

IronWord 提供 24/5 支援,並提供全面的文檔、教程和代碼範例,以幫助開發者有效地利用該庫。

Jordi Bardia
軟體工程師
Jordi 在 Python、C# 和 C++ 上最得心應手,當他不在 Iron Software 展現技術時,便在做遊戲編程。在分担产品测测试,产品开发和研究的责任时,Jordi 为持续的产品改进增值。他说这种多样化的经验使他受到挑战并保持参与, 而这也是他与 Iron Software 中工作一大乐趣。Jordi 在佛罗里达州迈阿密长大,曾在佛罗里达大学学习计算机科学和统计学。