How to Manipulate a Word Document Using C#

Microsoft Word is a word processor developed by Microsoft. It was first released on October 25, 1983, under the name Multi-Tool Word for Xenix systems. Subsequent versions were developed for a wide range of platforms, including IBM PCs running DOS (1983), Apple Macintosh running the Classic Mac OS (1985), AT&T UNIX PC (1985), Atari ST (1988), OS/2 (1989), Microsoft Windows (1989), SCO Unix (1990), macOS (2001), web browsers (2010), iOS (2014), and Android (2015). Older versions of Microsoft Word can be run on Linux using Wine.

Commercial versions of Word can be licensed as a stand-alone application or as a component of Microsoft Office, which can be purchased with a perpetual license or as part of a Microsoft 365 subscription. In this article, we will manipulate Word documents using C# with the help of the Microsoft Interop assemblies, and explore how IronXL helps us edit Excel documents.

How To Manipulate a Word Document Using C#

  1. Create a new Visual Studio project.
  2. Install the library needed to read Word documents.
  3. To manipulate a Word document, load an existing file or create a new one.
  4. Edit the document data and save the file.
  5. Release all the objects that were created.

What is Microsoft Interop

Programs written in C# or VB.NET can create or open Word documents (DOC, DOCX, and RTF) with Office Interoperability for Microsoft Word. However, it has a lot of drawbacks when used in projects.

This section covers common issues you may run into when using Microsoft Office Interop (Word Automation) from C# or VB.NET.

For example:

  • Every client PC that runs Word automation needs a licensed copy of Microsoft Word.
  • The same version of Microsoft Word must be installed on every client PC.
  • When automation is used, Word loads various files and DLLs in the background, which consumes a few megabytes of RAM.
  • Microsoft Word API is accessed via a COM object. Issues may arise when calling a COM object from managed code, such as type conversions, requiring a COM wrapper, and poor .NET Framework integration.
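
Because Interop only works on machines where Word is present, it can help to check for it before attempting any automation. The following is a minimal sketch and not part of the original walkthrough: the class `ComCheck` and the method `IsComServerRegistered` are hypothetical helper names, and the check assumes that an installed copy of Word registers the `Word.Application` ProgID. A registered ProgID does not prove that the copy is licensed or that it is the version you expect.

```csharp
using System;

static class ComCheck
{
    // Returns true when a COM server is registered under the given ProgID.
    // Hypothetical helper for illustration only.
    public static bool IsComServerRegistered(string progId)
    {
        try
        {
            // GetTypeFromProgID returns null when the ProgID is not registered.
            return Type.GetTypeFromProgID(progId) != null;
        }
        catch (PlatformNotSupportedException)
        {
            // COM is unavailable on non-Windows platforms.
            return false;
        }
    }
}

class Program
{
    static void Main()
    {
        bool wordAvailable = ComCheck.IsComServerRegistered("Word.Application");
        Console.WriteLine(wordAvailable
            ? "Word appears to be registered; Interop automation can be attempted."
            : "Word is not registered on this machine; Interop calls will fail.");
    }
}
```

Running this check at startup lets an application report a clear message instead of failing later with a COM exception.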

Creating a New Project in Visual Studio

Before using the Interop library, launch Visual Studio and create a .NET project. Any version of Visual Studio will work, though the most recent is recommended. Depending on your needs, you can choose a project template such as a Windows Forms application. For simplicity, I'll use a Console Application here.

How To Manipulate A Word document Using C#: Figure 1 - Creating a New Visual Studio Project

Configure Project Details

Next, provide the location and name of the project.

How To Manipulate A Word document Using C#: Figure 2 - Configuring the New VS Project

Create a New Project File using the .NET Framework

Using the Framework drop-down menu, choose a .NET Framework version. This project uses .NET Framework 4.7. Then click the "Create" button.

Once Visual Studio has generated the solution, open the Program.cs file to enter the code, then build or run the program.

How To Manipulate A Word document Using C#: Figure 3 - New .NET Project .cs file

With the project in place, the next step is to add the Microsoft.Office.Interop.Word library so that we can test the code.

Install Interop Library

Next, install the Interop library. Enter the following command in the NuGet Package Manager Console:

Install-Package Microsoft.Office.Interop.Word

How To Manipulate A Word document Using C#: Figure 4 - For installing the IronXL library, you can use the Package Manager Console and enter the given command: Install-Package IronXL.Excel

Alternatively, search for "Interop" in the NuGet Package Manager and select the required package from the list of results.

How To Manipulate A Word document Using C#: Figure 5 - Selecting `Microsoft.Office.Interop.Word` Library

Once you have installed all the necessary libraries, you can then start to edit DOCX files.

Manipulate Existing Word Documents using Interop

To use Microsoft Word, you must first create an instance of Microsoft.Office.Interop.Word.Application. All communication with Word documents goes through this instance. The next step is to obtain a Word document instance through the Documents property of the Application instance we just created. As the C# code excerpt below shows, this allows us to manipulate Word documents programmatically:

using System;
using Microsoft.Office.Interop.Word;

class Program
{
    static void Main()
    {
        try
        {
            // Create a new instance of Word Application
            var WordApp = new Microsoft.Office.Interop.Word.Application();
            // Open an existing document
            var WordDoc = WordApp.Documents.Open(@"d:/Demo.docx");
            // Edit the content of the first paragraph
            WordDoc.Paragraphs[1].Range.Text = "New text here...";
            // Save the edited document
            WordDoc.SaveAs(@"d:/NewDemo.docx");
            // Close the document
            WordDoc.Close();
            // Quit the Word application
            WordApp.Quit();
        }
        catch (Exception ex)
        {
            Console.WriteLine(ex.ToString());
        }
    }
}

The equivalent VB.NET code:

Imports System
Imports Microsoft.Office.Interop.Word

Friend Class Program
	Shared Sub Main()
		Try
			' Create a new instance of Word Application
			Dim WordApp = New Microsoft.Office.Interop.Word.Application()
			' Open an existing document
			Dim WordDoc = WordApp.Documents.Open("d:/Demo.docx")
			' Edit the content of the first paragraph
			WordDoc.Paragraphs(1).Range.Text = "New text here..."
			' Save the edited document
			WordDoc.SaveAs("d:/NewDemo.docx")
			' Close the document
			WordDoc.Close()
			' Quit the Word application
			WordApp.Quit()
		Catch ex As Exception
			Console.WriteLine(ex.ToString())
		End Try
	End Sub
End Class

The code above edits a Word document from C#. First, we create an instance of the Word application through Interop. The Open method then opens an existing Word file and returns it as a document object, whose properties and methods we use to interact with the document. In the example, we replace the text of the first paragraph by indexing into the Paragraphs collection; Word collections are 1-based, so index 1 is the first paragraph. Finally, the changes are saved with SaveAs, and the document and application are closed.
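
The example above leaves a Word process running if an exception is thrown before Quit is reached. The following sketch covers steps 3 and 5 of the list at the top of the article: it creates a new document instead of opening one, and releases the COM objects in a finally block. It assumes that Word and the Microsoft.Office.Interop.Word package are installed; the file name InteropDemo.docx and the use of the temp folder are illustrative choices, not part of the original walkthrough.

```csharp
using System;
using System.IO;
using System.Runtime.InteropServices;
using Word = Microsoft.Office.Interop.Word;

class Program
{
    static void Main()
    {
        Word.Application wordApp = null;
        Word.Document wordDoc = null;
        try
        {
            // Start Word without showing its window
            wordApp = new Word.Application();
            wordApp.Visible = false;

            // Create a new, empty document and set its text
            wordDoc = wordApp.Documents.Add();
            wordDoc.Content.Text = "Created from C#.";

            // Save to an illustrative location in the temp folder
            string path = Path.Combine(Path.GetTempPath(), "InteropDemo.docx");
            wordDoc.SaveAs2(path);
            Console.WriteLine("Saved to " + path);
        }
        catch (Exception ex)
        {
            // For example, a COMException when Word is not installed
            Console.WriteLine(ex.Message);
        }
        finally
        {
            // Close the document without saving again, then release it
            if (wordDoc != null)
            {
                wordDoc.Close(false);
                Marshal.ReleaseComObject(wordDoc);
            }
            // Quit Word so that no background process is left behind
            if (wordApp != null)
            {
                wordApp.Quit();
                Marshal.ReleaseComObject(wordApp);
            }
        }
    }
}
```

Declaring the variables outside the try block is what makes the cleanup possible: the finally block runs whether or not saving succeeded. Intermediate objects such as the Documents collection are left to the garbage collector in this sketch.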

IronXL Library Alternative to Interop

IronXL is an alternative to Microsoft Interop that may be used in .NET programs to handle Excel files. While Microsoft Interop requires interacting with Excel through the Interop assemblies, IronXL offers a more straightforward, effective, and powerful method for programmatically manipulating Excel files in .NET contexts.

Utilizing IronXL instead of Microsoft Interop has several benefits, such as:

  • Performance and Resource Efficiency: Compared to Microsoft Interop, which depends on the Excel application being installed on the computer, IronXL performs better and uses fewer resources because it is not dependent on the Excel application.
  • Ease of Use and Simplicity: IronXL provides an easier-to-use API that simplifies the reading, writing, and manipulating of Excel files without the complications that come with Microsoft Interop.
  • Compatibility and Dependency: IronXL eliminates dependencies and compatibility problems that may occur with various versions of Excel or Office by not requiring the installation of Microsoft Excel on the computer.
  • Platform Independence: Unlike Microsoft Interop, which may be more closely associated with particular Microsoft Office versions, IronXL offers greater flexibility and ease of deployment across various environments and platforms.

For .NET developers who must operate with Excel files programmatically, IronXL is frequently a better option because of its ease of use, speed, and reduced reliance on third-party software installations. The decision between IronXL and Microsoft Interop, however, could be influenced by the specifics of the project, the infrastructure that already exists, and the user's level of expertise with each library.

When deciding between these options, always keep your application's requirements in mind. See the IronXL documentation to learn more about the library.

Installing IronXL Library

The next step requires the IronXL library. To install it, open the NuGet Package Manager Console and enter the following command:

Install-Package IronXL.Excel

How To Manipulate A Word document Using C#: Figure 6 - Installing IronXL using the console command

Alternatively, search for "IronXL" in the NuGet Package Manager and select the package we need from the list of results.

How To Manipulate A Word document Using C#: Figure 7 - Installing the `IronXL.Excel` Package through Browsing

Editing Excel Documents using IronXL

A workbook can be read, modified, and exported to the XLSX or XLS format with just a few lines of code. The following example loads an Excel file, reads and updates cell values, and saves the result:

using System;
using IronXL;

class Program
{
    static void Main()
    {
        // Load an existing Excel file
        var workbook = WorkBook.Load("Demo file.xlsx");
        // Access a worksheet by name
        var ws = workbook.GetWorkSheet("Sheet1");
        // Read a value from a cell and output it to the console
        string address_val = ws["A1"].ToString();
        Console.WriteLine(address_val);
        // Modify a cell's value
        ws["A2"].Value = "Hello World";
        // Save the whole workbook in both formats
        workbook.SaveAs("export.xlsx");
        workbook.SaveAs("export.xls");
        // Save only the first worksheet, selected by index, to a separate file
        workbook.WorkSheets[0].SaveAs("sheet-export.xls");
    }
}

The equivalent VB.NET code:

Imports IronXL

Friend Class Program
	Shared Sub Main()
		' Load an existing Excel file
		Dim workbook = WorkBook.Load("Demo file.xlsx")
		' Access the first sheet or the sheet by name
		Dim ws = workbook.GetWorkSheet("Sheet1")
		' Read a value from a cell and output it to the console
		Dim address_val As String = ws("A1").ToString()
		Console.WriteLine(address_val)
		' Modify a cell's value
		ws("A2").Value = "Hello World"
		' Save the workbook to different formats
		workbook.SaveAs("export.xlsx")
		workbook.SaveAs("export.xls")
		workbook.WorkSheets(0).SaveAs("sheet-export.xls")
	End Sub
End Class

The previous example loads an existing Excel file by calling the Load function, which takes the file path and name as its argument and returns a WorkBook object. GetWorkSheet then retrieves a worksheet by its sheet name, and the Excel cell address is used to read the value. See the IronXL documentation for more about reading Excel files.

We can change the sheet's values by using the same Excel address. The SaveAs function of the WorkBook object saves the Excel document as an XLSX or XLS file; this saves the entire file in the chosen format.

How To Manipulate A Word document Using C#: Figure 8 - Console Output

Additionally, we can select a specific Excel worksheet by its index or by its name, and then export that worksheet's data to a different file with SaveAs. See the IronXL documentation for more about formatting and exporting Excel files.

Conclusion

IronXL is a popular .NET library for working with Excel files. It doesn't rely on any additional external libraries, and because it is self-contained, Microsoft Excel does not need to be installed. It runs on multiple platforms. This contrasts with the Interop library, which depends on an installed copy of Microsoft Office to edit documents.

IronXL is a complete solution for any programming task that involves Microsoft Excel documents. Calculations, sorting strings or numbers, trimming, appending, finding and replacing, merging and unmerging, and saving files are just a few of the many available operations. You can validate spreadsheet data and also set cell data formats. It supports reading and writing files as well as handling Excel data.

At launch, an IronXL license cost $799. Customers can also pay a one-year subscription fee to receive software updates and support. For an additional fee, IronXL also offers redistribution coverage. See the IronXL licensing page for details, and the Iron Software website for more about its other products.

Frequently Asked Questions

How can I manipulate Word documents using C#?

To manipulate Word documents using C#, you can use the Microsoft.Office.Interop.Word library. This involves creating an instance of the Word application, opening the document, making changes, and saving the document programmatically.

What are the limitations of using Microsoft Interop for Word document manipulation?

The limitations of Microsoft Interop include the need for a licensed version of Microsoft Word on every client PC, potential compatibility problems between versions, and increased memory consumption caused by background processes.

How can I set up a C# project in Visual Studio to work with Word documents?

In Visual Studio, you can set up a new project by selecting a console application, configuring the necessary project details, and making sure the correct .NET Framework version is chosen. You then need to add a reference to Microsoft.Office.Interop.Word through the NuGet Package Manager.

What are the differences between IronXL and Microsoft Interop for handling Excel files?

IronXL offers advantages over Microsoft Interop: it does not require Excel to be installed, it performs better, and it has a simpler API for manipulating Excel files. It also eliminates the compatibility problems associated with the Interop approach.

How can I install IronXL in my .NET project?

To install IronXL in your .NET project, open the NuGet Package Manager Console in Visual Studio and run the command Install-Package IronXL.Excel. You can also search for IronXL in the NuGet Package Manager UI and install it directly.

How can I edit Excel documents using IronXL in C#?

With IronXL, you can edit Excel documents by loading them with WorkBook.Load, accessing specific worksheets, modifying cell values, and saving the workbook using the methods provided by the WorkBook and WorkSheet objects.

What are the benefits of using IronXL for Excel file manipulation?

IronXL provides benefits such as improved performance, ease of use, and platform independence. It does not require Excel to be installed, which eliminates dependency problems and allows seamless integration into .NET applications.

Can I automate Word document tasks without using Microsoft Interop?

Yes, various third-party libraries offer alternatives to Microsoft Interop for automating Word document tasks, providing simpler APIs and removing the need to install Microsoft Word.

Jordi Bardia
Software Engineer
Jordi is most proficient in Python, C#, and C++, and when he isn't putting his skills to use at Iron Software, he programs games. Sharing responsibilities for product testing, product development, and research, Jordi brings immense value to ...