How to extract images from Word document?
Syncfusion® Essential® DocIO is a .NET Word library used to create, read, edit, and convert Word documents programmatically without Microsoft Word or interop dependencies. Using this library, you can extract images from a Word document using C#.
You can find all the images in a Word document by using the FindAllItemsByProperty API.
Steps to extract images from a Word document:
- Create a new .NET Core console application project.
- Install the Syncfusion.DocIO.Net.Core NuGet package as a reference to your project from NuGet.org.
Starting with v16.2.0.x, if you reference Syncfusion® assemblies from the trial setup or from the NuGet feed, include a license key in your projects. Refer to the link to learn about generating and registering a Syncfusion® license key in your application to use the components without a trial message.
- Include the following namespaces in the Program.cs file.
C#
using System.Collections.Generic;
using System.IO;
using Syncfusion.DocIO;
using Syncfusion.DocIO.DLS;
- Use the following code example to extract images from a Word document.
C#
// Open the file as a stream.
using (FileStream docStream = new FileStream(Path.GetFullPath(@"Data/Template.docx"), FileMode.Open, FileAccess.Read))
{
    // Load the file stream into a Word document.
    using (WordDocument document = new WordDocument(docStream, FormatType.Docx))
    {
        // Find all pictures by EntityType in the Word document.
        List<Entity> pictures = document.FindAllItemsByProperty(EntityType.Picture, null, null);
        // Make sure the output folder exists; FileMode.Create does not create missing folders.
        Directory.CreateDirectory(Path.GetFullPath(@"Output"));
        // Iterate through the pictures and save each one as an image file.
        for (int i = 0; i < pictures.Count; i++)
        {
            WPicture image = pictures[i] as WPicture;
            // Use a MemoryStream to handle the image bytes from the picture.
            using (MemoryStream memoryStream = new MemoryStream(image.ImageBytes))
            {
                // Define the path where the image will be saved.
                // The bytes are written as stored; the .jpeg extension is used even if the picture is in another format.
                string imagePath = Path.GetFullPath(@"Output/Image-" + i + ".jpeg");
                // Create a FileStream to write the image to the specified path.
                using (FileStream fileStream = new FileStream(imagePath, FileMode.Create, FileAccess.Write))
                {
                    memoryStream.CopyTo(fileStream);
                }
            }
        }
    }
}
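The sample above saves every picture with a .jpeg extension, although the bytes are copied unchanged and a Word document can also hold PNG, GIF, or BMP images. As a minimal sketch, the extension could instead be guessed from the first few bytes of each image. `GuessImageExtension` and `StartsWith` are hypothetical helpers written for this illustration, not part of DocIO, and the ".bin" fallback for unrecognized formats is an assumption.

```csharp
using System;

// Hypothetical helper (not a DocIO API): guesses a file extension from the
// leading signature bytes of an image. Returns ".bin" when nothing matches.
static string GuessImageExtension(byte[] bytes)
{
    if (StartsWith(bytes, 0x89, 0x50, 0x4E, 0x47)) return ".png";  // PNG signature
    if (StartsWith(bytes, 0xFF, 0xD8, 0xFF)) return ".jpeg";       // JPEG start-of-image marker
    if (StartsWith(bytes, 0x47, 0x49, 0x46, 0x38)) return ".gif";  // "GIF8"
    if (StartsWith(bytes, 0x42, 0x4D)) return ".bmp";              // "BM"
    return ".bin";
}

// Returns true when the array begins with the given prefix bytes.
static bool StartsWith(byte[] bytes, params byte[] prefix)
{
    if (bytes == null || bytes.Length < prefix.Length) return false;
    for (int i = 0; i < prefix.Length; i++)
    {
        if (bytes[i] != prefix[i]) return false;
    }
    return true;
}

// Usage: the first eight bytes of any PNG file.
byte[] pngHeader = { 0x89, 0x50, 0x4E, 0x47, 0x0D, 0x0A, 0x1A, 0x0A };
Console.WriteLine(GuessImageExtension(pngHeader)); // prints .png
```

Inside the loop, the image path could then be built as `@"Output/Image-" + i + GuessImageExtension(image.ImageBytes)`. Only the file name changes; the image bytes are still written without conversion.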
You can download a complete working sample to extract images from a Word document from GitHub.
The input template Word document is as follows:
By executing the program, you will get the images in the output folder, as shown below:
Take a moment to peruse the documentation where you can find basic Word document processing options along with the features like mail merge, merge, split, and compare Word documents, find and replace text in the Word document, protect the Word documents, and most importantly, the PDF and Image conversions with code examples.
Conclusion
I hope you enjoyed learning how to extract images from a Word document in .NET Core.
You can refer to our ASP.NET Core DocIO feature tour page to learn about its other features and documentation, and how to get started quickly. You can also explore our ASP.NET Core DocIO example to understand how to create and manipulate data.
Current customers can check out our components from the License and Downloads page. If you are new to Syncfusion®, you can try our 30-day free trial to check out our other controls.
If you have any queries or require clarifications, please let us know in the comments section below. You can also contact us through our support forums, Direct-Trac, or feedback portal. We are always happy to assist you!