How to convert a table to text in a Word document
Syncfusion® Essential® DocIO is a .NET Word library used to create, read, edit, and convert Word documents programmatically without Microsoft Word or interop dependencies. Using this library, you can convert a table to text in a Word document using C#.
To achieve this, follow the steps below:
- Retrieve the table from the document and convert it to text by iterating through its rows, cells, and paragraphs.
- Split the converted text into individual lines based on line breaks, with each line representing a row as a new paragraph.
- Remove the original table from the document body.
- Insert each line of the converted text as a new paragraph at the original location of the table.
Steps to convert a table to text in a Word document:
- Create a new .NET Core console application project.
- Install the Syncfusion.DocIO.Net.Core NuGet package as a reference to your project from NuGet.org.
Starting with v16.2.0.x, if you reference Syncfusion® assemblies from trial setup or from the NuGet feed, include a license key in your projects. Refer to the link to learn about generating and registering a Syncfusion® license key in your application to use the components without trail message.
- Include the following namespaces in Program.cs file
C#
using Syncfusion.DocIO.DLS;
using Syncfusion.DocIO;
- Use the following code example to convert a table to text in a Word document.
C#
using (FileStream inputStream = new FileStream("Data/Template.docx", FileMode.Open, FileAccess.Read, FileShare.ReadWrite))
{
// Open the template Word document.
using (WordDocument document = new WordDocument(inputStream, FormatType.Docx))
{
// Retrieve the body of the first section.
WTextBody body = document.Sections[0].Body;
// Retrieve the first table from the body.
WTable table = document.Sections[0].Tables[0] as WTable;
// Convert the table to text.
string convertedText = ConvertTableToText(document, table);
// Split the converted text by line breaks to create individual paragraphs.
string[] paraText = convertedText.Split(new string[] { "\r\n" }, StringSplitOptions.None);
// Get the index of the table.
int index = body.ChildEntities.IndexOf(table);
// Remove the table from the document body.
body.ChildEntities.Remove(table);
// Insert each line of the converted text as a new paragraph at the table's original location.
for (int i = 0; i < paraText.Length; i++)
{
// Create a new paragraph.
IWParagraph paragraph = new WParagraph(document);
// Append the corresponding line of text to the paragraph.
paragraph.AppendText(paraText[i]);
// Insert the paragraph into the body at the correct index.
body.ChildEntities.Insert(index + i, paragraph);
}
// Save the modified document.
using (FileStream outputStream = new FileStream("Output/Output.docx", FileMode.Create, FileAccess.Write))
{
document.Save(outputStream, FormatType.Docx);
}
}
}
- Use the following helper method to iterate through the table’s rows, cells, and paragraphs to convert it to text format.
C#
/// <summary>
/// Converts the specified table into a text format, with cells separated by commas and rows by line breaks.
/// </summary>
static string ConvertTableToText(WordDocument document, WTable table)
{
// Use StringWriter to build the text output.
StringWriter stringWriter = new StringWriter();
// Iterate through each row of the table.
for (int row = 0; row < table.Rows.Count; row++)
{
// Iterate through each cell in the current row.
for (int column = 0; column < table.Rows[row].Cells.Count; column++)
{
// Get the current cell.
WTableCell cell = table.Rows[row].Cells[column];
// Iterate through each paragraph in the cell.
for (int para = 0; para < cell.Paragraphs.Count; para++)
{
// Get the current paragraph.
IWParagraph paragraph = cell.Paragraphs[para] as IWParagraph;
// Iterate through each item in the paragraph.
for (int item = 0; item < paragraph.Items.Count; item++)
{
// If the item is a text range, write the text to the StringWriter.
if (paragraph.Items[item] is IWTextRange textRange)
{
stringWriter.Write(textRange.Text);
}
}
// Add a new line for each paragraph within the same cell, except the last one.
if (para < cell.Paragraphs.Count - 1)
{
stringWriter.Write("\r\n");
}
}
// Add a comma delimiter between cells, except for the last cell in the row.
if (column < table.Rows[row].Cells.Count - 1)
{
stringWriter.Write(",");
}
}
// Add a new line for each row, except for the last row.
if (row < table.Rows.Count - 1)
{
stringWriter.Write("\r\n");
}
}
// Return the built string representing the table content.
return stringWriter.ToString();
}
You can download a complete working sample to convert a table to text in a Word document from the GitHub.
By executing the program, you will get the Word document as follows.
Take a moment to peruse the documentation where you can find basic Word document processing options along with the features like mail merge, merge, split, and compare Word documents, find and replace text in the Word document, protect the Word documents, and most importantly, the PDF and Image conversions with code examples.
Conclusion
I hope you enjoyed learning about how to convert a table to text in a Word document.
You can refer to our ASP.NET Core DocIO feature tour page to know about its other groundbreaking feature representations and documentation, and how to quickly get started for configuration specifications. You can also explore our ASP.NET Core DocIO example to understand how to create and manipulate data.
For current customers, you can check out our components from the License and Downloads page. If you are new to Syncfusion®, you can try our 30-day free trial to check out our other controls.
If you have any queries or require clarifications, please let us know in the comments section below. You can also contact us through our support forums, Direct-Trac, or feedback portal. We are always happy to assist you!