Articles in this section
Category / Section

How to find page number for the text in WinForms PDF library?

5 mins read

Syncfusion Essential® PDF is a .NET PDF Library used to create, read, and edit PDF documents. Using this library, you can find page number for the text in a PDF using C# and VB.NET.

Steps to find page number for a text in PDF programmatically:

  1. Create a new C# console application project. Creation of console application project
  2. Install the Syncfusion.Pdf.WinForms NuGet package as reference to your .NET Framework application from NuGet.org.
  3. NuGet package reference
  4. Include the following namespaces in the Program.cs file.

C#

using Syncfusion.Pdf.Parsing;
using System.Drawing;

 

VB.NET

Imports Syncfusion.Pdf.Parsing
Imports System.Drawing

 

  1. Use the following code snippet to find page number for the text in an existing PDF document.

C#

//Load an existing PDF document
PdfLoadedDocument loadedDocument = new PdfLoadedDocument("Sample.pdf");   
//Load page collections
PdfLoadedPageCollection loadedPages = loadedDocument.Pages;
int pageIndex = -1;
string extractedText = string.Empty
//Extract text from existing PDF document pages
for (int i = 0; i < loadedPages.Count; i++)
{
    PdfPageBase loadedPage = loadedPages[i];
    extractedText = loadedPage.ExtractText();
    //Check if the extracted text from page contains the required text
    if (extractedText.Contains("Code39Ext"))
    {
        //Set the page index value
        pageIndex = i;
        break;
    }
 
}
//Close the document
loadedDocument.Close(true);

 

VB.NET

'Load an existing PDF document   
Dim loadedDocument As PdfLoadedDocument = New PdfLoadedDocument( "Sample.pdf")
'Load page collections
Dim loadedPages As PdfLoadedPageCollection = loadedDocument.Pages
Dim pageIndex As Integer = -1
Dim extractedText As String = String.Empty
'Extract text from an existing PDF document pages
Dim i As Integer = 0
Do While (i < loadedPages.Count)
    Dim loadedPage As PdfPageBase = loadedPages(i)
    extractedText = loadedPage.ExtractText
    'Check if the extracted text from page contains the required text 
    If extractedText.Contains("Code39Ext") Then
        'Set the page index value
        pageIndex = i
        Exit Do
    End If
    i = (i + 1)
Loop
' Close the document
loadedDocument.Close(True)

 

In the previously given code sample, the “pageIndex” value will hold the page number in which the given text is located.

A complete working sample can be downloaded from PdfPageNumber.zip

Take a moment to peruse the documentation, where you will find options like basic layout and bounds based text extraction with code examples.

Note:

Starting with v16.2.0.x, if you reference Syncfusion® assemblies from trial setup or from the NuGet feed, include a license key in your projects. Refer to link to learn about generating and registering Syncfusion® license key in your application to use the components without trail message.

 

Conclusion

I hope you enjoyed learning about how to find page number for the text in WinForms PDF library.

You can refer to our WinForms PDF feature tour page to know about its other groundbreaking feature representations and documentation, and how to quickly get started for configuration specifications. You can also explore our WinForms PDF example to understand how to create and manipulate data.

For current customers, you can check out our components from the License and Downloads page. If you are new to Syncfusion®, you can try our 30-day free trial to check out our other controls.

If you have any queries or require clarifications, please let us know in the comments section below. You can also contact us through our support forumsDirect-Trac, or feedback portal. We are always happy to assist you!


Did you find this information helpful?
Yes
No
Help us improve this page
Please provide feedback or comments
Comments (0)
Please  to leave a comment
Access denied
Access denied