
The XpdfText library extracts plain text from PDF files. The PDF file can be on disk or in memory, and likewise, the text can be extracted to memory or directly to disk.
XpdfText can be used in different ways:
The extracted text can be converted to a wide choice of standard encodings:
Additionally, Glyph & Cog can help you define any other encodings you may need.
The XpdfText library also includes all of the functionality of the XpdfInfo library.
XpdfText is easy to use:
|
PDFHandle pdf; char *buf; int length; pdfLoadFile(&pdf, "MyFile.pdf"); // convert to a text file on disk... pdfConvertToTextFile(pdf, 1, 5, "MyFile.txt"); // ... or convert in memory buf = pdfConvertToTextString(pdf, 1, 5, &length); |
Supported platforms:
Buy XpdfText online at PDF Store
Contact Glyph & Cog for more information, including pricing, documentation, and evaluation copies.