pdfSetTextEncoding

Set the encoding to use for text extraction.
void pdfSetTextEncoding(char *encodingName)
This function sets the encoding to use for text extraction.

The default encoding is "Latin1" (ISO-8859-1). The other built-in text encodings are "ASCII7" (7-bit ASCII), "UTF-8" (Unicode in UTF-8 format), and "UCS-2" (2-byte Unicode). Additional encodings can be defined via the config file.
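
To illustrate how the four built-in encodings differ, here is a minimal sketch that encodes a single Unicode character in each of them. It is not part of the library: `encode_char` is a hypothetical helper, and the big-endian byte order for "UCS-2" and the choice to return 0 for characters that cannot be represented are assumptions made for the example, not documented behavior.

```c
#include <assert.h>
#include <stddef.h>
#include <string.h>

/* Hypothetical helper, not a library function: encode one Unicode code
 * point (Basic Multilingual Plane only) in the named encoding.
 * Writes at most 3 bytes to out and returns the number of bytes written,
 * or 0 if the character cannot be represented or the name is unknown.
 * Assumption: UCS-2 is written big-endian. */
size_t encode_char(const char *encodingName, unsigned int cp,
                   unsigned char *out)
{
    assert(encodingName != NULL && out != NULL);

    /* Reject code points outside the BMP and surrogate values. */
    if (cp > 0xFFFF || (cp >= 0xD800 && cp <= 0xDFFF))
        return 0;

    if (strcmp(encodingName, "ASCII7") == 0) {
        if (cp > 0x7F)
            return 0;               /* only 7-bit characters fit */
        out[0] = (unsigned char)cp;
        return 1;
    }
    if (strcmp(encodingName, "Latin1") == 0) {
        if (cp > 0xFF)
            return 0;               /* ISO-8859-1 covers U+0000..U+00FF */
        out[0] = (unsigned char)cp;
        return 1;
    }
    if (strcmp(encodingName, "UCS-2") == 0) {
        out[0] = (unsigned char)(cp >> 8);      /* high byte first */
        out[1] = (unsigned char)(cp & 0xFF);
        return 2;
    }
    if (strcmp(encodingName, "UTF-8") == 0) {
        if (cp < 0x80) {
            out[0] = (unsigned char)cp;
            return 1;
        }
        if (cp < 0x800) {
            out[0] = (unsigned char)(0xC0 | (cp >> 6));
            out[1] = (unsigned char)(0x80 | (cp & 0x3F));
            return 2;
        }
        out[0] = (unsigned char)(0xE0 | (cp >> 12));
        out[1] = (unsigned char)(0x80 | ((cp >> 6) & 0x3F));
        out[2] = (unsigned char)(0x80 | (cp & 0x3F));
        return 3;
    }
    return 0;                       /* unknown encoding name */
}
```

Under these assumptions, U+00E9 (é) takes one byte in "Latin1" and two bytes in both "UTF-8" and "UCS-2", and has no "ASCII7" representation. U+20AC (€) fits only in "UTF-8" (three bytes) and "UCS-2" (two bytes).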

The text encoding is a global setting: it affects text extraction from all XpdfViewer objects.

C:
pdfSetTextEncoding("UTF-8");
pdfExtractTextFromPage