pdfGetWordText
Get the text of a word.
char *pdfGetWordText(PDFWordHandle word, int *length)
This function returns the text of a word (whose handle was obtained
with pdfGetWord). A string is returned, and *length is filled in with
the string length. The string will be zero-terminated, but it may
contain zero bytes, depending on the current text encoding (see
pdfSetTextEncoding). The caller is responsible for freeing the string
with the pdfFreeMemory function.
C:
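PDFWordHandle word;
char *text;
int length;
int i;

/* pdf is assumed to be a handle to a document that is already open */
pdfBuildWordList(pdf, 1);
for (i = 0; i < pdfGetNumWords(pdf); ++i) {
    word = pdfGetWord(pdf, i);
    text = pdfGetWordText(word, &length);
    /* %s stops at the first zero byte, so this suits encodings
       that do not produce embedded zeros */
    printf("%s\n", text);
    pdfFreeMemory(text);
}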
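Because the returned string may contain zero bytes, functions that rely on the terminator (printf with %s, strlen) can stop early under some encodings. The following is a minimal sketch of one way to use the returned length instead. The helper word_text_to_printable is hypothetical and not part of the library; it copies exactly length bytes and substitutes '.' for each embedded zero so the result can be printed safely.

```c
#include <assert.h>
#include <stddef.h>

/* Hypothetical helper, not a library function.
   Copies up to `length` bytes from `text` into `out`, replacing each
   embedded zero byte with '.', and zero-terminates the result.
   Returns the number of bytes written, excluding the terminator.
   Output is truncated if `out_size` is too small. */
size_t word_text_to_printable(const char *text, int length,
                              char *out, size_t out_size)
{
    size_t n = 0;
    int i;

    assert(text != NULL && out != NULL);
    if (out_size == 0) {
        return 0;
    }
    for (i = 0; i < length && n + 1 < out_size; ++i) {
        out[n++] = (text[i] == '\0') ? '.' : text[i];
    }
    out[n] = '\0';
    return n;
}
```

In the loop above, one could call word_text_to_printable(text, length, buf, sizeof buf) on a local char buffer buf and print buf rather than text. Writing the raw bytes with fwrite(text, 1, length, stdout) is another option when the output should stay in the original encoding.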
pdfBuildWordList
pdfGetNumWords
pdfGetWord
pdfGetWordLength
pdfGetWordFontName
pdfGetWordColor
pdfGetWordBox
pdfGetWordBox2
pdfGetWordCharBox
pdfGetWordCharBox2
pdfGetWordSpaceAfter
pdfGetWordFontSize
pdfGetWordFontIsFixedWidth
pdfGetWordFontIsSerif
pdfGetWordFontIsSymbolic
pdfGetWordFontIsItalic
pdfGetWordFontIsBold
pdfGetWordRotation
pdfGetWordCharPos
pdfGetWordCharLen