pdfGetWordLength
Get the Unicode length of a word.
int pdfGetWordLength(PDFWordHandle word)
This function returns the length of a word (whose handle was obtained
with
pdfGetWord
), i.e., the number of Unicode characters in the
word.
The Unicode length is useful when calling pdfGetWordCharBox
.
(The length returned by pdfGetWordLength
will generally
be different from the value obtained with pdfGetWordText
, which
converts the word to the requested output encoding.)
C:
PDFWordHandle word;
char *text;
int length;
pdfBuildWordList(pdf, 1);
for (i = 0; i < pdfGetNumWords(pdf); ++i) {
word = pdfGetWord(pdf, i);
for (j = 0; j < pdfGetWordLength(word); ++j) {
pdfGetWordCharBox(word, i, &xMin, &yMin, &xMax, &yMax);
...
}
}
pdfBuildWordList
pdfGetNumWords
pdfGetWord
pdfGetWordText
pdfGetWordFontName
pdfGetWordColor
pdfGetWordBox
pdfGetWordBox2
pdfGetWordCharBox
pdfGetWordCharBox2
pdfGetWordSpaceAfter
pdfGetWordFontSize
pdfGetWordFontIsFixedWidth
pdfGetWordFontIsSerif
pdfGetWordFontIsSymbolic
pdfGetWordFontIsItalic
pdfGetWordFontIsBold
pdfGetWordRotation
pdfGetWordCharPos
pdfGetWordCharLen