Hi,
I think it is not poppler issue.
In both PDFs, most (all?) fonts are embedded as PostScript Type1
without CID-keyed, so Unicode codepoints for the characters in
the embedded fonts cannot be extracted. In fact, even if you
copy & paste the text via Adobe Acrobat (or Adobe Reader),
the extract
Hi, all:
I use the pdftotext in poppler/util/ directory. When it convert the
following two urls's pdf files,
it seems convert errorly.
http://www.100ec.cn/b2bimages/dcbg.pdf
http://sjb.qlwb.com.cn/images/2011-06/16/Q02/qd0216.pdf
Can you give me some advices? Thank you very much.