Public bug reported:

poppler version 0.5.1, Ubuntu Dapper

Try to copy and paste from evince or pdftotext a document in Polish or
German - all special characters are lost, for example: Polish word
"łódź" becomes "d" in plain text. I haven't tried any other languages.

Acrobat allows to copy&paste text properly. Beagle-extract-content is
also able to recognize proper characters.

Random pdfs to try (it doesn't work with any pdf I have):
German: 
http://www.bmwi.de/BMWi/Redaktion/PDF/Publikationen/monitoring-informationswirtschaft-fakten-und-trendbericht-2006-management-summary-de,property=pdf,bereich=bmwi,sprache=de,rwb=true.pdf
Polish:
http://www.nbp.pl/aktualnosci/Wiadomosci_2006/knb_131106.pdf

** Affects: poppler (Ubuntu)
     Importance: Undecided
         Status: Unconfirmed

-- 
special characters lost in pdftotext
https://launchpad.net/bugs/72078

-- 
ubuntu-bugs mailing list
ubuntu-bugs@lists.ubuntu.com
https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs

Reply via email to