Public bug reported: poppler version 0.5.1, Ubuntu Dapper
Try to copy and paste from evince or pdftotext a document in Polish or German - all special characters are lost, for example: Polish word "łódź" becomes "d" in plain text. I haven't tried any other languages. Acrobat allows to copy&paste text properly. Beagle-extract-content is also able to recognize proper characters. Random pdfs to try (it doesn't work with any pdf I have): German: http://www.bmwi.de/BMWi/Redaktion/PDF/Publikationen/monitoring-informationswirtschaft-fakten-und-trendbericht-2006-management-summary-de,property=pdf,bereich=bmwi,sprache=de,rwb=true.pdf Polish: http://www.nbp.pl/aktualnosci/Wiadomosci_2006/knb_131106.pdf ** Affects: poppler (Ubuntu) Importance: Undecided Status: Unconfirmed -- special characters lost in pdftotext https://launchpad.net/bugs/72078 -- ubuntu-bugs mailing list ubuntu-bugs@lists.ubuntu.com https://lists.ubuntu.com/mailman/listinfo/ubuntu-bugs