Unless I'm mistaken, other locales didn't solve
the problem.
Here's how I tested them:
$ locale -a
C
C.UTF-8
en_US
en_US.iso88591
en_US.utf8
POSIX
$ for encoding in $(pdftotext -listenc | sed 1d) ; do echo Trying encoding
$encoding ; for loc in $(locale -a) ; do LC_ALL="$loc" pdftotext -enc $encoding
-layout -nopgbrk /tmp/pone.0009339.pdf - ; done ; done | egrep "Atractylodes
japonica" | less
--
To UNSUBSCRIBE, email to [email protected]
with a subject of "unsubscribe". Trouble? Contact [email protected]