Re: [Groff] pdfmom grep (was parallel text processing)

2017-09-09 Thread Peter Schaffter
Ralph -- On Sat, Sep 09, 2017, Ralph Corderoy wrote: > I think you're smuggling a -k or -K through to the first groff that > pdfmom runs. Here's its -Tpdf pipeline again. > > groff -Tpdf -dPDF.EXPORT=1 -mom -z $cmdstring 2>&1 | > grep '^.ds' | > groff -Tpdf -mom - $preconv $cmdstring

Re: [Groff] pdfmom grep (was parallel text processing)

2017-09-09 Thread Deri James
On Sat 09 Sep 2017 09:51:27 Peter Schaffter wrote: > On Sat, Sep 09, 2017, Ralph Corderoy wrote: > > Hi Peter, > > > > > > > > > The grep in pdfmom is returning a binary file hit when it encounters > > > the diacritic in > > > > > > .ds pdf:look(pdf:bm1) L'étranger > > > > > > > > What does lo

Re: [Groff] pdfmom grep (was parallel text processing)

2017-09-09 Thread Ralph Corderoy
Hi Peter, I think you're smuggling a -k or -K through to the first groff that pdfmom runs. Here's its -Tpdf pipeline again. groff -Tpdf -dPDF.EXPORT=1 -mom -z $cmdstring 2>&1 | grep '^.ds' | groff -Tpdf -mom - $preconv $cmdstring The problem is grep seeing invalid UTF-8 and thus dec

Re: [Groff] pdfmom grep (was parallel text processing)

2017-09-09 Thread Peter Schaffter
On Sat, Sep 09, 2017, Ralph Corderoy wrote: > Hi Peter, > > > The grep in pdfmom is returning a binary file hit when it encounters > > the diacritic in > > > > .ds pdf:look(pdf:bm1) L'étranger > > What does locale(1) output for you where you run this pdfmom command? LANG=en_CA.UTF-8 LANGU

Re: [Groff] pdfmom grep (was parallel text processing)

2017-09-09 Thread Ralph Corderoy
Hi Peter, > The grep in pdfmom is returning a binary file hit when it encounters > the diacritic in > > .ds pdf:look(pdf:bm1) L'étranger What does locale(1) output for you where you run this pdfmom command? > The solution is to pass the -a flag to grep. How about groff ... 2>&1 | LC_AL