On Mon, May 31, 2004 at 07:36:42PM +0000, Pedro M. (Morphix User) wrote:
> Dave Thayer wrote:
>
> >On Tue, May 11, 2004 at 02:08:58PM -0400, Antonio Rodriguez wrote:
> >
> >>On Tue, May 11, 2004 at 01:01:16PM -0400, Matt Price wrote:
> >>
> >>>On Tue, May 11, 2004 at 11:30:11AM -0400, Ralph Katz wrote:
> >>>
> >>>thanks for the clues folks. pdftohtml -- which I confess I *did*
> >>>already know about, sorry, should have said so -- won't work so well
> >>>for me, I don't think; these are scanned-in texts from the jstor
> >>>journal collection, and it's important I keep the pages in order...
> >>>
> >>>as, er, someone mentioned earlier (don't have the thread in front of
> >>>me at the moment), a complex process involving gimp and pdftops seems
> >>>to be the best bet, but it's insanely labour-intensive for long
> >>>documents, so I may forgo the whole project. thx all though.
> >>>
> >>Well, if you have scanned all the pages in about the same position,
> >>and you can establish reasonably well the coordinates of the crop,
> >>you can write a script that does all the work in one step (containing
> >>all the inner steps).
> >>
> >The imagemagick package contains 'convert' which can take pdf as input
> >and perform various transformations including cropping. This might be a
> >little easier than pdf2ps+gimp
> >
> >dt
> >
> You can copy text using adobe acrobat reader (
> http://wiki.debian.net/index.cgi?Acrobat ) and paste it to OpenOffice.
>
The OP mentioned using that, but said it didn't work for non-English scripts.

> Regards.
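
For what it's worth, the "one script that crops every page" idea quoted above could look roughly like the sketch below. It only builds and runs a single ImageMagick 'convert' call using the -density, -crop and +repage options. The function names, the file names (scan.pdf, page-%03d.png), the 150 dpi density and the crop numbers are all made up for illustration and would need to be measured on the actual scans. Offsets are assumed to be non-negative, and the crop box is in pixels at the chosen density.

```python
import subprocess


def build_crop_command(pdf_path, width, height, x_off, y_off,
                       out_pattern="page-%03d.png", density=150):
    """Build the argv list for one ImageMagick 'convert' call.

    All names and defaults here are illustrative assumptions, not
    anything taken from the thread.
    """
    # Geometry string in ImageMagick's WxH+X+Y form (non-negative offsets).
    geometry = f"{width}x{height}+{x_off}+{y_off}"
    return [
        "convert",
        "-density", str(density),  # rasterise the PDF pages at this dpi
        pdf_path,
        "-crop", geometry,         # same crop box applied to every page
        "+repage",                 # reset the canvas after cropping
        out_pattern,               # %03d numbers the output files per page
    ]


def crop_pdf(pdf_path, width, height, x_off, y_off, **options):
    """Run the crop; raises CalledProcessError if 'convert' fails."""
    command = build_crop_command(pdf_path, width, height, x_off, y_off,
                                 **options)
    subprocess.run(command, check=True)
```

Calling crop_pdf("scan.pdf", 1000, 1500, 100, 120) would then write one numbered image per page, which should keep the pages in order. It does nothing for the text-extraction problem, of course; the output is still images.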