tubby wrote:
> Dieter Deyke wrote:
>>> sout = os.popen('pdftotext "%s" - ' %f)
>
>> Your program above should read:
>>
>>sout = os.popen('pdftotext "%s" - ' % (f,))
>
> What is the significance of doing it this way?
It's actually just nit-picking - as long as you know f is never going to
Dieter Deyke wrote:
>> sout = os.popen('pdftotext "%s" - ' %f)
> Your program above should read:
>
>sout = os.popen('pdftotext "%s" - ' % (f,))
What is the significance of doing it this way?
--
http://mail.python.org/mailman/listinfo/python-list
tubby writes:
> David Boddie wrote:
>
>> The pdftotext tool may do what you want:
>>
>> http://www.foolabs.com/xpdf/download.html
>>
>> Let us know how you get on with it.
>>
>> David
>
> Perhaps I'm just using pdftotext wrong? Here's how I was using it:
>
> f = filename
>
> try:
> sout = os
> Perhaps I'm just using pdftotext wrong? Here's how I was using it:
>
> sout = os.popen('pdftotext "%s" - ' %f)
If you are having trouble with popen (not unlikely)
how about just writing to a temporary file and
reading the text from there?
I've used pdftotext several times in the past f
David Boddie wrote:
> The pdftotext tool may do what you want:
>
> http://www.foolabs.com/xpdf/download.html
>
> Let us know how you get on with it.
>
> David
Perhaps I'm just using pdftotext wrong? Here's how I was using it:
f = filename
try:
sout = os.popen('pdftotext "%s" - ' %f)
David Boddie wrote:
> The pdftotext tool may do what you want:
>
> http://www.foolabs.com/xpdf/download.html
>
> Let us know how you get on with it.
I have used this tool. However, I need PDF read ability on Windows and
Linux and in the future Macs. pdftotext works great on Linux, but poorly
On Thursday 25 January 2007 22:05, tubby wrote:
> I know this question comes up a lot, so here goes again. I want to read
> text from a PDF file, run re searches on the text, etc. I do not care
> about layout, fonts, borders, etc. I just want the text. I've been
> reading Adobe's PDF Reference Gui
-BEGIN PGP SIGNED MESSAGE-
Hash: SHA1
have a look at the pdflib (www.pdflib.com). Their Text Extraction
Toolkit might be what you are looking for, though I'm not sure whether
you can use it detached from the pdflib itself.
hth
Nils
tubby schrieb:
> I know this question comes up a lot, s
I know this question comes up a lot, so here goes again. I want to read
text from a PDF file, run re searches on the text, etc. I do not care
about layout, fonts, borders, etc. I just want the text. I've been
reading Adobe's PDF Reference Guide and I'm beginning to develop a
better understandin
Vyz wrote:
> I am looking for a PDF to text script. I am working with multibyte
> language PDFs on Windows Xp. I need to batch convert them to text and
> feed into an encoding converter program
>
> Thanks for any help in this regard
Multibyte languages are not easy. I do text extr
In article <[EMAIL PROTECTED]>,
Vyz <[EMAIL PROTECTED]> wrote:
>I am looking for a PDF to text script. I am working with multibyte
>language PDFs on Windows Xp. I need to batch convert them to text and
>feed into an encoding converter program
>
>Thanks for any
I am looking for a PDF to text script. I am working with multibyte
language PDFs on Windows Xp. I need to batch convert them to text and
feed into an encoding converter program
Thanks for any help in this regard
--
http://mail.python.org/mailman/listinfo/python-list
12 matches
Mail list logo