Re: io module and pdf question

rusi Mon, 24 Jun 2013 23:38:44 -0700

On Tuesday, June 25, 2013 9:48:44 AM UTC+5:30, [email protected] wrote:
> 1. Is there another way to get metadata out of a pdf without having to 
> install another module?
> 2. Is it safe to assume pdf files should always be encoded as latin-1 (when 
> trying to read it this way)?  Is there a chance they could be something else?


If your code is binary open in binary mode (mode="rb") rather than choosing a 
bogus encoding. You then have to make your strings also binary (b-prefix)
Also I am surprised that it works at all.  Most pdfs are compressed I thought??

> 3. Is the io module a good way to pursue this?

The docs say:
> The io module provides the Python interfaces to stream handling. Under Python 
> 2.x, this is proposed as an alternative to the built-in file object, but in 
> Python 3.x it is the default interface to access files and streams.

So I guess no point using io for python 3??

-- 
http://mail.python.org/mailman/listinfo/python-list

Re: io module and pdf question

Reply via email to