On Tuesday, June 25, 2013 9:48:44 AM UTC+5:30, jyou...@kc.rr.com wrote:
> 1. Is there another way to get metadata out of a pdf without having to
> install another module?
> 2. Is it safe to assume pdf files should always be encoded as latin-1 (when
> trying to read it this way)? Is there a chance they could be something else?

If your data is binary, open the file in binary mode (mode="rb") rather than choosing a bogus encoding. You then have to make your strings binary as well (b-prefix).

Also, I am surprised that it works at all. Most pdfs are compressed, I thought??

> 3. Is the io module a good way to pursue this? The docs say:
> The io module provides the Python interfaces to stream handling. Under Python
> 2.x, this is proposed as an alternative to the built-in file object, but in
> Python 3.x it is the default interface to access files and streams.

So I guess there is no point in using io explicitly in Python 3??
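
Here is a rough sketch of the binary-mode suggestion, assuming the metadata sits uncompressed in the file as plain literal strings such as /Title (...). The names find_pdf_info and read_pdf_info are made up for illustration. It will find nothing if the Info dictionary is inside a compressed object stream, and it will mangle values that contain escaped parentheses or are stored as hex or UTF-16 strings.

```python
import re

# Keys commonly found in a PDF document information dictionary.
WANTED_KEYS = (b"Title", b"Author", b"Creator", b"Producer")


def find_pdf_info(data):
    """Scan raw PDF bytes for uncompressed '/Key (value)' pairs.

    Both the pattern and the data are bytes, so no text encoding is involved.
    """
    found = {}
    for key in WANTED_KEYS:
        # Bytes pattern (rb-prefix): '/Key', optional whitespace, '(value)'.
        match = re.search(rb"/" + key + rb"\s*\(([^)]*)\)", data)
        if match:
            found[key.decode("ascii")] = match.group(1)
    return found


def read_pdf_info(path):
    """Open the file in binary mode and scan its contents."""
    with open(path, "rb") as f:  # "rb": read raw bytes, no decoding
        return find_pdf_info(f.read())
```

The values come back as bytes. Decode them only once you know which encoding that particular string uses.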