On Fri, Apr 3, 2009 at 8:20 PM, Sridhar Ratnakumar
<sridhar.ra...@gmail.com>wrote:

> On 3/26/09 3:29 PM, M Kumar wrote:
>
>> I need to read one pdf file and extract data from it. Is there any one can
>> guide me
>>
> pyPdf?
>
>  http://pybrary.net/pyPdf/


To give my $0.02, I had an opportunity to use both pyPdf and PDFMiner
for an open source project to measure accessibility of PDF documents.
I initially wrote the library using PDFMiner, but found that it had a higher
failure rate in reading documents, especially large ones when compared
to pyPdf. So, I rewrote the library using pyPdf and the experience was
better. Also I noted pyPdf works better on encrypted documents when
compared to PDFMiner.

pyPdf is not perfect and has a few issues which I faced when
reading certain encrypted documents. However, if your PDF files
are mostly non-encrypted, I would suggest pyPdf a better
choice than PDFMiner.



> <http://pybrary.net/pyPdf/>
>
> There is also reportlab toolkit
>
>  http://www.reportlab.org/rl_toolkit.html
>
>
> _______________________________________________
> BangPypers mailing list
> BangPypers@python.org
> http://mail.python.org/mailman/listinfo/bangpypers
>



-- 
-Anand
_______________________________________________
BangPypers mailing list
BangPypers@python.org
http://mail.python.org/mailman/listinfo/bangpypers

Reply via email to