pdftohtml on sourceforge may help...
On Mon, 27 Jul 2009 19:52:01 -0700 (PDT), writeson <doug.farr...@gmail.com> wrote: > Hi all, > > I've looked around with Google quite a bit, but haven't found anything > like what I'm looking for. Is there a Python library that will extract > images from PDF files? My ultimate goal is to pull the images out, use > the PIL library to reduce the size of the images and rebuild another > PDF file that's an essentially "thumbnail" version of the original PDF > file, smaller in size. > > We've been using imagick to extract the images, but it's difficult to > script and slow to process the input PDF. Can someone suggest > something better? > > Thanks in advance, > Doug -- http://mail.python.org/mailman/listinfo/python-list