I've asked around for alternatives for PDF scraping, and got tips about
http://tabula.technology/ (still being developed),
https://github.com/okfn/pdftables (no longer developed), and
finally http://pdftables.com, which is the commercial successor
of the discontinued GitHub project.
Might be
Package: wnpp
Severity: wishlist
* Package name    : pdftable
  Version         : 1.0
  Upstream Author : Kyle Cronan
* URL             : http://pdftable.sourceforge.net/
* License         : GPL v3
  Programming Lang: Python
  Description     : extract tables from PDF files
Pdftable is a Python