Re: scraping PDF

2008-03-29 Thread jeffself
On Mar 29, 12:56 am, "James Bennett" <[EMAIL PROTECTED]> wrote:
> On Fri, Mar 28, 2008 at 11:21 PM, <[EMAIL PROTECTED]> wrote:
> > I'm trying to figure out the best way to link-up everything. Any
> > suggestions?
>
> So, since I talked about it at PyCon, I'll take an example from this
> proje

Re: scraping PDF

2008-03-28 Thread James Bennett
On Fri, Mar 28, 2008 at 11:21 PM, <[EMAIL PROTECTED]> wrote:
> I'm trying to figure out the best way to link-up everything. Any
> suggestions?

So, since I talked about it at PyCon, I'll take an example from this project:

http://www2.ljworld.com/data/crime/ku/

And I'll walk through this in a

Re: scraping PDF

2008-03-28 Thread cmccomas80
Just trying to think the logic through here. If I set the scrape up so the results are:

SB9, Anderson, 1
SB9, Andes, 1
SB9, Brown, 2

where SB9 stands for Senate Bill 9, the middle field is the voter's last name (or last name + first initial if two people have the same last name), and the last field
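
For what it's worth, rows in that "bill, name, code" layout are easy to read back with Python's standard csv module. Here is a minimal sketch, not anything from the original posts: the function names `parse_votes` and `group_by_bill` are made up for illustration, and since the meaning of the last field is cut off above, it is treated as an opaque integer code.

```python
import csv
import io
from collections import defaultdict

# Sample rows in the "bill, name, code" layout described in the message.
SAMPLE = """SB9, Anderson, 1
SB9, Andes, 1
SB9, Brown, 2
"""


def parse_votes(text):
    """Parse 'bill, name, code' rows into a list of (bill, name, code) tuples.

    The third field is kept as an opaque integer code (assumption: its
    meaning is not spelled out in the source).
    """
    # skipinitialspace drops the blank that follows each comma.
    reader = csv.reader(io.StringIO(text), skipinitialspace=True)
    votes = []
    for row in reader:
        if not row:
            continue  # skip blank lines
        bill, name, code = (field.strip() for field in row)
        votes.append((bill, name, int(code)))
    return votes


def group_by_bill(votes):
    """Group parsed votes as {bill: {name: code}}."""
    grouped = defaultdict(dict)
    for bill, name, code in votes:
        grouped[bill][name] = code
    return dict(grouped)


if __name__ == "__main__":
    print(group_by_bill(parse_votes(SAMPLE)))
```

Grouping by bill first keeps one lookup per bill, which would map fairly naturally onto a bill table and a vote table later on, but that split is only one possible design.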

Re: scraping PDF

2008-03-28 Thread cmccomas80
James,

Thanks. I would prefer scraping it into a CSV as well. I had a scraper that got NCAA football scores from a site and output them as CSV to drop into a db; it was in PHP, though, and scraped .html files.

Also, love your blog, a lot of great stuff there.

Thanks again,
C

On Mar 28, 9:34 pm, "Ja

Re: scraping PDF

2008-03-28 Thread James Bennett
On Fri, Mar 28, 2008 at 10:15 PM, <[EMAIL PROTECTED]> wrote:
> our state legislature has all their reports online in PDF format, i
> was hoping to scrape 'em and get them and use them with django to
> create something similar to what adrian did with the w-p and others
> have done.

There are