Re: PDF Scraping

2024-01-16 Thread Stephen Russell
Keeping mouth shut. On Tue, Jan 16, 2024 at 10:24 AM Richard Kaye wrote: > As Stephen would say, Bad Ed! 😉 > > From: ProfoxTech On Behalf Of Ed Leafe > Sent: Monday, January 15, 2024 8:18 PM > To: profoxt...@leafe.com > Subject: Re: PDF Scraping > > On Jan 12, 2024,

RE: PDF Scraping

2024-01-16 Thread Richard Kaye
As Stephen would say, Bad Ed! 😉 From: ProfoxTech On Behalf Of Ed Leafe Sent: Monday, January 15, 2024 8:18 PM To: profoxt...@leafe.com Subject: Re: PDF Scraping On Jan 12, 2024, at 21:51, Brian Erickson <mailto:br...@dashley.net> wrote: > > It is really easy to do with python.

Re: PDF Scraping

2024-01-15 Thread Ed Leafe
On Jan 12, 2024, at 21:51, Brian Erickson wrote: > > It is really easy to do with python. Heh, I think those exact words with most posts on this list! ;-P -- Ed Leafe ___ Post Messages to: ProFox@leafe.com Subscription Maintenance: https://mail.le

RE: PDF Scraping

2024-01-15 Thread Chris Davis
Looks interesting, I will check it out ... thanks Gianni -Original Message- From: ProfoxTech On Behalf Of Gianni Turri Sent: Saturday, January 13, 2024 12:07 PM To: profoxt...@leafe.com Subject: Re: PDF Scraping Another option is the Balabolka Text Extract Utility, I have used it with

Re: PDF Scraping

2024-01-13 Thread Gianni Turri
11:27 AM To: profoxt...@leafe.com Subject: Re: PDF Scraping Chris This is not easy in general and probably not possible without going outside of VFP. You're probably looking at leveraging Ghostcript somehow to parse the PDF files and dump the text out. -- Alan Bourke alanpbour

Re: PDF Scraping

2024-01-12 Thread Brian Erickson
, January 12, 2024 11:27 AM > To: profoxt...@leafe.com > Subject: Re: PDF Scraping > > Chris > > This is not easy in general and probably not possible without going outside > of VFP. You're probably looking at leveraging Ghostcript somehow to parse the > PDF f

RE: PDF Scraping

2024-01-12 Thread Chris Davis
Forgot Ghostscript could do that, thank you Alan ... works a treat 😊 -Original Message- From: ProfoxTech On Behalf Of Alan Bourke Sent: Friday, January 12, 2024 11:27 AM To: profoxt...@leafe.com Subject: Re: PDF Scraping Chris This is not easy in general and probably not possible

Re: PDF Scraping

2024-01-12 Thread Alan Bourke
Chris This is not easy in general and probably not possible without going outside of VFP. You're probably looking at leveraging Ghostcript somehow to parse the PDF files and dump the text out. -- Alan Bourke alanpbourke (at) fastmail (dot) fm __

PDF Scraping

2024-01-12 Thread Chris Davis
Any suggestions on how best to find data (I can't find it simply by using notepad) in a PDF? I need to process a folder full of PDF's. TIA Chris --- StripMime Report -- processed MIME parts --- multipart/alternative text/plain (text body -- kept) text/html --- ___