Re: Scraping non-html webpage in Perl

2012-05-15 Thread Paolo Gianrossi
Looks like plain text, so I don't know what you exactly mean by "scraping"... You mean download it? cheers Paolo Gianrossi (An unmatched left parenthesis creates an unresolved tension that will stay with you all day -- xkcd 2012/5/9 Formatting Solutions

Re: Scraping non-html webpage in Perl

2012-05-15 Thread Randal L. Schwartz
> "Formatting" == Formatting Solutions > writes: Formatting> I would like to get some information from a non-html webpage: Formatting> http://www.biomart.org/biomart/martservice?type=datasets&mart=CosmicMartusing Can't fetch that, so I have no idea what "non-html" is. What is the MIME

Re: Scraping non-html webpage in Perl

2012-05-09 Thread Matthew K
#!/usr/bin/perl use LWP::Simple; my $url = 'your website' my $content = get("$url"); print $content;   -- Matt > > From: Formatting Solutions >To: beginners@perl.org >Sent: Wednesday, May 9, 2012 8:11 AM >Subject: Scraping non-html webpage in Perl > >Hi, > >I wo

Re: Scraping

2006-06-02 Thread Ken Foskey
Found this the yesterday day... http://www.perl.com/cs/user/print/a/980 FEAR::API helps automate scraping. PS: I do not really understand the bit about template yet. Look interesting to rip reports from old mainframe systems, is it? -- To unsubscribe, e-mail: [EMAIL PROTECTED] For additiona

Re: Scraping

2006-06-01 Thread David Romano
Hi Ken, On 6/1/06, [EMAIL PROTECTED] <[EMAIL PROTECTED]> wrote: The second option worked to print Abercrombie, Neil to the screen. Still working on basic concepts. The split construction was suggested by someone as a way to get to pulling in all listings and ultimately all votes. All votes? Yo

Re: Scraping

2006-06-01 Thread kc68
On Thu, 01 Jun 2006 20:14:36 -0400, David Romano <[EMAIL PROTECTED]> wrote: Hi kc68, On 6/1/06, [EMAIL PROTECTED] <[EMAIL PROTECTED]> wrote: I'm not getting past printing to the screen and to a file the page in the script below but without the list of names in the middle. Without the if

Re: Scraping

2006-06-01 Thread David Romano
Hi kc68, On 6/1/06, [EMAIL PROTECTED] <[EMAIL PROTECTED]> wrote: I'm not getting past printing to the screen and to a file the page in the script below but without the list of names in the middle. Without the if line I get an endless scroll. I want to be able to pull in all names and then isol

Re: Scraping Data Behind a Form

2006-04-12 Thread Alan_C
On Wednesday 12 April 2006 12:20, [EMAIL PROTECTED] wrote: [ . . ] lynx -source http://www.theblackchurchpage.com/modules.php?name=Locator > tsthtmsource.htm That gets it the page's markup/source on my Slackware. There's javascript in that page but I don't know much about this. It appears you

Re: Scraping Data Behind a Form

2006-04-12 Thread kc68
On Tue, 11 Apr 2006 18:12:16 -0400, <[EMAIL PROTECTED]> wrote: I am slowly making my way through the process of scraping the data behind a form and can now get five results plus a series of links using the script below. I need help in doing the following: 1) Eliminating all material on the

Re: Scraping Data Behind a Form

2006-04-11 Thread kc68
I am slowly making my way through the process of scraping the data behind a form and can now get five results plus a series of links using the script below. I need help in doing the following: 1) Eliminating all material on the page other than the list and the links (and ultimately elimina

RE: Scraping Data Behind a Form

2006-04-11 Thread Dhanashri Bhate
Hi, The page shows 3 forms, ( 2 for Search and one for language selection ). You will first need to decide which form you want to use, and then issue submit_form function with the fields you mentioned. Something like below (untested! :) ) $browser->submit_form ( form_number => 1,

RE: Scraping Data Behind a Form

2006-04-10 Thread Charles K. Clarkson
[EMAIL PROTECTED] wrote: : I don't follow - when I add the suggested line I do get No forms : at (the url). But there is a form on the page cited in the : script. There is no form on the page returned by the given url. You can double check it by navigating to that page in a browser and viewi

Re: Scraping Data Behind a Form

2006-04-10 Thread kc68
On Mon, 10 Apr 2006 19:03:25 -0400, Charles K. Clarkson <[EMAIL PROTECTED]> wrote: [EMAIL PROTECTED] wrote: : I'm trying to scrape the data behind the form at : http://www.theblackchurch.com/modules.php?name=Locator As a true : beginner with Perl (I know some php), I'm working from training

RE: Scraping Data Behind a Form

2006-04-10 Thread Charles K. Clarkson
[EMAIL PROTECTED] wrote: : I'm trying to scrape the data behind the form at : http://www.theblackchurch.com/modules.php?name=Locator As a true : beginner with Perl (I know some php), I'm working from training : scripts that scrape from another site. There are four scripts of : increasing comple