Hello

 

I have used Soup, but I now prefer XMLHTMLParser, because I find it easier to 
relate the structure of the XML output to that of the original web page. It is 
also possible to use XPath to locate the target area more quickly. I 
particularly like Monty’s ‘Smalltalkish’ adaptation of XPath (see the last para 
of the class comment to XPath).

 

Hope this helps

 

Peter Kenny

 

From: Pharo-users [mailto:pharo-users-boun...@lists.pharo.org] On Behalf Of 
stepharo
Sent: 27 June 2016 10:03
To: pharo-users@lists.pharo.org
Subject: Re: [Pharo-users] If you have to do web data scraping, what tool would 
you use?

 

I scrapped all the magic cards (bad bad practice) using Soup. 

It could be easier but it worked for me. This is why I maintain it and added 
tests.

Stef

 

Le 26/6/16 à 23:14, Bernardo Ezequiel Contreras a écrit :

Hi,

Imagine that you have to do some data scraping work, what tool would you use?

I know about ZnClient, Soup, NeoCSV, NeoJSON, is there something else that i'm 
not aware of it?

 

thanks.

 

 

-- 

Bernardo E.C. 

 

Sent from a cheap desktop computer in South America.

 

Reply via email to