On 10/26/18 9:25 AM, Tim via users wrote:
> Allegedly, on or about 26 October 2018, Eddie O'Connor sent:
>> "Crawling"....is that the same as "Parsing"?
> 
> Going from page to page, or site to site, parsing the contents. 
> Whether that be following links from one page to another, using the
> links on those pages, or following links from some other list.
> 
> As opposed to just parsing the contents of one particular page.

Other terms for what Bruce is referring to doing are "scraping" or
"spidering". One uses a tool such as wget or something similar to walk
down a website (or several sites), collect the data and "scrape"
interesting tidbits into a database for use in some way. In some
respects, this is what Google or Bing or Yahoo or (this'll date me)
Alta Vista does to drive their search engines (I think Alta Vista is
long gone--it was owned by DEC).

A huge part of this is the pattern recognition bit and often employs
different types of AI to extract the information one is interested in.
It's quite an involved process and very impressive if it's done right.
----------------------------------------------------------------------
- Rick Stevens, Systems Engineer, AllDigital    ri...@alldigital.com -
- AIM/Skype: therps2        ICQ: 226437340           Yahoo: origrps2 -
-                                                                    -
-  Perseverance:  When you're too damned stubborn to say "I quit!"   -
----------------------------------------------------------------------
_______________________________________________
users mailing list -- users@lists.fedoraproject.org
To unsubscribe send an email to users-le...@lists.fedoraproject.org
Fedora Code of Conduct: https://getfedora.org/code-of-conduct.html
List Guidelines: https://fedoraproject.org/wiki/Mailing_list_guidelines
List Archives: 
https://lists.fedoraproject.org/archives/list/users@lists.fedoraproject.org

Reply via email to