Re: [OT] About stopwords

2008-11-27 Thread David Causse
Thanks for the tip, but I can't imagine the number of documents google has to join in order process such results... There must be a trick. Maybe stopwords are not indexed alone but twice with previous and next token, some sort of 2-gram index? David. Aleksander M. Stensby a écrit : Your que

Re: [OT] About stopwords

2008-11-27 Thread Michael McCandless
That's a phrase search, so it's conceivable google could be doing something similar to nutch, whereby adjacent ngrams are indexed as unique terms. But if you do the same search without quotes: http://www.google.fr/search?hl=fr&q=HOW+at+at+of+a+A+a&btnG=Rechercher&meta= they still find

Re: [OT] About stopwords

2008-11-27 Thread Aleksander M. Stensby
Your query includeds apostrophes which tells google to include common words in the query. But, if you remove the apostrophes, you will still get results, as google states: "Google ignores stop words when they're placed in searches alongside less common words. For example, a search for [ The

[OT] About stopwords

2008-11-27 Thread David Causse
Hi, Look at this google query : http://www.google.fr/search?q=%22HOW+at+at+of+a+A+a%22 What do you think about that concerning stop words? Google has no stop words? David. - To unsubscribe, e-mail: [EMAIL PROTECTED] For addi