Re: Inverted search / Search on profilenet

2008-01-17 Thread Endre Stølsvik
Mark Miller wrote: In any case, it shouldn't be that difficult to rig something. Is the profilenet system even that valuable? Sounds a bit hokey to me, but then I'm just a kid that has never used it. May I ask: What IS a profilenet? I ask since this obviously is something that you two hit off o

Re: Splitting of words

2005-09-22 Thread Endre Stølsvik
| The StandardTokenizer is the most sophisticated one built into Lucene. You | can see the types of tokens it emits by looking at the javadoc here: | | | It recognizes e-mail addresses, interi

Re: Splitting of words

2005-09-27 Thread Endre Stølsvik
On Thu, 22 Sep 2005, Erik Hatcher wrote: | | On Sep 22, 2005, at 4:36 AM, Endre Stølsvik wrote: | | > | > | The StandardTokenizer is the most sophisticated one built into Lucene. | > You | > | can see the types of tokens it emits by looking at the javadoc here: |

Re: Single Analyzer for multiple European languages

2005-09-27 Thread Endre Stølsvik
On Mon, 26 Sep 2005, Andrzej Bialecki wrote: | Shashikant Kore wrote: | | > Search: | > - Get the superset of stopwords by merging the stopwords from all the | > languages. | | This step doesn't make sense. Stopwords ARE language specific. A stopword in | one language may be a valid content word
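
The objection above can be illustrated with a small sketch. It is not Lucene code: the two stopword lists are tiny, hypothetical stand-ins for real ones, and `remove_stopwords` is a helper invented for this example. It only shows how a merged superset can discard a word that carries meaning in one of the languages.

```python
# Hypothetical, deliberately tiny stopword lists; real lists are much longer.
STOPWORDS = {
    "en": {"the", "a", "of", "and", "in"},
    "de": {"der", "die", "das", "und", "in"},
}


def remove_stopwords(tokens, stopwords):
    """Return the tokens that are not in the given stopword set."""
    return [t for t in tokens if t.lower() not in stopwords]


# Merge every language's list into one superset, as proposed in the thread.
merged = set().union(*STOPWORDS.values())

english_tokens = "the plants die in winter".split()

# Filtering with the English list keeps the content word "die".
per_language = remove_stopwords(english_tokens, STOPWORDS["en"])

# Filtering with the merged superset drops "die", because it is a German article.
with_superset = remove_stopwords(english_tokens, merged)

print(per_language)
print(with_superset)
```

With these assumed lists, the English verb "die" survives per-language filtering but is lost once the German articles are merged in, which is the kind of collision the reply warns about.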