Re: Inverted search / Search on profilenet

2008-01-19 Thread Otis Gospodnetic
From: Mark Miller <[EMAIL PROTECTED]> To: java-user@lucene.apache.org Sent: Thursday, January 17, 2008 7:57:38 AM Subject: Re: Inverted search / Search on profilenet Verity, autonomy, whatever, has a what they call a reverse query system called profilenet. A profile is just a query

Re: Inverted search / Search on profilenet

2008-01-17 Thread markharw00d
There is a trick to indexing queries in this way... you need only index the rarest term in queries which have one or more mandatory terms. As an example - for the phrase query "XYZ Group limited" you need only index the rarest term "XYZ" and thus avoid the selecting the query for execution with

Re: Inverted search / Search on profilenet

2008-01-17 Thread Vieri
using this technique it wasn't enough, if I recall we > could run like 10.000 profiles with good performance. And as you say it > doesn't scale well at all. > > > > / > Regards > Marcus > > > > > > -----Ursprungligt meddelande- > Från: Mark Mill

Re: Inverted search / Search on profilenet

2008-01-17 Thread Mark Miller
Verity, autonomy, whatever, has a what they call a reverse query system called profilenet. A profile is just a query (or I guess more than one query?) and you can setup a bunch of them. Then you supply the document and you will get the matching queries as well as a score. They say its the oppos

Re: Inverted search / Search on profilenet

2008-01-17 Thread Endre Stølsvik
Mark Miller wrote: In any case, it shouldnt be that difficult to rig something. Is the profilenet system even that valuable? Sounds a bit hokey to me, but then im just a kid that has never used it May I ask: What IS a profilenet? I ask since this obviously is something that you two hit off o

Re: Inverted search / Search on profilenet

2008-01-16 Thread Mark Miller
0k new articles every day. > > > Any way... We really need to find a good api / some one that knows how to > add inverted searching to lucene. > > /Regards > Marcus > > > > > > > -----Ursprungligt meddelande----- > Från: Mark Miller [mailto:[EMAIL PROTECTED]

Re: Inverted search / Search on profilenet

2008-01-16 Thread Mark Miller
Don't have any info to add, but out of curiosity, what kind of setup are you using to host the 300 mil archive? Is the index distributed? Single machine? Solr? Thanks, Mark On Jan 16, 2008 12:27 PM, Marcus Falk <[EMAIL PROTECTED]> wrote: > Hi again, > > > > Today we are hosting a 300 million la