From: Mark Miller <[EMAIL PROTECTED]>
To: java-user@lucene.apache.org
Sent: Thursday, January 17, 2008 7:57:38 AM
Subject: Re: Inverted search / Search on profilenet
Verity, autonomy, whatever, has a what they call a reverse query system
called profilenet. A profile is just a query
There is a trick to indexing queries in this way... you need only index
the rarest term in queries which have one or more mandatory terms.
As an example - for the phrase query "XYZ Group limited" you need only
index the rarest term "XYZ" and thus avoid the selecting the query for
execution with
using this technique it wasn't enough, if I recall we
> could run like 10.000 profiles with good performance. And as you say it
> doesn't scale well at all.
>
>
>
> /
> Regards
> Marcus
>
>
>
>
>
> -----Ursprungligt meddelande-
> Från: Mark Mill
Verity, autonomy, whatever, has a what they call a reverse query system
called profilenet. A profile is just a query (or I guess more than one
query?) and you can setup a bunch of them. Then you supply the document
and you will get the matching queries as well as a score. They say its
the oppos
Mark Miller wrote:
In any case, it shouldnt be that difficult to rig something. Is the
profilenet system even that valuable? Sounds a bit hokey to me, but then im
just a kid that has never used it
May I ask: What IS a profilenet? I ask since this obviously is something
that you two hit off o
0k new articles every day.
>
>
> Any way... We really need to find a good api / some one that knows how to
> add inverted searching to lucene.
>
> /Regards
> Marcus
>
>
>
>
>
>
> -----Ursprungligt meddelande-----
> Från: Mark Miller [mailto:[EMAIL PROTECTED]
Don't have any info to add, but out of curiosity, what kind of setup are you
using to host the 300 mil archive? Is the index distributed? Single machine?
Solr?
Thanks,
Mark
On Jan 16, 2008 12:27 PM, Marcus Falk <[EMAIL PROTECTED]> wrote:
> Hi again,
>
>
>
> Today we are hosting a 300 million la