hashing comparison implemented. How do i find a PHF?
Please reply soon..
--
View this message in context:
http://lucene.472066.n3.nabble.com/posting-list-strings-tp4075816p4091028.html
Sent from the Lucene - Java Users mailing list archive at Nabble.com
er
>> H.-H.-Meier-Allee 63, D-28213 Bremen
>> http://www.thetaphi.de
>> eMail: u...@thetaphi.de
>>
>>
>> -Original Message-
>>> From: Adrien Grand [mailto:jpou...@gmail.com]
>>> Sent: Tuesday, July 09, 2013 11:25 PM
>>> To: java-user@lucene.apa
http://www.thetaphi.de
eMail: u...@thetaphi.de
-Original Message-
From: Adrien Grand [mailto:jpou...@gmail.com]
Sent: Tuesday, July 09, 2013 11:25 PM
To: java-user@lucene.apache.org
Subject: Re: posting list strings
Hi,
Lucene stores the string because it may need it to run prefix or
eMail: u...@thetaphi.de
>
>
> > -Original Message-
> > From: Adrien Grand [mailto:jpou...@gmail.com]
> > Sent: Tuesday, July 09, 2013 11:25 PM
> > To: java-user@lucene.apache.org
> > Subject: Re: posting list strings
> >
> > Hi,
> >
>
, 2013 11:25 PM
> To: java-user@lucene.apache.org
> Subject: Re: posting list strings
>
> Hi,
>
> Lucene stores the string because it may need it to run prefix or range
> queries. We don't have a hash-based terms dictionary right now but I know
> some people wrote
Hi,
Lucene stores the string because it may need it to run prefix or range
queries. We don't have a hash-based terms dictionary right now but I
know some people wrote one since they don't need support for these
queries, see for instance the Earlybird paper[1]. Then if you can find
a perfect hashin
It looks like Lucene stores the string names of the posting lists in the
index. How compact is this storage (when there may be a very large number
of posting lists, and the string lengths may be large - for example, I may
have an entry that looks like "Cn:4593846->8957363,485". I've seen other
sy