What is the difference between PhraseQuery and BooleanQuery with BooleanClause.Occur.SHOULD

2016-03-31 Thread Sachin Kulkarni
Hi, I am using Lucene-5.0.0. If I had a qurey "New York" and if I use the BooleanQuery with the BooleanClause set to MUST on the two terms, is it the same as dong a PhraseQuery with the two terms? I am doing some 2-gram type queries and they are giving me different results with these two methods.

Re: Compression algorithm for posting lists

2016-03-31 Thread Adrien Grand
Are posting lists the biggest disk user of your index? Usually it is rather stored fields or term vectors. You can tell Lucene to compress stored fields more aggressively by passing BEST_COMPRESSION to the Lucene54Codec constructor. Also maybe there are some features of the index that you do not ne

Re: Compression algorithm for posting lists

2016-03-31 Thread Vishwas Jain
​Hi Adrien, Thanks for the help, actually we are trying to compress ​the actual posting lists. Our main aim here is to save the disk space as much as possible occupied by the index created. Is only compressing the posting lists will suffice the problem or we have to explore more opti