Re: org.apache.lucene.index.CorruptIndexException: checksum failed

2016-07-20 Thread Ziming Dong
I upgrade lucene to 6.1.0, hope it works, thank you very much! the mount( contain all data directories and index directories ) is shared by four different computers, but each index directory can be accessed only by one computer, we have four index directories. 2016-07-21 5:13 GMT+08:00 Michael Mc

Re: Query documents where Field Doesn't Exist

2016-07-20 Thread Vlad K
Hi Adrien, I had a chance to test and I see that there is one more solution. For fields that we want to search for exist/doesn't exist add one more indexed field, like "ex_field=1" and can search by: +ex_field=1 or -ex_field=1. It works fast. On Fri, Nov 13, 2015 at 5:21 AM, Adrien Grand wrote:

Re: org.apache.lucene.index.CorruptIndexException: checksum failed

2016-07-20 Thread Michael McCandless
This means that something is flipping bits from when Lucene wrote this file, to when Lucene wanted to merge it. NFS is dangerous to use with Lucene: it doesn't support delete-on-final-close semantics, its client side directory listing caches can be stale, locking is iffy, etc. If you are using a

Setting LMJelinekMercer Similarity in Luke

2016-07-20 Thread Dwaipayan Roy
​Hello. I want to set LMJelinekMercer Similarity (with lambda set to, say, 0.6) for the Luke similarity calculation. Luke by default use the DefaultSimilarity. Can​ anyone help with this? I use Lucene 4.10.4 and Luke for that version of Lucene index. Dwaipayan.. ​

Re: Query the doc frequency across multiple search field.

2016-07-20 Thread Adrien Grand
Note that if you only have two fields A and B, you could make it faster by returning `docFreq(A)+docFreq(B)-IndexSearcher.count(A AND B)` rather than `IndexSearcher.count(A OR B)` since Lucene is typically faster at running conjunctions than disjunctions. Le mer. 20 juil. 2016 à 15:41, Xiaolong Zh

Re: Query the doc frequency across multiple search field.

2016-07-20 Thread Xiaolong Zheng
Thanks! The use case that I am having is I am trying to calculate the docFreq for the suggestion word which produced by my "did you mean"/"spellcheck" feature. I was trying to avoid to having a second search request. But it seems in this case, I have to formula another search query to do the job.

Re: Query the doc frequency across multiple search field.

2016-07-20 Thread Adrien Grand
There is no way to get this statistic in constant-time. If you need it for scoring, you need to make approximations. For instance, BlendedTermQuery uses the max of the doc freqs as the aggregated doc freq. Otherwise, you can also compute this number by running a BooleanQuery with one SHOULD clause

org.apache.lucene.index.CorruptIndexException: checksum failed

2016-07-20 Thread 董自鸣
I use ubuntu to run lucene(java8, lucene6.0.0, not ES), and nfs to mount disk(actually is rocketstor) of macmini. I get this error constantly. I have already used org.apache.lucene.index.CheckIndex, but will still crash after restart program. Exception in thread "main" org.apache.lucene.store.Alr