Re: problem found with DiskDocValuesFormat

2013-10-22 Thread Duke DAI
er thread creation is heavy or ThreadLocal cleanup from outside is > > complicated. > > My test shows NumericDocValues will return wrong value, but sure that > it's > > a long value, upper logic can verify whether the value is valid or not. > > > > As I described i

Re: problem found with DiskDocValuesFormat

2013-10-21 Thread Michael McCandless
e, but sure that it's > a long value, upper logic can verify whether the value is valid or not. > > As I described in earlier mail, in Lucene4.4 > Lucene42DocValuesFormat(in-memory) > has no problem, DiskDocValuesFormat(in-disk) has problem. Now in > Lucene4.5, MemoryDocValue

Re: problem found with DiskDocValuesFormat

2013-10-21 Thread Duke DAI
from outside is complicated. My test shows NumericDocValues will return a wrong value, but it is certainly a long value, so upper-level logic can verify whether the value is valid or not. As I described in an earlier mail, in Lucene 4.4 Lucene42DocValuesFormat (in-memory) has no problem, DiskDocValuesFormat (in

Re: problem found with DiskDocValuesFormat

2013-10-21 Thread Michael McCandless
:28 AM, Duke DAI wrote: > Hi guys, > > Seems I have the same problem with Lucene45DocValuesFormat, no problem with > MemoryDocValuesFormat. The problem I encountered with Lucene4.4 is with > DiskDocValuesFormat, no with Lucene42DocValuesFormat. > > I dig into a little and

Re: problem found with DiskDocValuesFormat

2013-10-21 Thread Duke DAI
Hi guys, It seems I have the same problem with Lucene45DocValuesFormat, but no problem with MemoryDocValuesFormat. The problem I encountered with Lucene 4.4 is with DiskDocValuesFormat, not with Lucene42DocValuesFormat. I dug in a little and found the superficial cause. In SegmentCoreReaders, there is

Re: problem found with DiskDocValuesFormat

2013-08-22 Thread Sean Bridges
> We can't reindex every time we upgrade lucene since our indexes are too > > large. Should we copy the code from DiskDocValuesFormat and call it > > CustomDiskDocValuesFormat, and give CustomDiskDocValuesFormat a new name > so > > that when we upgrade lucene, we won

Re: problem found with DiskDocValuesFormat

2013-08-21 Thread Robert Muir
time we upgrade lucene since our indexes are too > large. Should we copy the code from DiskDocValuesFormat and call it > CustomDiskDocValuesFormat, and give CustomDiskDocValuesFormat a new name so > that when we upgrade lucene, we won't use an incompatible version of > DiskDocValu

Re: problem found with DiskDocValuesFormat

2013-08-21 Thread Sean Bridges
code from DiskDocValuesFormat and call it CustomDiskDocValuesFormat, and give CustomDiskDocValuesFormat a new name so that when we upgrade lucene, we won't use an incompatible version of DiskDocValuesFormat? Thanks, Sean On Wed, Aug 21, 2013 at 8:44 AM, Robert Muir wrote: > On Wed, Aug 21

Re: problem found with DiskDocValuesFormat

2013-08-21 Thread Robert Muir
On Wed, Aug 21, 2013 at 11:30 AM, Sean Bridges wrote: > What is the recommended way to use DiskDocValuesFormat in production if we > can't reindex when we upgrade? I'm not going to recommend using any experimental codecs in production, but... 1. with 4.3 jar file: IWC.setCodec(

Re: problem found with DiskDocValuesFormat

2013-08-21 Thread Sean Bridges
What is the recommended way to use DiskDocValuesFormat in production if we can't reindex when we upgrade? Will the 4.4 version of DDVF be backwards compatible, or should we make our own copy of DDVF and give it a different codec name to protect ourselves against incompatible changes? T

Re: problem found with DiskDocValuesFormat

2013-08-13 Thread Duke DAI
> On Tue, Aug 13, 2013 at 4:54 AM, Duke DAI wrote: > > Hi experts, > > > > I'm upgrading Lucene 4.4 and trying to use DocValues instead of store > field > > for performance reason. But due to unknown size of index(depends on > > customer), so I will use Dis

Re: problem found with DiskDocValuesFormat

2013-08-13 Thread Michael McCandless
http://blog.mikemccandless.com On Tue, Aug 13, 2013 at 4:54 AM, Duke DAI wrote: > Hi experts, > > I'm upgrading Lucene 4.4 and trying to use DocValues instead of store field > for performance reason. But due to unknown size of index(depends on > customer), so I will use DiskDocValuesFo

problem found with DiskDocValuesFormat

2013-08-13 Thread Duke DAI
Hi experts, I'm upgrading to Lucene 4.4 and trying to use DocValues instead of stored fields for performance reasons. Because the index size is unknown (it depends on the customer), I will use DiskDocValuesFormat, especially for some binary fields. Then I wrote my customized Codec: final Codec

Re: DiskDocValuesFormat

2013-04-14 Thread Wei Wang
at > > > > > org.apache.lucene.index.ConcurrentMergeScheduler$MergeThread.run(ConcurrentMergeScheduler.java:514) > > > > I configured jvm with "-Xmx4096m", and it seems still not enough memory. > I > > thought DiskDocValuesFormat puts most of the

Re: DiskDocValuesFormat

2013-04-14 Thread Robert Muir
pache.lucene.index.ConcurrentMergeScheduler$MergeThread.run(ConcurrentMergeScheduler.java:514) > > I configured jvm with "-Xmx4096m", and it seems still not enough memory. I > thought DiskDocValuesFormat puts most of the data on disk and there should > not be that much memory consumptio

Re: DiskDocValuesFormat

2013-04-14 Thread Wei Wang
geScheduler.java:541) at org.apache.lucene.index.ConcurrentMergeScheduler$MergeThread.run(ConcurrentMergeScheduler.java:514) I configured the JVM with "-Xmx4096m", and it still seems not to be enough memory. I thought DiskDocValuesFormat puts most of the data on disk and there should not be tha

Re: DiskDocValuesFormat

2013-04-14 Thread Wei Wang
JAR file with a ZIP > > > program and check that all files in META-INF/services contain all > > > entries merged from all Lucene JARs. > > > > > > Uwe > > > > > > - > > > Uwe Schindler > > > H.-H.-Meier-Allee 63, D-28213 Brem

RE: DiskDocValuesFormat

2013-04-14 Thread Uwe Schindler
ARs. > > > > Uwe > > > > - > > Uwe Schindler > > H.-H.-Meier-Allee 63, D-28213 Bremen > > http://www.thetaphi.de > > eMail: u...@thetaphi.de > > > > > > > -Original Message- > > > From: Wei Wang [mailto:welshw...@gmail.com] >

Re: DiskDocValuesFormat

2013-04-14 Thread Wei Wang
inal Message- > > From: Wei Wang [mailto:welshw...@gmail.com] > > Sent: Sunday, April 14, 2013 11:49 PM > > To: java-user@lucene.apache.org > > Subject: Re: DiskDocValuesFormat > > > > Yes, I used Maven Shade plugin, but still have this problem. Here is the

RE: DiskDocValuesFormat

2013-04-14 Thread Uwe Schindler
e.apache.org > Subject: Re: DiskDocValuesFormat > > Yes, I used Maven Shade plugin, but still have this problem. Here is the > Maven output during packaging: > > [INFO] --- maven-shade-plugin:2.0:shade (default) @ audience-profile- > indexer --- [INFO] Including commons-collections:co

Re: DiskDocValuesFormat

2013-04-14 Thread Wei Wang
3, D-28213 Bremen > http://www.thetaphi.de > eMail: u...@thetaphi.de > > > > -Original Message- > > From: Wei Wang [mailto:welshw...@gmail.com] > > Sent: Sunday, April 14, 2013 11:30 PM > > To: java-user@lucene.apache.org > > Subject: Re: DiskDocValuesFormat >

RE: DiskDocValuesFormat

2013-04-14 Thread Uwe Schindler
://www.thetaphi.de eMail: u...@thetaphi.de > -Original Message- > From: Wei Wang [mailto:welshw...@gmail.com] > Sent: Sunday, April 14, 2013 11:30 PM > To: java-user@lucene.apache.org > Subject: Re: DiskDocValuesFormat > > Hi Adrien, > > The Lucene42Codec works well

Re: DiskDocValuesFormat

2013-04-14 Thread Wei Wang
Hi Adrien, The Lucene42Codec works well to generate the index with DiskDocValuesFormat. But when I tried to merge the index segments by calling: IndexWriter iw = new IndexWriter(directory, iw_config); ... iw.forceMerge(1); I got the following error message: Caused by

Re: DiskDocValuesFormat

2013-04-13 Thread Wei Wang
Hi Adrien, Thanks for your example. Really helpful! Wei On Sat, Apr 13, 2013 at 4:25 AM, Adrien Grand wrote: > Hi Wei, > > On Sat, Apr 13, 2013 at 7:44 AM, Wei Wang wrote: > > I am trying to use DiskDocValuesFormat for a particular > > BinaryDocValuesField. It seems ther

Re: DiskDocValuesFormat

2013-04-13 Thread Adrien Grand
Hi Wei, On Sat, Apr 13, 2013 at 7:44 AM, Wei Wang wrote: > I am trying to use DiskDocValuesFormat for a particular > BinaryDocValuesField. It seems there is no good examples showing how to do > this. The only hint I got from various docs and forums is set some codec in > IndexW

DiskDocValuesFormat

2013-04-12 Thread Wei Wang
I am trying to use DiskDocValuesFormat for a particular BinaryDocValuesField. It seems there are no good examples showing how to do this. The only hint I got from various docs and forums is to set some codec in IndexWriter. Could someone give a few lines of code and show how to set