Terms.getSumTotalTermFreq() in Lucene 4.0

2013-01-03 Thread 장용석
Hello. I have some questions. Document 1: "learning perl learning java learning ruby" Document 2: "perl test" I have indexed these documents with StoreTermVectors(true) and IndexOptions.DOCS_AND_FREQS. The field name is "f". Then I executed this code: IndexReader ir = IndexReader.open(dir); Terms

Re: More about storing NLP-type stuff in the index

2013-01-03 Thread Michael Sokolov
On 1/3/2013 6:16 PM, Wu, Stephen T., Ph.D. wrote: I think we've been saying that if we put something in a Payload, it will be indexed. From what I understand of the indexing format, that means that what you put in the Payload will be stored in the Lucene index... But it won't *itself* be indexed

More about storing NLP-type stuff in the index

2013-01-03 Thread Wu, Stephen T., Ph.D.
I think we've been saying that if we put something in a Payload, it will be indexed. From what I understand of the indexing format, that means that what you put in the Payload will be stored in the Lucene index... But it won't *itself* be indexed & optimized for search. That's good, but can we bu

Re: potential memory leak when using RAMDirectory, CloseableThreadLocal and a thread pool.

2013-01-03 Thread Robert Muir
On Thu, Jan 3, 2013 at 12:16 PM, Alon Muchnick wrote: > value org.apache.lucene.index.TermInfosReader$ThreadResources ---> > > termInfoCache |org.apache.lucene.util.cache.SimpleLRUCache > termEnum |org.apache.lucene.index.SegmentTermEnum > You aren't using lucene 3.6.2 if you have th

Re: potential memory leak when using RAMDirectory, CloseableThreadLocal and a thread pool.

2013-01-03 Thread Michael McCandless
On Thu, Jan 3, 2013 at 12:16 PM, Alon Muchnick wrote: > Hi Mike, > > at the peak there are 500 live threads going through Lucene (not all of > them at the same time; the Tomcat thread pool uses round robin). Regarding > the Directory impl, we are using RAMDirectory. > The object that takes most of

Re: potential memory leak when using RAMDirectory, CloseableThreadLocal and a thread pool.

2013-01-03 Thread Alon Muchnick
Hi Mike, at the peak there are 500 live threads going through Lucene (not all of them at the same time; the Tomcat thread pool uses round robin). Regarding the Directory impl, we are using RAMDirectory. The object that takes most of the heap is the "hardRefs" WeakHashMap class member in the Close