Re: Lucene 4.4.0 mergeSegments OutOfMemoryError

2013-10-07 Thread Michael van Rooyen
With forceMerge(1) throwing an OOM error, we switched to forceMergeDeletes() which worked for a while, but that is now also running out of memory. As a result, I've turned all manner of forced merges off. I'm more than a little apprehensive that if the OOM error can happen as part of a force

Performance/scoring impacts with multiple occurrences of a field

2013-10-07 Thread Earl Hood
Using Lucene 3. I know Lucene supports multiple occurrences of a field, and if one searches on that field, all fields are checked for hits. One question I have is if there is a performance difference between if all the data I want to index is represented by a single field vs multiple fields of t

Re: Getting integer value from BytesRef

2013-10-07 Thread 장용석
Peter. Thanks for reply. This code is just sample for question. Actually, I have index many documents. And the reason for try this, I want to get statistics of index file. Thanks and Regards. 2013/10/8 Peter Chang > Your doc freq is always 1. It's useless. > I don't know why you try to inde

Re: Getting integer value from BytesRef

2013-10-07 Thread 장용석
Thanks very much! Uwe. I have get right value using NumericUtils. And as you talk, there were many terms more than I have indexing. Thanks and Regards. 2013/10/8 Uwe Schindler > Hi, > > Use NumericUtils to convert the BytesRef back to a number: > http://goo.gl/3KG9Pd > But be careful, the term

optimal way to access many TermVectors

2013-10-07 Thread Rose, Stuart J
Is there an optimal way to access many document TermVectors (in the same chunk) consecutively when using the LZ4 termvector compression? I'm curious to know whether all TermVectors in a single compressed chunk are decompressed and cached when one TermVector in the same chunk is accessed? Also w

Re: Getting integer value from BytesRef

2013-10-07 Thread Peter Chang
Your doc freq is always 1. It's useless. I don't know why you try to index and search a binary field except for range searching. On Mon, Oct 7, 2013 at 11:23 PM, 장용석 wrote: > Dear, > > I have indexing integer field like this > > - > Document doc = new Document(); > FieldType fieldType = new

RE: Getting integer value from BytesRef

2013-10-07 Thread Uwe Schindler
Hi, Use NumericUtils to convert the BytesRef back to a number: http://goo.gl/3KG9Pd But be careful, the terms index contains more terms with lower precisions (bits stripped off), unless you use infinite precisionStep! Uwe - Uwe Schindler H.-H.-Meier-Allee 63, D-28213 Bremen http://www.theta

Getting integer value from BytesRef

2013-10-07 Thread 장용석
Dear, I have indexing integer field like this - Document doc = new Document(); FieldType fieldType = new FieldType(); fieldType.setIndexed(true); fieldType.setStored(true); fieldType.setTokenized(false); fieldType.setIndexOptions(IndexOptions.DOCS_AND_FREQS_AND_POSITIONS); fieldType.setStoreT

Re: Synonym Search in Lucene..

2013-10-07 Thread Koji Sekiguchi
(13/10/07 18:33), VIGNESH S wrote: Hi, How to implement synonym Search for All languages.. As far as i know,Wordnet has only English Support..Is there any other we can use to get support for all languages. I think most people make synonym data manually... I've never explored Wordnet, but I t

Synonym Search in Lucene..

2013-10-07 Thread VIGNESH S
Hi, How to implement synonym Search for All languages.. As far as i know,Wordnet has only English Support..Is there any other we can use to get support for all languages. -- Thanks and Regards Vignesh Srinivasan