Getting the number of indexed fields in an index

2008-02-12 Thread marc.dumontier
Hi, Does anyone have a code snippet which would allow me to ask my index how many instances of a field are indexed? Thanks, Marc Dumontier Manager, Software Development Thomson Scientific (Canada) 1 Yonge Street, Suite 1801 Toronto, Ontario M5E 1W7 Direct +1 416 214 3448 Mobile +

RE: large term vectors

2008-02-11 Thread marc.dumontier
Hi Grant, Lucene 2.2.0 I'm not actually explicitely storing term vectors. It seems the huge amount of byte arrays is actually coming from SegmentReader.norms. Maybe that cache constantly grows as I read somewhere that it's on-demand. I'm not using any field or document boosting..is there some way

RE: large term vectors

2008-02-11 Thread marc.dumontier
No, it's split into about 100 individual indexes. But I'm running my 64-bit JVM with around 10gb max memory in order to avoid running out of memory after running all my unit tests (I have some other indexes as well running as part of this application). Upon further investigation, it seems to have

large term vectors

2008-02-08 Thread marc.dumontier
Hi, I have a large index which is around 275GB. As I search different parts of the index, the memory footprint grows with large byte arrays being stored. They never seem to get unloaded or GC'ed. Is there any way to control this behavior so that I can periodically unload cached information?

NumberTools - Range Searches

2007-09-18 Thread marc.dumontier
Hi, I'm trying to do Range searches on a numeric field and have come across the lexicographical behavior, such that mylength: [0-2] brings back values such as 1172 I've seen prior posts about the NumberTools class, and have indexed a field using the longToString() method. Do