getTermFreqVector atomicity

2007-05-21 Thread Walter Ferrara
I'm interested in getting the term vector of a Lucene doc. The point is, it seems I have to give IndexReader.getTermFreqVector a doc ID, while I would like to know if there is a way to get the term vector by a doc identifier (not the Lucene doc id, but a field of my own). I know how to get the lucene docid

Queries on small subset in a very large index

2007-05-22 Thread Walter Ferrara
Hi, I need to execute a query on a subset of documents (I know their ids) and it has to be very fast. I've made a Filter that sets the bitset only for the needed docids. The point is, the subset is very small compared with the index, which is very big (subset size is always below 0.05% of the total number of
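
The filter idea described in this message can be sketched with plain java.util.BitSet, which is the type a Lucene Filter of that era returned from bits(IndexReader). This is only an illustration under stated assumptions, not the poster's code: the class SubsetFilterSketch, the methods buildAllowed and allowedDocs, and the sample ids are all hypothetical, and no Lucene classes are used so the sketch stays self-contained.

```java
import java.util.ArrayList;
import java.util.BitSet;
import java.util.List;

// Hypothetical, Lucene-free sketch of a "small subset" doc-id filter.
public class SubsetFilterSketch {

    // Builds a bitset sized to the whole index (maxDoc) with only the wanted ids set.
    // Ids outside [0, maxDoc) are ignored.
    static BitSet buildAllowed(int maxDoc, int[] wantedDocIds) {
        BitSet bits = new BitSet(maxDoc);
        for (int id : wantedDocIds) {
            if (id >= 0 && id < maxDoc) {
                bits.set(id);
            }
        }
        return bits;
    }

    // Collects the allowed ids in ascending order by jumping from one set bit
    // to the next with nextSetBit, instead of testing every doc id in turn.
    static List<Integer> allowedDocs(BitSet allowed) {
        List<Integer> out = new ArrayList<>();
        for (int doc = allowed.nextSetBit(0); doc >= 0; doc = allowed.nextSetBit(doc + 1)) {
            out.add(doc);
        }
        return out;
    }

    public static void main(String[] args) {
        // Assumed sizes: a one-million-doc index and a three-doc subset.
        BitSet allowed = buildAllowed(1_000_000, new int[] {42, 7, 999_999});
        System.out.println(allowed.cardinality() + " allowed docs: " + allowedDocs(allowed));
    }
}
```

Note that the bitset still reserves space in proportion to the index size even when only a handful of bits are set, which may matter when the subset is tiny relative to the index.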

Re: regaridng Reader.terms()

2007-05-22 Thread Walter Ferrara
If Reader.terms() gives you the two separate terms "text3" and "text4" while you expect the single term "text3 text4", you should, I presume, change the Analyzer, perhaps by writing your own. Mohammad Norouzi wrote: > Hi all > > consider following index > > field1 field2 field3 > text1 text1 text2

Re: regaridng Reader.terms()

2007-05-22 Thread Walter Ferrara
t factor or any other feature of > lucene, I > need just searching through the index. > > > On 5/22/07, Walter Ferrara <[EMAIL PROTECTED]> wrote: >> >> If Reader.terms() gives you: >> text3 >> text4 >> while you expect >> text3 text4 >>

RAMDirectory issue

2007-05-26 Thread Walter Ferrara
I have an index on disk. I'm using RAMDirectory( String ) to load the index into RAM. From time to time I need to reload the index (when it is no longer current), but I keep getting an out-of-memory exception. Making the JVM heap bigger helps, but it only allows more reloads; eventually it will throw t