hi when you are indexing, use termvectors org.apache.lucene.document.Field.TermVector set this in the Field object constructor when you create your Field objects at index time.
i've never done it but i'm pretty sure these can be retrieved at search time using one of the IndexReader.getTermFreqVector methods. lucene in action has a really good section on using termfreqvectors: http://www.manning.com/hatcher3/ if you want the positional info too e.g. the two positions of the "question" word in your example then have a look at the org.apache.lucene.search.spans.SpanTermQuery class -- in the getSpans method -- it grabs the terms + positions using the IndexReader as well: reader.termPositions(term) hope that helps, bec :) On 2 June 2010 04:53, Sirish Vadala <sirishre...@gmail.com> wrote: > > Hello All: > > Can any one suggest me the best way to get the no. of occurrences of each > word per document in Lucene? > > Eg: Let the indexed text be: > > If you are posting a question, please try search first. Your question may > have already been answered. > > Now if I search for the word 'question', then I would like to get this > document along with the number of occurrences of question in the document, > in the above case it would be 2. > > Any hint would be appreciated. > Thanks. > > > -- > View this message in context: > http://lucene.472066.n3.nabble.com/Problem-fetching-number-of-occurrences-tp862859p862859.html > Sent from the Lucene - Java Users mailing list archive at Nabble.com. > > --------------------------------------------------------------------- > To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org > For additional commands, e-mail: java-user-h...@lucene.apache.org > > --------------------------------------------------------------------- To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org For additional commands, e-mail: java-user-h...@lucene.apache.org