RE: fastest way to gather simple terms that match documents?

2010-04-05 Thread Chris Hostetter
: Alternatively index your documents with term vectors for the field enabled: ... : And then use IndexReader.getTermFreqVector() with the matching doc ID: Uwe: this is an area i'm not particularly strong on, so i'm curious: do you expect that the TermFreqVector approach would be faster t

RE: fastest way to gather simple terms that match documents?

2010-04-05 Thread Uwe Schindler
Alternatively index your documents with term vectors for the field enabled: http://lucene.apache.org/java/3_0_1/api/all/org/apache/lucene/document/Field.TermVector.html And then use IndexReader.getTermFreqVector() with the matching doc ID: http://lucene.apache.org/java/3_0_1/api/all/org/apache/l

Re: fastest way to gather simple terms that match documents?

2010-04-05 Thread Chris Hostetter
: After I've run a query I need to know which terms matched each : result document (ie doc termfrequency>0). ... : I don't care how many were found or what position or anything else. : just which ones matched. if all you care about is simple "which terms does it have" you can take yo

Re: Apache Lucene EuroCon Call For Participation: Prague, Czech Republic May 20 & 21, 2010

2010-04-05 Thread Grant Ingersoll
Just a reminder, just over one week left open on the CFP. Some great talks entered already. Keep it up! On Mar 24, 2010, at 8:03 PM, Grant Ingersoll wrote: > Apache Lucene EuroCon Call For Participation - Prague, Czech Republic May 20 > & 21, 2010 > > All submissions must be received by Tue