In Lucene 2.4.0 there is *org.apache.lucene.index.TermFreqVector* method *getTermFrequencies()* method. You can use Apache Maths for other mathematical operations. Also see following Blog <http://sujitpal.blogspot.in/2011/10/computing-document-similarity-using.html> for more information. Hope this helps..
On Mon, Jul 28, 2014 at 6:19 AM, Erin Colvin <eschlapk...@yahoo.com.invalid> wrote: > Hi, I am working on my doctoral dissertation in CS and am trying to use > Lucene to do custom similarity measures, namely MMM (mixed, min and Max), > Paice and p-norm and then compare those results to the traditional Boolean > similarity and TF/IDF similarity. > So far I have tried creating custom similarity, and scoring with no luck > and right now I am able to pull the IDF value out of a search result but > cannot for the life of me get the term frequency. > Is there any way to create a custom similarity measure and run it for > results or at least can you help me pull the term frequencies out so I can > do the calculations myself? > I am a C++ and VB programmer who hasn't programmed in years so my java is > a bit juvenile but I am 6 months away from being down with this degree and > really want to graduate with my class so I am in need of help ASAP!!!! > I'm using Eclipse with Lucene 4.0 so I can check my results in Luke. > Please help, > Erin Colvin > > > Erin Colvin > Computer Science Doctoral Student > Colorado Technical University > BS, MS, M.Ed >