Hey guys, can you tell me how to compute TF-IDF using Lucene?
Regards,
Vineel

On 7/28/14, 3:11 PM, "Prakash Dubey" <pkdapa...@gmail.com> wrote:

>In Lucene 2.4.0, the *org.apache.lucene.index.TermFreqVector* interface has a
>*getTermFrequencies()* method. You can use Apache Commons Math for the other
>mathematical operations. Also see the following blog post
><http://sujitpal.blogspot.in/2011/10/computing-document-similarity-using.html>
>for more information.
>Hope this helps.
>
>
>On Mon, Jul 28, 2014 at 6:19 AM, Erin Colvin <eschlapk...@yahoo.com.invalid>
>wrote:
>
>> Hi, I am working on my doctoral dissertation in CS and am trying to use
>> Lucene to do custom similarity measures, namely MMM (Mixed Min and Max),
>> Paice and p-norm, and then compare those results to the traditional
>> Boolean similarity and TF/IDF similarity.
>> So far I have tried creating a custom similarity and custom scoring, with
>> no luck. Right now I am able to pull the IDF value out of a search result
>> but cannot for the life of me get the term frequency.
>> Is there any way to create a custom similarity measure and run it for
>> results, or at least can you help me pull the term frequencies out so I
>> can do the calculations myself?
>> I am a C++ and VB programmer who hasn't programmed in years, so my Java
>> is a bit juvenile, but I am 6 months away from being done with this degree
>> and really want to graduate with my class, so I am in need of help ASAP!
>> I'm using Eclipse with Lucene 4.0 so I can check my results in Luke.
>> Please help,
>> Erin Colvin
>>
>>
>> Erin Colvin
>> Computer Science Doctoral Student
>> Colorado Technical University
>> BS, MS, M.Ed
>>

---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org
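
For anyone who wants to "do the calculations myself" as Erin describes, here is a minimal sketch of the tf * idf arithmetic in plain Java, with no Lucene dependency. It assumes the formulas I recall from Lucene's classic TF-IDF similarity in the 4.x line (tf = sqrt(freq), idf = 1 + ln(numDocs / (docFreq + 1))); please check them against the Javadoc for your version. The class name TfIdfSketch, its method names and the in-memory corpus are hypothetical and only for illustration. Lucene's real score also applies length norms, boosts and query normalization, which this sketch leaves out.

```java
import java.util.Arrays;
import java.util.List;

// Hypothetical helper class, not part of Lucene.
class TfIdfSketch {

    // Raw term frequency: how often `term` occurs in one tokenized document.
    static int termFreq(List<String> doc, String term) {
        int count = 0;
        for (String token : doc) {
            if (token.equals(term)) {
                count++;
            }
        }
        return count;
    }

    // Document frequency: number of documents that contain `term` at least once.
    static int docFreq(List<List<String>> corpus, String term) {
        int count = 0;
        for (List<String> doc : corpus) {
            if (doc.contains(term)) {
                count++;
            }
        }
        return count;
    }

    // tf component (assumed classic formula): square root of the raw frequency.
    static double tf(int freq) {
        return Math.sqrt(freq);
    }

    // idf component (assumed classic formula): 1 + ln(numDocs / (docFreq + 1)).
    static double idf(int docFreq, int numDocs) {
        return 1.0 + Math.log(numDocs / (double) (docFreq + 1));
    }

    // tf * idf for one term in one document of the corpus.
    static double tfIdf(List<List<String>> corpus, int docIndex, String term) {
        int freq = termFreq(corpus.get(docIndex), term);
        int df = docFreq(corpus, term);
        return tf(freq) * idf(df, corpus.size());
    }

    public static void main(String[] args) {
        // Tiny made-up corpus, already tokenized and lowercased.
        List<List<String>> corpus = Arrays.asList(
            Arrays.asList("lucene", "search", "lucene", "index"),
            Arrays.asList("boolean", "search", "model"),
            Arrays.asList("term", "frequency", "vector"));

        System.out.println("tf-idf(lucene, doc 0) = " + tfIdf(corpus, 0, "lucene"));
        System.out.println("tf-idf(search, doc 1) = " + tfIdf(corpus, 1, "search"));
    }
}
```

With a real index, the raw frequency and document frequency would come from Lucene rather than from in-memory lists. In 4.x I believe that means reading per-document term vectors through IndexReader.getTermVector(...) and using IndexReader.docFreq(...) for the document counts, but treat that as a pointer to verify rather than a confirmed recipe.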