Hey Guys,

Can you tell me how to compute TF-IDF using Lucene?

Regards
Vineel

On 7/28/14, 3:11 PM, "Prakash Dubey" <pkdapa...@gmail.com> wrote:

>In Lucene 2.4.0, the *org.apache.lucene.index.TermFreqVector* interface has a
>*getTermFrequencies()* method. You can use Apache Commons Math for the other
>mathematical operations. Also see the following blog post for more
>information:
><http://sujitpal.blogspot.in/2011/10/computing-document-similarity-using.html>
>Hope this helps.
>
>
>On Mon, Jul 28, 2014 at 6:19 AM, Erin Colvin <eschlapk...@yahoo.com.invalid> wrote:
>
>> Hi, I am working on my doctoral dissertation in CS and am trying to use
>> Lucene to implement custom similarity measures, namely MMM (Mixed Min and
>> Max), Paice, and p-norm, and then compare those results to the traditional
>> Boolean similarity and TF/IDF similarity.
>> So far I have tried creating a custom similarity and custom scoring, with
>> no luck. Right now I am able to pull the IDF value out of a search result,
>> but cannot for the life of me get the term frequency.
>> Is there any way to create a custom similarity measure and run it for
>> results, or at least can you help me pull the term frequencies out so I can
>> do the calculations myself?
>> I am a C++ and VB programmer who hasn't programmed in years, so my Java is
>> a bit juvenile, but I am 6 months away from being done with this degree and
>> really want to graduate with my class, so I am in need of help ASAP!
>> I'm using Eclipse with Lucene 4.0 so I can check my results in Luke.
>> Please help,
>> Erin Colvin
>>
>>
>> Erin Colvin
>> Computer Science Doctoral Student
>> Colorado Technical University
>> BS, MS, M.Ed
>>
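
For anyone who wants to do the calculations by hand once the term
frequencies are available, here is a minimal sketch of the arithmetic in
plain Java. It does not call Lucene at all. The class and method names
(TfIdfSketch, termFrequencies, docFreq, tf, idf, tfIdf) are made up for
illustration, and the tiny in-memory "documents" are invented sample data.
The two formulas are an assumption meant to mirror the classic TF/IDF
similarity in Lucene 4.x: tf = sqrt(freq) and
idf = 1 + ln(numDocs / (docFreq + 1)). A real Lucene score also includes
norms, boosts, and query-level factors, which are left out here. In an
actual index the per-document frequencies would have to be read from the
index (for example from term vectors); that part is not shown.

```java
import java.util.Arrays;
import java.util.HashMap;
import java.util.List;
import java.util.Map;

// Hypothetical helper, not part of Lucene: TF-IDF over tokenized documents.
final class TfIdfSketch {

    // Count how often each token occurs in a single document.
    static Map<String, Integer> termFrequencies(List<String> tokens) {
        Map<String, Integer> counts = new HashMap<>();
        for (String token : tokens) {
            counts.merge(token, 1, Integer::sum);
        }
        return counts;
    }

    // Number of documents that contain the term at least once.
    static int docFreq(List<List<String>> docs, String term) {
        int n = 0;
        for (List<String> doc : docs) {
            if (doc.contains(term)) {
                n++;
            }
        }
        return n;
    }

    // Assumed tf formula: square root of the raw frequency.
    static double tf(int freq) {
        return Math.sqrt(freq);
    }

    // Assumed idf formula: 1 + ln(numDocs / (docFreq + 1)).
    static double idf(int docFreq, int numDocs) {
        return 1.0 + Math.log((double) numDocs / (docFreq + 1));
    }

    // tf * idf for one term in one document of the collection.
    static double tfIdf(List<List<String>> docs, int docIndex, String term) {
        int freq = termFrequencies(docs.get(docIndex)).getOrDefault(term, 0);
        return tf(freq) * idf(docFreq(docs, term), docs.size());
    }

    public static void main(String[] args) {
        // Invented sample data: three already-tokenized documents.
        List<List<String>> docs = Arrays.asList(
                Arrays.asList("lucene", "term", "frequency", "lucene"),
                Arrays.asList("custom", "similarity", "lucene"),
                Arrays.asList("boolean", "similarity"));
        for (int i = 0; i < docs.size(); i++) {
            System.out.printf("doc %d: tf-idf(lucene) = %.4f%n",
                    i, tfIdf(docs, i, "lucene"));
        }
    }
}
```

With the per-term weights in hand, measures such as MMM, Paice, or p-norm
could be computed on top of them in the same way, outside of Lucene's own
scoring.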


---------------------------------------------------------------------
To unsubscribe, e-mail: java-user-unsubscr...@lucene.apache.org
For additional commands, e-mail: java-user-h...@lucene.apache.org
