Re: Jensen–Shannon divergence

2015-12-13 Thread Ahmet Arslan
Hi Shay, I suggest you to extend o.a.l.search.similarities.SimilarityBase. All you need to implement a score() method. After all fancy names (language models, etc), a similarity is a function of seven salient statistics. It is actually six: avgFieldLength can derived from other two (numberOfFiel

Re: Jensen–Shannon divergence

2015-12-13 Thread will martin
Sorry it was early. If you go looking on the web, you can find, as I did reputable work on implementing DiricletLanguage Models. However, at this hour you might get answers here. Extrapolating others work into a lucene implantation is only slightly different from getting answers here. imo g'lu

Re: Jensen–Shannon divergence

2015-12-13 Thread Shay Hummel
Hi I am sorry but I didn't understand your answer. Can you please elaborate? Shay On Sun, Dec 13, 2015 at 3:41 PM will martin wrote: > expand your due diligence beyond wikipedia: > i.e. > > http://ciir.cs.umass.edu/pubfiles/ir-464.pdf > > > > > On Dec 13, 2015, at 8:30 AM, Shay Hummel wrote:

Re: Jensen–Shannon divergence

2015-12-13 Thread will martin
expand your due diligence beyond wikipedia: i.e. http://ciir.cs.umass.edu/pubfiles/ir-464.pdf > On Dec 13, 2015, at 8:30 AM, Shay Hummel wrote: > > LMDiricletbut its feasibilit

Jensen–Shannon divergence

2015-12-13 Thread Shay Hummel
Hi I need help to implement similarity between query model and document model. I would like to use the JS-Divergence for ranking documents. The documents and the query will be represented according to the language models approach -