Re: The current default similarity implementation of Lucene is BM25, right?

2022-11-23 Thread Michael Wechner
o ask whether the current default similarity implementation of Lucene is really BM25, right? as described at https://opensourceconnections.com/blog/2015/10/16/bm25-the-next-generation-of-lucene-relevation/ Thanks Michael ---

Re: The current default similarity implementation of Lucene is BM25, right?

2022-11-23 Thread Michael Wechner
don't write anything wrong I would like to ask whether the current default similarity implementation of Lucene is really BM25, right? as described at https://opensourceconnections.com/blog/2015/10/16/bm25-the-next-generation-of-lucene-relevation/ Thanks Mi

Re: The current default similarity implementation of Lucene is BM25, right?

2022-11-23 Thread Adrien Grand
> would like to ask > > whether the current default similarity implementation of Lucene is > really BM25, right? > > as described at > > https://opensourceconnections.com/blog/2015/10/16/bm25-the-next-generation-

The current default similarity implementation of Lucene is BM25, right?

2022-11-23 Thread Michael Wechner
Hi On the Lucene FAQ there is no mentioning re tf-idf or bm25 and I would like to add some notes, but to be sure I don't write anything wrong I would like to ask whether the current default similarity implementation of Lucene is really BM25, right? as described at

Re: Similarity Implementation

2016-07-07 Thread Đạt Cao Mạnh
Hi Siraj, I think https://lucene.apache.org/core/6_1_0/core/index.html?org/apache/lucene/search/ConstantScoreQuery.html should be good enough. On Fri, Jul 8, 2016 at 12:27 AM Siraj Haider wrote: > We are in the process of upgrading from 2.x to 6.x. In 2.x we implemented > our own similarity whe

Similarity Implementation

2016-07-07 Thread Siraj Haider
We are in the process of upgrading from 2.x to 6.x. In 2.x we implemented our own similarity where all the functions return 1.0f, how can we implement such thing in 6.x? Is there an implementation already there that we can use and have the same results? -- Regards -Siraj Haider (212) 306-01

Re: Simple Similarity Implementation to Count the Number of Hits

2016-05-12 Thread Ahmet Arslan
Hi Luis, Thats an interesting question. Can you share your similarity? I suspect you return 1 expect Similarity#coord method. Not sure but, for phrase query, one may require to modify ExactPhraseScorer/ExactPhraseScorer etc. ahmet On Thursday, May 12, 2016 5:41 AM, Luís Filipe Nassif wrote:

Simple Similarity Implementation to Count the Number of Hits

2016-05-11 Thread Luís Filipe Nassif
Hi, In the past (lucene 4) I have tried to implement a simple Similarity to only count the number of occurrences (term frequencies) into the documents, ignoring norms, doc frequencies, boosts... It worked for some queries like term and wildcard queries, but not for others, like phrase and range qu

Re: Vector Space Model: New Similarity Implementation Issues

2008-02-28 Thread h t
d "idf". > > FYI: I could implement "idf" according to miisliat.com formula, but not > the > "tf" and "norm" > > Could you please comment me how I can implement a new Similarity class > that > will fit in the Lucene's architecture

Re: Vector Space Model: New Similarity Implementation Issues

2008-02-28 Thread Dharmalingam
>>> Try this as an example: >>> >>> Setup a really simple index with 1 or 2 docs each with a few words. >>> Setup a simple Similarity class where you override all of these >>> methods to return 1 (or some

Re: Vector Space Model: New Similarity Implementation Issues

2008-02-28 Thread Grant Ingersoll
ext: http://www.nabble.com/Vector-Space-Model%3A-New-Similarity-Implementation-Issues-tp15696719p15745822.html Sent from the Lucene - Java Users mailing list archive at Nabble.com. - To unsubscribe, e-mail: [EMAIL PROTECTED]

Re: Vector Space Model: New Similarity Implementation Issues

2008-02-28 Thread Dharmalingam
t trying to understand Lucene at an academic/programming level or > do you have something you are trying to achieve in terms of relevance? > > -Grant > > - &g

Re: Vector Space Model: New Similarity Implementation Issues

2008-02-28 Thread Grant Ingersoll
On Feb 28, 2008, at 9:00 AM, Dharmalingam wrote: Thanks for the reply. Sorry if my explanation is not clear. Yes, you are correct the model is based on Salton's VSM. However, the calculation of the term weight and the doc norm is, in my opinion, different from Lucene. If you look at th

Re: Vector Space Model: New Similarity Implementation Issues

2008-02-28 Thread Dharmalingam
Next Training: April 7, 2008 at ApacheCon Europe in Amsterdam > > Lucene Helpful Hints: > http://wiki.apache.org/lucene-java/BasicsOfPerformance > http://wiki.apache.org/lucene-java/LuceneFAQ > > > > > >

Re: Vector Space Model: New Similarity Implementation Issues

2008-02-28 Thread Grant Ingersoll
Not sure I am understanding what you are asking, but I will give it a shot. See below On Feb 26, 2008, at 3:45 PM, Dharmalingam wrote: Hi List, I am pretty new to Lucene. Certainly, it is very exciting. I need to implement a new Similarity class based on the Term Vector Space Model giv

Vector Space Model: New Similarity Implementation Issues

2008-02-26 Thread Dharmalingam
-- View this message in context: http://www.nabble.com/Vector-Space-Model%3A-New-Similarity-Implementation-Issues-tp15696719p15696719.html Sent from the Lucene - Java Users mailing list archive at Nabble.com. - To unsubscribe, e-mail

RE: BM25 Similarity implementation

2006-02-17 Thread Trieschnigg, R.B. \(Dolf\)
Sorry, the image wasn't sent: http://wwwhome.cs.utwente.nl/~trieschn/bm25.PNG > -Original Message- > From: Trieschnigg, R.B. (Dolf) > [mailto:[EMAIL PROTECTED] > Sent: vrijdag 17 februari 2006 10:54 > To: java-user@lucene.apache.org > Subject: RE: BM25 Si

RE: BM25 Similarity implementation

2006-02-17 Thread Trieschnigg, R.B. \(Dolf\)
> > I would like to implement the Okapi BM25 weighting function > > using my own Similarity implementation. Unfortunately BM25 > > requires the document length in the score calculation, which > > is not provided by the Scorer. > > How do you want to measure docu

Re: BM25 Similarity implementation

2006-02-16 Thread Doug Cutting
Trieschnigg, R.B. (Dolf) wrote: I would like to implement the Okapi BM25 weighting function using my own Similarity implementation. Unfortunately BM25 requires the document length in the score calculation, which is not provided by the Scorer. How do you want to measure document length? If

BM25 Similarity implementation

2006-02-16 Thread Trieschnigg, R.B. \(Dolf\)
Hi, I would like to implement the Okapi BM25 weighting function using my own Similarity implementation. Unfortunately BM25 requires the document length in the score calculation, which is not provided by the Scorer. Does anyone know a solution to this problem? I've tried to find