[IMPORTANT] Please set your GitHub profile "name" to your Jira "Full Name" for account mapping

2022-07-09 Thread Tomoko Uchida
[image: jira_profile.png] [image: github_profile.png] Hi all Lucene maintainers and contributors, we need your cooperation for the coming GitHub issue migration from Jira. In short, please set your GitHub profile's "Name" to exactly the same string in your Jira "Full Name" (see the attached images

Re: Fuzzy Query Similarity

2022-07-09 Thread Michael Sokolov
Oh good! Thanks for clarifying, Uwe On Sat, Jul 9, 2022, 12:23 PM Uwe Schindler wrote: > Hi > > FuzzyQuery/MultiTermQuery and I don't see any way to "boost" exact > > matches, or even to incorporate the edit distance more generally into > > the per-term score, although it does seem like that wou

Re: Fuzzy Query Similarity

2022-07-09 Thread Uwe Schindler
Hi FuzzyQuery/MultiTermQuery and I don't see any way to "boost" exact matches, or even to incorporate the edit distance more generally into the per-term score, although it does seem like that would be something people would generally expect. Actually it does this: * By default FuzzyQuery uses

Re: Fuzzy Query Similarity

2022-07-09 Thread Uwe Schindler
The problem is that the query combines the native termquery score (which depends on length of document and term's statistic). The edit distance is also multiplied in. When the difference in term statistics is too large, the edit distance no longer matters. This is perfectly fine and also happen

Re: Fuzzy Query Similarity

2022-07-09 Thread Michael Sokolov
I am no expert with this, but I got curious and looked at FuzzyQuery/MultiTermQuery and I don't see any way to "boost" exact matches, or even to incorporate the edit distance more generally into the per-term score, although it does seem like that would be something people would generally expect. So