Score exact matches higher than matches that match analysed text but not original text

Paul Taylor Tue, 10 Jan 2012 01:13:46 -0800

My analyser strips out accents as often these are not entered correctly,so assume there are two documents in the database with default fieldcontaining

República
Republica

a search for República or Republica will return both results, each witha score of 1.

Its correct that they both get returned but it would be really nice ifat the scoring stage it could recognise that if I had search forRepública that the document containing República is a slightly bettermatch than the other one and score slightly higher, and vice versa.

Is there are any way to do this in Lucene, alternatively I thought aboutaugmenting the score results returned by Lucene, and when multipleresults have the same score check the number of matching letters andincrease the score based on how many letters match, but only increasethe score so still lower than any results that Lucene scored higher. Ialso realise that this seems to make sense when just searching one fieldbut more complex when the query is searching over multiple fields but Ithink in this case when searching for artists/bands (music) I would onlydo the boost if the artist name was one of the search fields.


Paul

Score exact matches higher than matches that match analysed text but not original text

Reply via email to