Re: Compound words in English

2023-08-23 Thread Ramkumar Krishnamoorthy
Thanks Tim & Walter. Have managed to get it working with shingles and edge ngram. Initially it did bring up a lot of false positives but managed to mitigate it tweaking with the parameters and also by splitting this into a separate copy field with lower boost than a normal match. On Wed, Aug 16,

Re: Compound words in English

2023-08-16 Thread Walter Underwood
There are two cases. Index has “well being”, query is “wellbeing”. This is solved by using a shingle filter. That will make lots of nonsense compounds, too, but they won’t match real queries. Well, almost never. Index has “wellbeing”, query is “well being”. Best approach for this is synonym ex

Re: Compound words in English

2023-08-15 Thread Tim Casey
Index all diagrams. If you use a dictionary then there is a lot of work to maintain it. Also this does not translate well to other languages. The downside to this is having partial token hits which decrease precision. But, usually people who are looking for "well being" or "wellbeing" will not e