Thanks Tim & Walter.
I have managed to get it working with shingles and edge n-grams. Initially it
brought up a lot of false positives, but I mitigated that by tweaking the
parameters and by splitting this into a separate copy field with a lower
boost than a normal match.
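
For anyone finding this thread later, here is a rough Python sketch of what the
two filters do to a token stream. It is not Solr code and not my actual config.
The function names and defaults (shingles, edge_ngrams, max_size, sep, min_gram,
max_gram) are made up for illustration, loosely mirroring a shingle filter with
an empty token separator and an edge n-gram filter.

```python
def shingles(tokens, max_size=2, sep=""):
    """Join each run of 2..max_size adjacent tokens into a single token."""
    out = []
    for size in range(2, max_size + 1):
        for i in range(len(tokens) - size + 1):
            out.append(sep.join(tokens[i:i + size]))
    return out


def edge_ngrams(token, min_gram=3, max_gram=10):
    """Return the prefixes of token from min_gram up to max_gram characters."""
    upper = min(max_gram, len(token))
    return [token[:n] for n in range(min_gram, upper + 1)]


tokens = "health and well being".split()

# Shingles with an empty separator produce "wellbeing", plus nonsense
# compounds such as "healthand" that real queries will rarely hit.
print(shingles(tokens))

# Edge n-grams let a short prefix like "well" match the longer indexed token.
print(edge_ngrams("wellbeing", min_gram=4, max_gram=6))
```

The extra tokens are what caused the false positives, which is why they are
better kept in their own field with a lower boost than the exact-match field.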
On Wed, Aug 16,
There are two cases.
Index has “well being”, query is “wellbeing”. This is solved by using a shingle
filter. That will make lots of nonsense compounds, too, but they won’t match
real queries. Well, almost never.
Index has “wellbeing”, query is “well being”. Best approach for this is synonym
ex
Index all bigrams. If you use a dictionary then there is a lot of work to
maintain it. Also this does not translate well to other languages. The
downside to this is having partial token hits which decrease precision.
But, usually people who are looking for "well being" or "wellbeing" will
not e