Re: Quoted phrase doesn't match when stemming and synonyms combined.

2023-01-12 Thread Dave
That’s awesome you found it! And of course anytime. But again the idea of having a complete reindex plan ready would be wise in my opinion. Just something that makes you feel a tad safer when the s and the fan hit each other. I’ve had to rebuild well over a terabyte of a solr index in less tha

Re: Quoted phrase doesn't match when stemming and synonyms combined.

2023-01-12 Thread Mateusz Matela
If anyone's interested, I've submitted https://github.com/apache/lucene/issues/12080 I found a small change in code that seem to fix the problem. Thank you Dave for the feedback! W dniu 11.01.2023 o 15:17, Dave pisze: On one hand that’s great news, on the other ot probably deserves a ticket but

Re: Quoted phrase doesn't match when stemming and synonyms combined.

2023-01-11 Thread Dave
On one hand that’s great news, on the other ot probably deserves a ticket but you need to have a very specific scenario where your index filters don’t match your query filters. Also maybe spend some time putting together a reindexing plan. Solr can use multiple cores so you can index content

Re: Quoted phrase doesn't match when stemming and synonyms combined.

2023-01-11 Thread Mateusz Matela
After reindexing with SGF the document matches, as expected. Still, it looks like SGF was designed to work well when used only in query, and it's just a bug revealed by an edge case. Shall I submit an issue to https://github.com/apache/lucene ? W dniu 11.01.2023 o 13:09, Dave pisze: Yes then

Re: Quoted phrase doesn't match when stemming and synonyms combined.

2023-01-11 Thread Dave
Yes then that is a problem, and I agree it should be intuitive that the quotes work without the modifier. I’m not familiar with the underlying code enough to know for sure what’s going on in this instance, but reinfecting the content with the filter I wonder would fix it? You can experiment wit

Re: Quoted phrase doesn't match when stemming and synonyms combined.

2023-01-11 Thread Mateusz Matela
W dniu 11.01.2023 o 12:04, Dave pisze: Hmm. As an experiment what happens when you use a range of three or four with the quotes using the tilda in the query? You mean query like "test polskie"~1 ? Yes, it does match. Unfortunately it's not a workaround I can use because the query is provided

Re: Quoted phrase doesn't match when stemming and synonyms combined.

2023-01-11 Thread Dave
Hmm. As an experiment what happens when you use a range of three or four with the quotes using the tilda in the query? Also generally o find it best to use the same filters for both indexing and query, just a personal preference, I know it’s not always possible however. > On Jan 11, 2023, at 5

Quoted phrase doesn't match when stemming and synonyms combined.

2023-01-11 Thread Mateusz Matela
Hi, My query is 'test polskie'. I use MorfologikFilter for Polish stemming, it turns 'polskie' into 'polski' + 'polskie'. I also use SynonymGraphFilter which turns 'polski' into 'pol'. Here's what I see in quey analysis (token position in parenthesis): Tokenizer: test(1) polskie(2) MF: test(1)