That’s awesome you found it! And of course anytime. But again the idea of
having a complete reindex plan ready would be wise in my opinion. Just
something that makes you feel a tad safer when the s and the fan hit each
other. I’ve had to rebuild well over a terabyte of a solr index in less tha
If anyone's interested, I've submitted
https://github.com/apache/lucene/issues/12080
I found a small change in code that seem to fix the problem.
Thank you Dave for the feedback!
W dniu 11.01.2023 o 15:17, Dave pisze:
On one hand that’s great news, on the other ot probably deserves a ticket but
On one hand that’s great news, on the other ot probably deserves a ticket but
you need to have a very specific scenario where your index filters don’t match
your query filters.
Also maybe spend some time putting together a reindexing plan. Solr can use
multiple cores so you can index content
After reindexing with SGF the document matches, as expected.
Still, it looks like SGF was designed to work well when used only in
query, and it's just a bug revealed by an edge case. Shall I submit an
issue to https://github.com/apache/lucene ?
W dniu 11.01.2023 o 13:09, Dave pisze:
Yes then
Yes then that is a problem, and I agree it should be intuitive that the quotes
work without the modifier. I’m not familiar with the underlying code enough to
know for sure what’s going on in this instance, but reinfecting the content
with the filter I wonder would fix it? You can experiment wit
W dniu 11.01.2023 o 12:04, Dave pisze:
Hmm. As an experiment what happens when you use a range of three or four with
the quotes using the tilda in the query?
You mean query like "test polskie"~1 ? Yes, it does match.
Unfortunately it's not a workaround I can use because the query is
provided
Hmm. As an experiment what happens when you use a range of three or four with
the quotes using the tilda in the query?
Also generally o find it best to use the same filters for both indexing and
query, just a personal preference, I know it’s not always possible however.
> On Jan 11, 2023, at 5
Hi,
My query is 'test polskie'.
I use MorfologikFilter for Polish stemming, it turns 'polskie' into
'polski' + 'polskie'.
I also use SynonymGraphFilter which turns 'polski' into 'pol'.
Here's what I see in quey analysis (token position in parenthesis):
Tokenizer: test(1) polskie(2)
MF: test(1)