Hi, My query is 'test polskie'.I use MorfologikFilter for Polish stemming, it turns 'polskie' into 'polski' + 'polskie'.
I also use SynonymGraphFilter which turns 'polski' into 'pol'. Here's what I see in quey analysis (token position in parenthesis): Tokenizer: test(1) polskie(2) MF: test(1) polskie(2) polski(2) SGF: test(1) polskie(2) pol(3) polski(3).
When I search for "test polskie" with quotation marks, a document with the same text doesn't match. I think it's because SGF changes position of output tokens (SGF is applied only for query, so in index the positions are only 1 and 2). It mtches when I disable SGF.
Am I doing something wrong, or is this a bug in SGF? Thanks, Mateus
smime.p7s
Description: Kryptograficzna sygnatura S/MIME