Hi,

My query is 'test polskie'.
I use MorfologikFilter for Polish stemming, it turns 'polskie' into 'polski' + 'polskie'.
I also use SynonymGraphFilter which turns 'polski' into 'pol'.
Here's what I see in quey analysis (token position in parenthesis):
Tokenizer: test(1) polskie(2)
MF: test(1) polskie(2) polski(2)
SGF: test(1) polskie(2) pol(3) polski(3).

When I search for "test polskie" with quotation marks, a document with the same text doesn't match. I think it's because SGF changes position of output tokens (SGF is applied only for query, so in index the positions are only 1 and 2). It mtches when I disable SGF.
Am I doing something wrong, or is this a bug in SGF?

Thanks,
Mateus

Attachment: smime.p7s
Description: Kryptograficzna sygnatura S/MIME

Reply via email to