On Sat, 28 Jan 2017, marco turchi wrote:
Dear All, I need to use the ShingleAnalyzerWrapper in PyLucene. I have built the analyzer similar to Lucene: self.analyzer = ShingleAnalyzerWrapper(WhitespaceAnalyzer(), 2, 4, " " , True, False, None) and I have used it inside QuertParser query = QueryParser("source", self.analyzer).parse("welcome world is at on") the output is: source:welcome source:world source:is source:at source:on I have run the same code in Java and the output is how I would expect it: source:welcome source:welcome world source:welcome world is source:welcome world is at source:world source:world is source:world is at source:world is at on source:is content:is at source:is at on source:at source:at on source:on Do you have any ideas in what I'm doing wrong in PyLucene?
Please, help me help you by including two simple programs that I can run to reproduce the problem. One in Java producing the output you expect, one in Python producing the output you're reporting.
Thanks ! Andi..
Thanks a lot in advance for your help Marco