On Sat, 28 Jan 2017, marco turchi wrote:

Dear All,
I need to use the ShingleAnalyzerWrapper in PyLucene.

I have built the analyzer similar to Lucene:
self.analyzer = ShingleAnalyzerWrapper(WhitespaceAnalyzer(), 2, 4, " " ,
True, False, None)

and I have used it inside QuertParser
query = QueryParser("source", self.analyzer).parse("welcome world is at on")

the output is:
source:welcome source:world source:is source:at source:on

I have run the same code in Java and the output is how I would expect it:
source:welcome source:welcome world source:welcome world is source:welcome
world is at source:world source:world is source:world is at source:world is
at on source:is content:is at source:is at on source:at source:at on
source:on

Do you have any ideas in what I'm doing wrong in PyLucene?

Please, help me help you by including two simple programs that I can run to reproduce the problem. One in Java producing the output you expect, one in Python producing the output you're reporting.

Thanks !

Andi..


Thanks a lot in advance for your help
Marco

Reply via email to