Re: Slowdown during the search for similar documents

Andi Vajda Sat, 27 Mar 2010 15:18:11 -0700


On Sat, 27 Mar 2010, Valery Khamenya wrote:

There is a strange slowdown during the search for similar documents.
For some reason the PyLucene version is much slower than the pure Lucene one.
The test document collection contains 200K docs.

Here is the PyLucene version:

content = ref_doc.getField('content').stringValue()
similarity_query = SimilarityQueries.formSimilarQuery(
    content, default_analyzer, 'content', None)
search = index.search(similarity_query, 200)

Did you initialize the JVMs with the same memory parameters when comparing PyLucene vs Lucene Java?
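
For reference, a minimal sketch of how the embedded JVM could be given the same heap settings as a plain "java -Xms... -Xmx..." run. The helper names initvm_kwargs and init_pylucene are hypothetical, made up for this illustration; the only PyLucene call used is lucene.initVM() with its classpath, initialheap, maxheap, maxstack and vmargs keyword arguments. The flag values are placeholders, not the ones used in the test above.

```python
# Hypothetical helper (not part of PyLucene): map Java command-line memory
# flags onto the keyword arguments accepted by lucene.initVM().
_FLAG_TO_KWARG = {"-Xms": "initialheap", "-Xmx": "maxheap", "-Xss": "maxstack"}


def initvm_kwargs(java_flags):
    """Translate flags such as '-Xmx1g' into initVM keyword arguments."""
    kwargs = {}
    extra = []
    for flag in java_flags:
        for prefix, name in _FLAG_TO_KWARG.items():
            if flag.startswith(prefix):
                # '-Xmx1g' becomes maxheap='1g'
                kwargs[name] = flag[len(prefix):]
                break
        else:
            # Anything else is passed through untouched via vmargs.
            extra.append(flag)
    if extra:
        # vmargs is given here as a comma-separated string of options.
        kwargs["vmargs"] = ",".join(extra)
    return kwargs


def init_pylucene(java_flags):
    """Start the embedded JVM with the given Java-style flags."""
    import lucene  # imported lazily so the helper above works without PyLucene
    return lucene.initVM(classpath=lucene.CLASSPATH, **initvm_kwargs(java_flags))
```

Calling init_pylucene(["-Xms128m", "-Xmx1g"]) once, before any Lucene class is used, would then start the JVM with the same heap limits as "java -Xms128m -Xmx1g" on the Java side, so that the two timings are comparable.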


Andi..
