On Sat, 27 Mar 2010, Valery Khamenya wrote:

there is a strange slowdown during the search for similar documents.
For some reason pylucene version is much slower than the pure Lucene one.
The test document collection contains 200K docs.

Here is the pylucene version:

content = ref_doc.getField('content').stringValue()
similarity_query = SimilarityQueries.formSimilarQuery(content,
default_analyzer, 'content', None)
search = index.search(similarity_query, 200)

Did you initialize the JVMs with the same memory parameters when comparing PyLucene vs Lucene Java ?

Andi..

Reply via email to