On Wed, 20 May 2009, Moshe Cohen wrote:
Thanks. Version being used: 2.4.1. I have already tried most of the well-documented Lucene ideas. The seemingly weird thing is that the index is always quite small; I have experience with much larger indices on Solr and no such errors. It started with a memory error; after increasing the JVM heap on init I got the "too many open files" error, and after increasing the OS limit I got a memory error again :-) I got further along at each stage, but ultimately I hit an error. I can work around the problem by just restarting the program. This is what led me to suspect resource leaks specific to PyLucene.
If it's a small enough program, it might be interesting to see if you can reproduce the problem in pure Java.
Are there any useful monitoring functions that can retrieve the resource usage state along the way?
PyLucene is only a wrapper around Java Lucene and the JVM. The one thing you can track in that context is how many Java objects escaped the JVM to Python and how many references Python holds to them. Use env._dumpRefs(); env is what initVM() returns. _dumpRefs() dumps the hashtable of Java objects that escaped the VM to Python, listing their java.lang.System::identityHashCode() and how many references Python holds to each of them.
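For example, a minimal sketch of how this might look in a long-running program (the workload in the middle is a placeholder; the only PyLucene-specific pieces are initVM() and _dumpRefs(), and the exact return shape of _dumpRefs() may vary between versions, so print it and inspect):

    import lucene

    env = lucene.initVM()   # initVM() returns the JCC environment object

    # ... run one iteration of your indexing/searching workload here ...

    # Dump the table of Java objects that escaped the JVM to Python:
    # entries are keyed by java.lang.System.identityHashCode() and record
    # how many references Python holds to each object. Calling this after
    # every iteration and comparing sizes shows whether the table grows.
    print env._dumpRefs()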
If that dump grows beyond reason, you have a clue about what could be going wrong, provided you can then track down what the actual objects in question are (log their identityHashCode() when you use them, for example). If it doesn't grow, then the problem is most likely on the Java side, and rewriting your program in pure Java is going to help with debugging this.
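One way to do that correlation, assuming your PyLucene build wraps java.lang.System (the index path here is just a placeholder):

    from lucene import IndexReader, System

    reader = IndexReader.open("/path/to/index")
    # Log the identity hash at the point of use, so this object can be
    # matched against the corresponding entry in the _dumpRefs() output.
    print "opened reader, identityHashCode =", System.identityHashCode(reader)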
Andi..