Can you open a new ticket, then? Preferably with the thrift code involved, I'm not sure what find_by_natural_key or find_all_by_service is translating into. (It looks like just one of those is responsible for the leak.)
On Sun, Jun 13, 2010 at 12:11 PM, Matthew Conway <m...@backupify.com> wrote: > Pretty sure as the list of file descriptors below shows (at this point the > client has exited, so doubly sure its not open sockets): > > # lsof -p `ps ax | grep [C]assandraDaemon | awk '{print $1}'` | awk '{print > $9}' | sort | uniq -c | sort -n | tail -n 5 > 2 > /usr/local/apache-cassandra-2010-06-11_12-30-33/lib/slf4j-log4j12-1.5.8.jar > 2 /usr/local/apache-cassandra-2010-06-11_12-30-33/lib/snakeyaml-1.6.jar > 2 /usr/share/java/gnome-java-bridge.jar > 1003 /mnt/cassandra/data/MyKeyspace/MySuperColumn-c-1-Data.db > 1003 /mnt/cassandra/data/MyKeyspace/MySuperColumn-c-2-Data.db > > On Jun 11, 2010, at Fri Jun 11, 7:34 PM, Jonathan Ellis wrote: > >> it goes up by exactly 2000, which is the number of loop iterations >> exactly? are you sure this isn't just counting your open sockets? >> >> On Fri, Jun 11, 2010 at 1:53 PM, Matthew Conway <m...@backupify.com> wrote: >>> Thanks, I just tried apache-cassandra-2010-06-11_12-30-33 (hudson 462) but >>> my tests ares still reporting a leak (though not as bad), I do the >>> following (ruby tests using cassandra_object/cassandra, but you should be >>> able to get the idea): >>> >>> should "not leak file descriptors" do >>> cassandra_pid = `ps ax | grep [C]assandraDaemon | awk '{print $1}'` >>> original_count = `lsof -p #{cassandra_pid}`.lines.to_a.size >>> assert original_count > 0 >>> count = 1000 >>> count.times do |n| >>> ChildMetadatum.new(:service_id => 4, :child_id => "def#{n}", >>> :updated => Time.now, :labels => ["label2", "label3"]).save! >>> end >>> count.times do |n| >>> ChildMetadatum.find_by_natural_key(:service_id => 4, :child_id => >>> "def#{n}") >>> ChildMetadatum.find_all_by_service_id(3) >>> end >>> new_count = `lsof -p #{cassandra_pid}`.lines.to_a.size >>> assert new_count > 0 >>> assert new_count < original_count * 1.1, "File descriptors leaked >>> from #{original_count} to #{new_count}" >>> end >>> >>> Which reports: File descriptors leaked from 112 to 2112. >>> SHould I reopen the bug or create a new one? >>> >>> Matt >>> >>> On Jun 10, 2010, at Thu Jun 10, 6:40 PM, Jonathan Ellis wrote: >>> >>>> Fixed in https://issues.apache.org/jira/browse/CASSANDRA-1178 >>>> >>>> On Thu, Jun 10, 2010 at 9:01 AM, Matt Conway <m...@backupify.com> wrote: >>>>> Hi All, >>>>> I'm running a small 4-node cluster with minimal load using >>>>> the 2010-06-08_12-31-16 build from trunk, and its exhausting file >>>>> descriptors pretty quickly (65K in less than an hour). Here's a list of >>>>> the >>>>> files I see it leaking, I can do a more specific query if you'd like. >>>>> Am I >>>>> doing something wrong, is this a known problem, something being done wrong >>>>> from the client side, or something else? Any help appreciated, thanks, >>>>> Matt >>>>> r...@cassandra01:~# lsof -p `ps ax | grep [C]assandraDaemon | awk '{print >>>>> $1}'` | awk '{print $9}' | sort | uniq -c | sort -n | tail -n 5 >>>>> 3 /mnt/cassandra/data/system/Schema-c-2-Data.db >>>>> 1278 /mnt/cassandra/data/MyKeyspace/MyColumnType-c-7-Data.db >>>>> 1405 /mnt/cassandra/data/MyKeyspace/MyColumnType-c-9-Data.db >>>>> 1895 /mnt/cassandra/data/MyKeyspace/MyColumnType-c-5-Data.db >>>>> 26655 /mnt/cassandra/data/MyKeyspace/MyColumnType-c-11-Data.db >>>>> >>>>> >>>> >>>> >>>> >>>> -- >>>> Jonathan Ellis >>>> Project Chair, Apache Cassandra >>>> co-founder of Riptano, the source for professional Cassandra support >>>> http://riptano.com >>> >>> >> >> >> >> -- >> Jonathan Ellis >> Project Chair, Apache Cassandra >> co-founder of Riptano, the source for professional Cassandra support >> http://riptano.com > > -- Jonathan Ellis Project Chair, Apache Cassandra co-founder of Riptano, the source for professional Cassandra support http://riptano.com