Pretty sure as the list of file descriptors below shows (at this point the client has exited, so doubly sure its not open sockets):
# lsof -p `ps ax | grep [C]assandraDaemon | awk '{print $1}'` | awk '{print $9}' | sort | uniq -c | sort -n | tail -n 5 2 /usr/local/apache-cassandra-2010-06-11_12-30-33/lib/slf4j-log4j12-1.5.8.jar 2 /usr/local/apache-cassandra-2010-06-11_12-30-33/lib/snakeyaml-1.6.jar 2 /usr/share/java/gnome-java-bridge.jar 1003 /mnt/cassandra/data/MyKeyspace/MySuperColumn-c-1-Data.db 1003 /mnt/cassandra/data/MyKeyspace/MySuperColumn-c-2-Data.db On Jun 11, 2010, at Fri Jun 11, 7:34 PM, Jonathan Ellis wrote: > it goes up by exactly 2000, which is the number of loop iterations > exactly? are you sure this isn't just counting your open sockets? > > On Fri, Jun 11, 2010 at 1:53 PM, Matthew Conway <m...@backupify.com> wrote: >> Thanks, I just tried apache-cassandra-2010-06-11_12-30-33 (hudson 462) but >> my tests ares still reporting a leak (though not as bad), I do the following >> (ruby tests using cassandra_object/cassandra, but you should be able to get >> the idea): >> >> should "not leak file descriptors" do >> cassandra_pid = `ps ax | grep [C]assandraDaemon | awk '{print $1}'` >> original_count = `lsof -p #{cassandra_pid}`.lines.to_a.size >> assert original_count > 0 >> count = 1000 >> count.times do |n| >> ChildMetadatum.new(:service_id => 4, :child_id => "def#{n}", >> :updated => Time.now, :labels => ["label2", "label3"]).save! >> end >> count.times do |n| >> ChildMetadatum.find_by_natural_key(:service_id => 4, :child_id => >> "def#{n}") >> ChildMetadatum.find_all_by_service_id(3) >> end >> new_count = `lsof -p #{cassandra_pid}`.lines.to_a.size >> assert new_count > 0 >> assert new_count < original_count * 1.1, "File descriptors leaked >> from #{original_count} to #{new_count}" >> end >> >> Which reports: File descriptors leaked from 112 to 2112. >> SHould I reopen the bug or create a new one? >> >> Matt >> >> On Jun 10, 2010, at Thu Jun 10, 6:40 PM, Jonathan Ellis wrote: >> >>> Fixed in https://issues.apache.org/jira/browse/CASSANDRA-1178 >>> >>> On Thu, Jun 10, 2010 at 9:01 AM, Matt Conway <m...@backupify.com> wrote: >>>> Hi All, >>>> I'm running a small 4-node cluster with minimal load using >>>> the 2010-06-08_12-31-16 build from trunk, and its exhausting file >>>> descriptors pretty quickly (65K in less than an hour). Here's a list of >>>> the >>>> files I see it leaking, I can do a more specific query if you'd like. Am >>>> I >>>> doing something wrong, is this a known problem, something being done wrong >>>> from the client side, or something else? Any help appreciated, thanks, >>>> Matt >>>> r...@cassandra01:~# lsof -p `ps ax | grep [C]assandraDaemon | awk '{print >>>> $1}'` | awk '{print $9}' | sort | uniq -c | sort -n | tail -n 5 >>>> 3 /mnt/cassandra/data/system/Schema-c-2-Data.db >>>> 1278 /mnt/cassandra/data/MyKeyspace/MyColumnType-c-7-Data.db >>>> 1405 /mnt/cassandra/data/MyKeyspace/MyColumnType-c-9-Data.db >>>> 1895 /mnt/cassandra/data/MyKeyspace/MyColumnType-c-5-Data.db >>>> 26655 /mnt/cassandra/data/MyKeyspace/MyColumnType-c-11-Data.db >>>> >>>> >>> >>> >>> >>> -- >>> Jonathan Ellis >>> Project Chair, Apache Cassandra >>> co-founder of Riptano, the source for professional Cassandra support >>> http://riptano.com >> >> > > > > -- > Jonathan Ellis > Project Chair, Apache Cassandra > co-founder of Riptano, the source for professional Cassandra support > http://riptano.com