The 1.0 branch is less stable than 0.8 for production. We discovered the following problems:

1. memory leak in scrub (also reported on this list)

2. problem with saving key caches for super column family -  CASSANDRA-3511

3. in 1.0.3 some hints get stuck in the system tables. Hints to other nodes seem to be delivered fine, but the node is probably also writing hints to itself, and those get stuck there. I cleaned the hints across the entire cluster (with the cluster stopped) to be sure that no pre-1.0 hints remain.

4. memory leak. The node boots and works fine, but after some time (a few hours) it starts logging things like:

WARN [ScheduledTasks:1] 2011-11-22 07:48:18,843 GCInspector.java (line 146) Heap is 0.8598437159194559 full. You may need to reduce memtable and/or cache sizes. Cassandra will now flush up to the two largest memtables to free up memory. Adjust flush_largest_memtables_at threshold in cassandra.yaml if you don't want Cassandra to do this automatically
WARN [ScheduledTasks:1] 2011-11-22 07:48:18,844 StorageService.java (line 2511) Flushing CFS to relieve memory pressure
INFO [ScheduledTasks:1] 2011-11-22 07:48:18,844 ColumnFamilyStore.java (line 688) Enqueuing flush 1034292896(3645/57056 serialized/live bytes, 51 ops)

The flushed table is very small. After a node restart it flushes CFs in 10MB serialized chunks. This is not a "caches are too large" problem, because the row caches are minimal (they have now even been reduced to 0.6 of their sizes). The node works just fine with larger caches after a restart.

It will eventually OOM because the situation keeps getting worse. I am now monitoring this issue more closely and collecting heap dumps.
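
For anyone who wants to watch the same trend on their own nodes, here is a minimal sketch that pulls the heap fullness out of GCInspector warnings like the one quoted above. It assumes the log line format shown in this message; the function name heap_readings and the regular expression are my own, not part of Cassandra.

```python
import re
from datetime import datetime

# Matches the GCInspector warning format quoted in this message:
# "<date> <time>,<millis> GCInspector.java ... Heap is <fraction> full"
HEAP_RE = re.compile(
    r"(\d{4}-\d{2}-\d{2} \d{2}:\d{2}:\d{2}),\d+ GCInspector\.java.*?Heap is ([0-9.]+) full"
)


def heap_readings(lines):
    """Yield (timestamp, heap_fraction) for each GCInspector heap warning."""
    for line in lines:
        for match in HEAP_RE.finditer(line):
            when = datetime.strptime(match.group(1), "%Y-%m-%d %H:%M:%S")
            yield when, float(match.group(2))


# The warning from this message, truncated after the part the regex needs.
sample = (
    "WARN [ScheduledTasks:1] 2011-11-22 07:48:18,843 GCInspector.java "
    "(line 146) Heap is 0.8598437159194559 full. You may need to reduce "
    "memtable and/or cache sizes."
)

readings = list(heap_readings([sample]))
for when, fraction in readings:
    print(when.isoformat(), f"{fraction:.1%}")
```

To run it against a real log, pass an open file instead of the sample list, for example heap_readings(open(path)) with path pointing at the node's system log. A steadily rising fraction between restarts, with flushes that free almost nothing, would match the behaviour described above.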

799686 instances of class org.apache.cassandra.db.DecoratedKey
796969 instances of class org.apache.cassandra.dht.BigIntegerToken
612414 instances of class org.apache.cassandra.io.sstable.IndexSummary$KeyPosition
444125 instances of class org.apache.cassandra.db.ExpiringColumn
187205 instances of class com.googlecode.concurrentlinkedhashmap.ConcurrentLinkedHashMap$Node
187205 instances of class com.googlecode.concurrentlinkedhashmap.ConcurrentLinkedHashMap$WeightedValue
186347 instances of class org.apache.cassandra.utils.Pair
111813 instances of class org.apache.cassandra.db.AbstractColumnContainer$DeletionInfo
111813 instances of class org.apache.cassandra.db.ThreadSafeSortedColumns
110932 instances of class org.apache.cassandra.db.SuperColumn
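
A single histogram only shows what is on the heap, not what is growing. One way to narrow it down is to compare the instance counts from two dumps taken some hours apart. The following is a rough sketch of that comparison, assuming both histograms use the "N instances of class X" format shown above; parse_histogram and growth are hypothetical helper names, and only a few lines of the listing are repeated here as sample input.

```python
import re

# One histogram entry: "<count> instances of class <fully.qualified.Name>"
ENTRY_RE = re.compile(r"(\d+) instances of class (\S+)")


def parse_histogram(text):
    """Map class name -> instance count for every entry found in text."""
    return {m.group(2): int(m.group(1)) for m in ENTRY_RE.finditer(text)}


def growth(before, after):
    """Return (class, delta) pairs, largest increase first.

    Classes missing from the earlier histogram count as 0 there.
    """
    deltas = {cls: count - before.get(cls, 0) for cls, count in after.items()}
    return sorted(deltas.items(), key=lambda item: item[1], reverse=True)


# A few entries copied from the histogram in this message.
snapshot = parse_histogram("""
799686 instances of class org.apache.cassandra.db.DecoratedKey
796969 instances of class org.apache.cassandra.dht.BigIntegerToken
110932 instances of class org.apache.cassandra.db.SuperColumn
""")

for cls, count in sorted(snapshot.items(), key=lambda item: item[1], reverse=True):
    print(f"{count:>8}  {cls}")
```

With a second histogram parsed the same way, growth(earlier, later) lists the classes whose counts rose the most. Those are the first candidates to inspect in the heap dumps.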
