Hi guys,
I have what may be a dumb question but I am confused by how much disk space is 
being used by my Cassandra nodes.  I have 10 nodes in my cluster with a 
replication factor of 3.  After I write 1,000,000 rows to the database (100kB 
each), I see that they have been distributed very evenly, about 100,000 rows 
per node but because of the replication factor of 3, each node contains about 
300,000 rows.  This is all good.  Since my rows are 100kB each, I expect each 
node to store about 30GB of data, however that is not what I am seeing.  
Instead, I am seeing some nodes that do not experience any compaction 
exceptions but report their space used as MUCH more.  Here's one using 106 GB 
of disk.  My disks are only 160 GB so this is at the bleeding edge and I 
thought my node would be able to store more data.

I only use a single column family so here is the cfstats output from one of my 
nodes (server5):

                Column Family: Standard1
                SSTable count: 12
                Space used (live): 113946099884
                Space used (total): 113946099884
                Memtable Columns Count: 0
                Memtable Data Size: 0
                Memtable Switch Count: 451
                Read Count: 31786
                Read Latency: 161.429 ms.
                Write Count: 300633
                Write Latency: 0.124 ms.
                Pending Tasks: 0
                Key cache: disabled
                Row cache capacity: 3000
                Row cache size: 3000
                Row cache hit rate: 0.38331340841880074
                Compacted row minimum size: 100220
                Compacted row maximum size: 100225
                Compacted row mean size: 100224

Note that I wrote these 1M rows of data yesterday and the system has had 24 
hours to digest it. There are no exceptions in the system.log file.  Here's 
the tail end of it:

...
INFO [SSTABLE-CLEANUP-TIMER] 2010-07-06 16:13:43,162
SSTableDeletingReference.java (line 104) Deleted
/var/lib/cassandra/data/Keyspace1/Standard1-430-Data.db
 INFO [SSTABLE-CLEANUP-TIMER] 2010-07-06 16:13:43,269
SSTableDeletingReference.java (line 104) Deleted
/var/lib/cassandra/data/Keyspace1/Standard1-445-Data.db
 INFO [COMPACTION-POOL:1] 2010-07-06 16:35:21,718 CompactionManager.java (line 
246) Compacting []
 INFO [Timer-1] 2010-07-06 17:01:01,907 Gossiper.java (line 179) InetAddress 
/10.248.107.19 is now dead.
 INFO [GMFD:1] 2010-07-06 17:01:42,039 Gossiper.java (line 568) InetAddress 
/10.248.107.19 is now UP
 INFO [COMPACTION-POOL:1] 2010-07-06 17:35:21,306 CompactionManager.java (line 
246) Compacting []
 INFO [COMPACTION-POOL:1] 2010-07-06 18:35:20,802 CompactionManager.java (line 
246) Compacting []
 INFO [COMPACTION-POOL:1] 2010-07-06 19:35:20,389 CompactionManager.java (line 
246) Compacting []
 INFO [COMPACTION-POOL:1] 2010-07-06 20:35:19,934 CompactionManager.java (line 
246) Compacting []
 INFO [COMPACTION-POOL:1] 2010-07-06 21:35:19,582 CompactionManager.java (line 
246) Compacting []
 INFO [COMPACTION-POOL:1] 2010-07-06 22:35:19,233 CompactionManager.java (line 
246) Compacting []
 INFO [COMPACTION-POOL:1] 2010-07-06 23:35:18,593 CompactionManager.java (line 
246) Compacting []
 INFO [COMPACTION-POOL:1] 2010-07-07 00:35:18,076 CompactionManager.java (line 
246) Compacting []
 INFO [COMPACTION-POOL:1] 2010-07-07 01:35:17,673 CompactionManager.java (line 
246) Compacting []
 INFO [COMPACTION-POOL:1] 2010-07-07 02:35:17,172 CompactionManager.java (line 
246) Compacting []
 INFO [COMPACTION-POOL:1] 2010-07-07 03:35:16,784 CompactionManager.java (line 
246) Compacting []
 INFO [COMPACTION-POOL:1] 2010-07-07 04:35:16,383 CompactionManager.java (line 
246) Compacting []

Thank you for your help!!
Julie


Reply via email to