Looks like compaction for this column family stopped after some time.
The last message for this column family in the system.log is


INFO [MigrationStage:1] 2011-10-25 16:57:00,385 Migration.java (line
119) Applying migration 43f106c0-ff54-11e0-0000-68877f281daf Update
column family to
org.apache.cassandra.config.CFMetaData@86c50e4[cfId=1000,ksName=MSA,cfName=uid,cfType=Standard,comparator=org.apache.cassandra.db.marshal.BytesType,subcolumncomparator=<null>,comment=,rowCacheSize=0.0,keyCacheSize=1.3E7,readRepairChance=1.0,replicateOnWrite=true,gcGraceSeconds=3600,defaultValidator=org.apache.cassandra.db.marshal.BytesType,keyValidator=org.apache.cassandra.db.marshal.BytesType,minCompactionThreshold=4,maxCompactionThreshold=32,rowCacheSavePeriodInSeconds=0,keyCacheSavePeriodInSeconds=14400,rowCacheKeysToSave=2147483647,rowCacheProvider=org.apache.cassandra.cache.SerializingCacheProvider@7f32ad0d,mergeShardsChance=0.1,keyAlias=java.nio.HeapByteBuffer[pos=0
lim=3 cap=3],column_metadata={},compactionStrategyClass=class
org.apache.cassandra.db.compaction.LeveledCompactionStrategy,compactionStrategyOptions={sstable_size_in_mb=10},compressionOptions={}]
 INFO [MigrationStage:1] 2011-10-25 16:57:00,389
ColumnFamilyStore.java (line 664) Enqueuing flush of
Memtable-Migrations@1386279942(8806/11007 serialized/live bytes, 1
ops)
 INFO [FlushWriter:307] 2011-10-25 16:57:00,389 Memtable.java (line
237) Writing Memtable-Migrations@1386279942(8806/11007 serialized/live
bytes, 1 ops)
 INFO [MigrationStage:1] 2011-10-25 16:57:00,389
ColumnFamilyStore.java (line 664) Enqueuing flush of
Memtable-Schema@1156898891(4336/5420 serialized/live bytes, 3 ops)
 INFO [FlushWriter:307] 2011-10-25 16:57:00,402 Memtable.java (line
273) Completed flushing
/var/lib/cassandra/data/system/Migrations-h-48-Data.db (8870 bytes)
 INFO [FlushWriter:307] 2011-10-25 16:57:00,402 Memtable.java (line
237) Writing Memtable-Schema@1156898891(4336/5420 serialized/live
bytes, 3 ops)
 INFO [FlushWriter:307] 2011-10-25 16:57:00,413 Memtable.java (line
273) Completed flushing
/var/lib/cassandra/data/system/Schema-h-48-Data.db (4486 bytes)
 INFO [CompactionExecutor:23] 2011-10-25 16:57:00,929
CompactionTask.java (line 119) Compacting
[SSTableReader(path='/var/lib/cassandra/data/MSA/uid-h-9016-Data.db'),
SSTableReader(path='/var/lib/cassandra/data/MSA/uid-h-9046-Data.db'),
SSTableReader(path='/var/lib/cassandra/data/MSA/uid-h-9042-Data.db'),
SSTableReader(path='/var/lib/cassandra/data/MSA/uid-h-9039-Data.db'),
SSTableReader(path='/var/lib/cassandra/data/MSA/uid-h-9043-Data.db'),
SSTableReader(path='/var/lib/cassandra/data/MSA/uid-h-9045-Data.db'),
SSTableReader(path='/var/lib/cassandra/data/MSA/uid-h-9044-Data.db'),
SSTableReader(path='/var/lib/cassandra/data/MSA/uid-h-9015-Data.db'),
SSTableReader(path='/var/lib/cassandra/data/MSA/uid-h-9040-Data.db'),
SSTableReader(path='/var/lib/cassandra/data/MSA/uid-h-9054-Data.db'),
SSTableReader(path='/var/lib/cassandra/data/MSA/uid-h-9047-Data.db'),
SSTableReader(path='/var/lib/cassandra/data/MSA/uid-h-9041-Data.db')]
 INFO [MigrationStage:1] 2011-10-25 16:57:06,693 Migration.java (line
119) Applying migration 47b1b840-ff54-11e0-0000-68877f281daf Update
column family to
org.apache.cassandra.config.CFMetaData@23de1953[cfId=1000,ksName=MSA,cfName=uid,cfType=Standard,comparator=org.apache.cassandra.db.marshal.BytesType,subcolumncomparator=<null>,comment=,rowCacheSize=0.0,keyCacheSize=1.5E7,readRepairChance=1.0,replicateOnWrite=true,gcGraceSeconds=3600,defaultValidator=org.apache.cassandra.db.marshal.BytesType,keyValidator=org.apache.cassandra.db.marshal.BytesType,minCompactionThreshold=4,maxCompactionThreshold=32,rowCacheSavePeriodInSeconds=0,keyCacheSavePeriodInSeconds=14400,rowCacheKeysToSave=2147483647,rowCacheProvider=org.apache.cassandra.cache.SerializingCacheProvider@4a50aa8a,mergeShardsChance=0.1,keyAlias=java.nio.HeapByteBuffer[pos=0
lim=3 cap=3],column_metadata={},compactionStrategyClass=class
org.apache.cassandra.db.compaction.LeveledCompactionStrategy,compactionStrategyOptions={sstable_size_in_mb=10},compressionOptions={}]
 INFO [MigrationStage:1] 2011-10-25 16:57:06,694
ColumnFamilyStore.java (line 664) Enqueuing flush of
Memtable-Migrations@1978429475(8806/11007 serialized/live bytes, 1
ops)
 INFO [FlushWriter:307] 2011-10-25 16:57:06,694 Memtable.java (line
237) Writing Memtable-Migrations@1978429475(8806/11007 serialized/live
bytes, 1 ops)



The schema for this CF is

create column family uid
  with column_type = 'Standard'
  and comparator = 'BytesType'
  and default_validation_class = 'BytesType'
  and key_validation_class = 'BytesType'
  and rows_cached = 0.0
  and row_cache_save_period = 0
  and row_cache_keys_to_save = 2147483647
  and keys_cached = 1.5E7
  and key_cache_save_period = 14400
  and read_repair_chance = 1.0
  and gc_grace = 3600
  and min_compaction_threshold = 4
  and max_compaction_threshold = 32
  and replicate_on_write = true
  and row_cache_provider = 'SerializingCacheProvider'
  and compaction_strategy =
'org.apache.cassandra.db.compaction.LeveledCompactionStrategy'
  and compaction_strategy_options = {'sstable_size_in_mb' : '10'};



Any idea why compaction stopped automatically for  this CF?

thanks
Ramesh





On Wed, Oct 26, 2011 at 5:23 AM, Sylvain Lebresne <sylv...@datastax.com> wrote:
> The estimate for the number of keys is computed by summing the key
> estimate for each sstable of the CF. For each sstable, the estimate
> should be fairly good. However, that's when we sum all the sstable estimates
> that we can loose potentially a lot of precision if there is a lot of rows 
> that
> have parts in different sstables. But that in turn would suggest a problem
> with compaction lacking badly behind, especially with leveled compaction.
>
> --
> Sylvain
>
> On Wed, Oct 26, 2011 at 3:58 AM, Terry Cumaranatunge <cumar...@gmail.com> 
> wrote:
>> I have a cluster of 8 nodes all running 1.0. The stats shown on the 1st node
>> on one of the CFs for the number of keys is much larger than expected. The
>> first node shows the key count estimate to be 9.2M whereas the rest report
>> ~650K on each node. The 650K is in the correct neighborhood of the number of
>> keys that have been inserted. The counts are comparable for all other CFs
>> across the cluster. I'm using Level compaction, but no compression.
>>
>> The 'nodetool ring' shows that the load is equal across all nodes. What
>> could cause this large disparity in the number of keys? Is this just a stats
>> issue or does this suggest a functional problem?
>>
>> 1st node:
>>         Column Family: uid
>>                 SSTable count: 395
>>                 Space used (live): 13752624444
>>                 Space used (total): 5482088532
>>                 Number of Keys (estimate): 9215104
>>                 Memtable Columns Count: 514952
>>                 Memtable Data Size: 295213448
>>                 Memtable Switch Count: 290
>>                 Read Count: 193102511
>>                 Read Latency: 0.146 ms.
>>                 Write Count: 176934874
>>                 Write Latency: 0.018 ms.
>>                 Pending Tasks: 0
>>                 Key cache capacity: 8302131
>>                 Key cache size: 8302131
>>                 Key cache hit rate: 0.8644664668071792
>>                 Row cache: disabled
>>                 Compacted row minimum size: 87
>>                 Compacted row maximum size: 7007506
>>                 Compacted row mean size: 8944
>> 2nd node:
>>         Column Family: uid
>>                 SSTable count: 402
>>                 Space used (live): 13723958304
>>                 Space used (total): 4044833220
>>                 Number of Keys (estimate): 652928
>>                 Memtable Columns Count: 170290
>>                 Memtable Data Size: 102378904
>>                 Memtable Switch Count: 272
>>                 Read Count: 192463595
>>                 Read Latency: 0.289 ms.
>>                 Write Count: 176527238
>>                 Write Latency: 0.014 ms.
>>                 Pending Tasks: 0
>>                 Key cache capacity: 8783058
>>                 Key cache size: 8783058
>>                 Key cache hit rate: 0.7865727464740025
>>                 Row cache: disabled
>>                 Compacted row minimum size: 87
>>                 Compacted row maximum size: 7007506
>>                 Compacted row mean size: 12151
>> 3rd node:
>>               Column Family: uid
>>                 SSTable count: 401
>>                 Space used (live): 13204714872
>>                 Space used (total): 4030024144
>>                 Number of Keys (estimate): 675968
>>                 Memtable Columns Count: 42881
>>                 Memtable Data Size: 30992298
>>                 Memtable Switch Count: 304
>>                 Read Count: 190769879
>>                 Read Latency: 0.224 ms.
>>                 Write Count: 175381826
>>                 Write Latency: 0.014 ms.
>>                 Pending Tasks: 0
>>                 Key cache capacity: 8920108
>>                 Key cache size: 8920108
>>                 Key cache hit rate: 0.8053563128870577
>>                 Row cache: disabled
>>                 Compacted row minimum size: 87
>>                 Compacted row maximum size: 4866323
>>                 Compacted row mean size: 12074
>>
>

Reply via email to