Re: split large sstable

2011-11-21 Thread Edward Capriolo
On Mon, Nov 21, 2011 at 11:26 AM, sridhar basam wrote: > > > On Mon, Nov 21, 2011 at 10:34 AM, Edward Capriolo > wrote: > >> >> >> On Mon, Nov 21, 2011 at 10:07 AM, Dan Hendry >> wrote: >> >>> Pretty sure your argument about indirect blocks making large files >>> inefficient only pertains to ex

Re: split large sstable

2011-11-21 Thread sridhar basam
On Mon, Nov 21, 2011 at 10:34 AM, Edward Capriolo wrote: > > > On Mon, Nov 21, 2011 at 10:07 AM, Dan Hendry wrote: > >> Pretty sure your argument about indirect blocks making large files >> inefficient only pertains to ext2/3 and not ext4. It seems ext4 replaces >> the >> 'indirect block' approach

Re: split large sstable

2011-11-21 Thread Edward Capriolo
e tiered compaction. > > Dan > > -Original Message- > From: Radim Kolar [mailto:h...@sendmail.cz] > Sent: November-19-11 19:42 > To: user@cassandra.apache.org > Subject: Re: split large sstable > > Dne 17.11.2011 17:42, Dan Hendry napsal(a): > > What

Re: split large sstable

2011-11-21 Thread Zhu Han
gt; compelling reason ext4 should be chosen (over ext3) for Cassandra - at > least > when using size tiered compaction. > An alternative is XFS, which is also extent based. > > Dan > > -Original Message- > From: Radim Kolar [mailto:h...@sendmail.cz] > Sent: Nov

RE: split large sstable

2011-11-21 Thread Dan Hendry
mber-19-11 19:42 To: user@cassandra.apache.org Subject: Re: split large sstable Dne 17.11.2011 17:42, Dan Hendry napsal(a): > What do you mean by ' better file offset caching'? Presumably you mean > 'better page cache hit rate'? fs metadata used to find blocks in smaller f

Re: split large sstable

2011-11-19 Thread Radim Kolar
Dne 17.11.2011 17:42, Dan Hendry napsal(a): What do you mean by ' better file offset caching'? Presumably you mean 'better page cache hit rate'? fs metadata used to find blocks in smaller files are cached better. Large files are using indirect blocks and you need more reads to find correct bloc

RE: split large sstable

2011-11-17 Thread Dan Hendry
ge- From: Radim Kolar [mailto:h...@sendmail.cz] Sent: November-17-11 5:02 To: user@cassandra.apache.org Subject: split large sstable Is there some simple way how to split large sstable into several smaller ones? I increased min_compaction_threshold (smaller tables seems to get better file offset

split large sstable

2011-11-17 Thread Radim Kolar
Is there some simple way how to split large sstable into several smaller ones? I increased min_compaction_threshold (smaller tables seems to get better file offset caching from OS) and now i need to reshuffle data to smaller sstables, running several cluster wide repairs worked well just