Re: Many really small SSTables

Jan Kesten Thu, 15 Jan 2015 23:38:59 -0800

Hi Eric and all,

I almost expected this kind answer. I did a nodetool compactionstatsalready to see if those sstables are beeing compacted, but on all nodesthere are 0 outstanding compactions (right now in the morning, notrunning any tests on this cluster).

The reported read latency is about 1-3ms and on nodes which have manysstables (new highscore are ~18k sstables). The 99% percentile is about30-40 micros and a cell count of about 80-90 (if I got the docs rightthese are the number of sstables accessed, that changed from 2.0 to 2.1I think as I see this only on testing cluster).

I looks to me that compactions were not triggered. I tried a nodetoolcompact on one node overnight - but that crashed the entire node.


Roland

Am 15.01.2015 um 19:14 schrieb Eric Stevens:

Yes, many sstables can have a huge negative impact read performance,and will also create memory pressure on that node.
There are a lot of things which can produce this effect, and itstrongly also suggests you're falling behind on compaction in general(check nodetool compactionstats, you should have <5outstanding/pending, preferably 0-1). To see whether and how much itis impacting your read performance, check nodetool cfstats<keyspace.table> and nodetool cfhistograms <keyspace> <table>.
On Thu, Jan 15, 2015 at 2:11 AM, Roland Etzenhammer<r.etzenham...@t-online.de <mailto:r.etzenham...@t-online.de>> wrote:
    Hi,

    I'm testing around with cassandra fair a bit, using 2.1.2 which I
    know has some major issues,but it is a test environment. After
    some bulk loading, testing with incremental repairs and running
    out of heap once I found that now I have a quit large number of
    sstables which are really small:

    <1k              0      0,0%
    <10k          2780     76,8%
    <100k         3392     93,7%
    <1000k        3461     95,6%
    <10000k       3471     95,9%
    <100000k      3517     97,1%
    <1000000k     3596     99,3%
    all           3621    100,0%

    76,8% of all sstables in this particular column familiy are
    smaller that 10kB, 93.7% are smaller then 100kB.

    Just for my understanding - does that impact performance? And is
    there any way to reduce the number of sstables? A full run of
    nodetool compact is running for a really long time (more than 1day).

    Thanks for any input,
    Roland

--

i.A. Jan Kesten Systemadministration enercast GmbH Friedrich - Ebert -Straße 104 D–34119 Kassel Tel.: +49 561 / 4739664-0 Fax:(+49)561/4739664-9 mailto: j.kes...@enercast.de http://www.enercast.deAG Kassel HRB 15471 Thomas Landgraf Geschäftsführert.landg...@enercast.de Tel.: (+49)561/4739664-0 FAX: -9 Mobil:(+49)172/6565087 enercast GmbH Friedrich-Ebert-Str. 104 D-34119 KasselHRB15471 http://www.enercast.de Online-Prognosen für erneuerbareEnergien Geschäftsführung: Thomas Landgraf (CEO), Bernd Kratz (CTO),Philipp Rinder (CSO) Diese E-Mail und etwaige Anhänge könnenvertrauliche und/oder rechtlich geschützte Informationen enthalten.Falls Sie nicht der angegebene Empfänger sind oder falls diese E-Mailirrtümlich an Sie adressiert wurde, benachrichtigen Sie uns bitte sofortdurch Antwort-E-Mail und löschen Sie diese E-Mail nebst etwaigen Anlagenvon Ihrem System. Ebenso dürfen Sie diese E-Mail oder ihre Anlagen nichtkopieren oder an Dritte weitergeben. Vielen Dank. This e-mail and anyattachment may contain confidential and/or privileged information. Ifyou are not the named addressee or if this transmission has beenaddressed to you in error, please notify us immediately by reply e-mailand then delete this e-mail and any attachment from your system. Pleaseunderstand that you must not copy this e-mail or any attachment ordisclose the contents to any other person. Thank you for your cooperation.

Re: Many really small SSTables

Reply via email to