I just wanted to close the loop for
https://issues.apache.org/jira/browse/CASSANDRA-14902 (update
compaction_throughput_mb_per_sec default from 16 to 64) - the default will
become 64 for 4.0 unless people had strong objections.
I'll update the separate discussion on num_tokens (
https://issues.apache.org/jira/browse/CASSANDRA-13701) but that is getting
finalized after some testing.

On Sat, Jan 25, 2020 at 2:59 AM Alexander Dejanovski <a...@thelastpickle.com>
wrote:

> I support changing the default GC settings. The ones we have now drive me
> nuts.
> We should raise the max heap size for CMS to 16G instead of 8 now. We
> should still not go higher than half the available RAM.
> Also, we should set a new gen size between 40% and 50% of the heap size.
> The 100MB per core rule for computing the new gen size doesn't make any
> sense IMO (at least in the context of Cassandra).
>
> This is one of the most common optimizations we make on clusters, and most
> peeps that run Cassandra aren't GC experts (and shouldn't be).
>
> -----------------
> Alexander Dejanovski
> France
> @alexanderdeja
>
> Consultant
> Apache Cassandra Consulting
> http://www.thelastpickle.com
>
>
> On Fri, Jan 24, 2020 at 3:48 PM Joshua McKenzie <jmcken...@apache.org>
> wrote:
>
> > >
> > > I'm unable to create an epic in the project - not sure if that has to
> do
> > > with project permissions.  Could someone create an epic and link these
> > > tickets as subtasks?
> >
> >
> > Just realized I can no longer create epics anymore (or the "new" JIRA UI
> is
> > just so obtuse I can't figure it out. I give it 50/50 odds). Thinking
> this
> > may have been due to transition to LDAP.
> >
> > Since I planned on experimenting with the whole "what does the confluent
> > testing page look like in an epic" today, I'll ping Nate once it's not
> > godawfully early in NZ about this. Or he'll read this email, either way.
> :)
> >
> > On Thu, Jan 23, 2020 at 11:13 PM Jeremy Hanna <
> jeremy.hanna1...@gmail.com>
> > wrote:
> >
> > > I've previously created
> > > https://issues.apache.org/jira/browse/CASSANDRA-14902 for updating the
> > > compaction_throughput_in_mb default.  I created
> > > https://issues.apache.org/jira/browse/CASSANDRA-15521 for updating the
> > > num_tokens default,
> > https://issues.apache.org/jira/browse/CASSANDRA-15522
> > > for updating the [roles|permissions|credentials]_validity_in_ms
> defaults,
> > > and https://issues.apache.org/jira/browse/CASSANDRA-15523 for updating
> > the
> > > default snitch to GossipingPropertyFileSnitch.
> > > I'm unable to create an epic in the project - not sure if that has to
> do
> > > with project permissions.  Could someone create an epic and link these
> > > tickets as subtasks?
> > > Jon - would you mind creating the ticket around JVM defaults?  Are you
> > > thinking of the default GC and settings for a better out of the box
> > > experience?
> > > Thanks all,
> > > Jeremy
> > >
> > > On Fri, Jan 24, 2020 at 1:57 PM Jon Haddad <j...@jonhaddad.com> wrote:
> > >
> > > > Yes. please do. We should also update our JVM defaults.
> > > >
> > > > On Thu, Jan 23, 2020, 9:28 PM Jeremy Hanna <
> jeremy.hanna1...@gmail.com
> > >
> > > > wrote:
> > > >
> > > > > To summarize this thread, I think people are generally okay with
> > > updating
> > > > > certain defaults for 4.0 provided we make sure it doesn't
> > unpleasantly
> > > > > surprise cluster operators.  I think with the num_tokens and
> > > > > compaction_throughput_in_mb we could go with a release note for the
> > > > reasons
> > > > > in my last email.  I also agree that we should consider bump
> > > > > roles_validity_in_ms, permissions_validity_in_ms, and
> > > > >    credentials_validity_in_ms along with the default snitch (going
> to
> > > > GPFS
> > > > > as the default) as that gives people a DC aware default at least to
> > > > start.
> > > > >
> > > > > Is everyone okay if I create tickets for each of these and link
> them
> > > with
> > > > > an epic so that we can discuss them separately?
> > > > >
> > > > > Thanks,
> > > > >
> > > > > Jeremy
> > > > >
> > > > > On Thu, Jan 23, 2020 at 5:34 AM Alex Ott <alex...@gmail.com>
> wrote:
> > > > >
> > > > > > In addition to these, maybe we could consider to change other as
> > > well?
> > > > > > Like:
> > > > > >
> > > > > > 1. bump roles_validity_in_ms, permissions_validity_in_ms, and
> > > > > >    credentials_validity_in_ms as well - maybe at least to a
> minute,
> > > or
> > > > > 2. I
> > > > > >    have seen multiple times when authentication was failing under
> > the
> > > > > heavy
> > > > > >    load because queries to system tables were timing out - with
> > these
> > > > > >    defaults people may still have the possibility to get updates
> to
> > > > > >    roles/credentials faster when specifying _update_interval_
> > > variants
> > > > of
> > > > > >    these configurations.
> > > > > > 2. change default snitch from SimpleSnitch to
> > > > > GossipingPropertyFileSnitch -
> > > > > >    we're anyway saying that SimpleSnitch is only appropriate for
> > > > > >    single-datacenter deployments, and for real production we need
> > to
> > > > use
> > > > > >    GossipingPropertyFileSnitch - why not to set it as default?
> > > > > >
> > > > > >
> > > > > > Jeremy Hanna  at "Wed, 22 Jan 2020 11:22:36 +1100" wrote:
> > > > > >  JH> I mentioned this in the contributor meeting as a topic to
> > bring
> > > up
> > > > > on
> > > > > > the list - should we
> > > > > >  JH> take the opportunity to update defaults for Cassandra 4.0?
> > > > > >
> > > > > >  JH> The rationale is two-fold:
> > > > > >  JH> 1) There are best practices and tribal knowledge around
> > certain
> > > > > > properties where people
> > > > > >  JH> just know to update those properties immediately as a
> starting
> > > > > > point.  If it's pretty much
> > > > > >  JH> a given that we set something as a starting point different
> > than
> > > > the
> > > > > > current defaults, why
> > > > > >  JH> not make that the new default?
> > > > > >  JH> 2) We should align the defaults with what we test with.
> There
> > > may
> > > > > be
> > > > > > exceptions if we
> > > > > >  JH> have one-off tests but on the whole, we should be testing
> with
> > > > > > defaults.
> > > > > >
> > > > > >  JH> As a starting point, compaction throughput and number of
> > vnodes
> > > > seem
> > > > > > like good candidates
> > > > > >  JH> but it would be great to get feedback for any others.
> > > > > >
> > > > > >  JH> For compaction throughput (
> > > > > > https://jira.apache.org/jira/browse/CASSANDRA-14902), I've made
> > > > > >  JH> a basic case on the ticket to default to 64 just as a
> starting
> > > > point
> > > > > > because the decision
> > > > > >  JH> for 16 was made when spinning disk was most common.  Hence
> > most
> > > > > > people I know change that
> > > > > >  JH> and I think without too much bikeshedding, 64 is a
> reasonable
> > > > > > starting point.  A case
> > > > > >  JH> could be made that empirically the compaction throughput
> > > throttle
> > > > > may
> > > > > > have less effect
> > > > > >  JH> than many people think, but I still think an updated default
> > > would
> > > > > > make sense.
> > > > > >
> > > > > >  JH> For number of vnodes, Michael Shuler made the point in the
> > > > > discussion
> > > > > > that we already test
> > > > > >  JH> with 32, which is a far better number than the 256
> default.  I
> > > > know
> > > > > > many new users that
> > > > > >  JH> just leave the 256 default and then discover later that it's
> > > > better
> > > > > > to go lower.  I think
> > > > > >  JH> 32 is a good balance.  One could go lower with the new
> > algorithm
> > > > but
> > > > > > I think 32 is much
> > > > > >  JH> better than 256 without being too skewed, and it's what we
> > > > currently
> > > > > > test.
> > > > > >
> > > > > >  JH> Jeff brought up a good point that we want to be careful with
> > > > > defaults
> > > > > > since changing them
> > > > > >  JH> could come as an unpleasant surprise to people who don't
> > > > explicitly
> > > > > > set them.  As a
> > > > > >  JH> general rule, we should always update release notes to
> clearly
> > > > state
> > > > > > that a default has
> > > > > >  JH> changed.  For these two defaults in particular, I think it's
> > > safe.
> > > > > > For compaction
> > > > > >  JH> throughput I think a release not is sufficient in case they
> > want
> > > > to
> > > > > > modify it.  For number
> > > > > >  JH> of vnodes, it won't affect existing deployments with data -
> it
> > > > would
> > > > > > be for new clusters,
> > > > > >  JH> which would honestly benefit from this anyway.
> > > > > >
> > > > > >  JH> The other point is whether it's too late to go into 4.0.
> For
> > > > these
> > > > > > two changes, I think
> > > > > >  JH> significant testing can still be done with these new
> defaults
> > > > before
> > > > > > release and I think
> > > > > >  JH> testing more explicitly with 32 vnodes in particular will
> give
> > > > > people
> > > > > > more confidence in
> > > > > >  JH> the lower number with a wider array of testing (where we
> don't
> > > > > > already use 32 explicitly).
> > > > > >
> > > > > >  JH> In summary, are people okay with considering updating these
> > > > defaults
> > > > > > and possibly others
> > > > > >  JH> in the alpha stage of a new major release?  Are there other
> > > > > > properties to consider?
> > > > > >
> > > > > >  JH> Jeremy
> > > > > >  JH>
> > > > >
> ---------------------------------------------------------------------
> > > > > >  JH> To unsubscribe, e-mail:
> dev-unsubscr...@cassandra.apache.org
> > > > > >  JH> For additional commands, e-mail:
> > dev-h...@cassandra.apache.org
> > > > > >
> > > > > >
> > > > > >
> > > > > > --
> > > > > > With best wishes,                    Alex Ott
> > > > > > Principal Architect, DataStax
> > > > > > http://datastax.com/
> > > > > >
> > > > > >
> > ---------------------------------------------------------------------
> > > > > > To unsubscribe, e-mail: dev-unsubscr...@cassandra.apache.org
> > > > > > For additional commands, e-mail: dev-h...@cassandra.apache.org
> > > > > >
> > > > > >
> > > > >
> > > >
> > >
> >
>

Reply via email to