Re: scylladb

2017-03-09 Thread Avi Kivity
ScyllaDB engineer here. C++ is really an enabling technology here. It is directly responsible for a small fraction of the gain by executing faster than Java. But it is indirectly responsible for the gain by allowing us direct control over memory and threading. Just as an example, Scylla star

Re: scylladb

2017-03-09 Thread Bhuvan Rawal
I'd say the benchmark would be complete only when the necessary system benchmarks are provided at the point of inflexion. Looking at the ScyllaDB report, it is unclear which system parameter was the bottleneck. Also an observation: it's mentioned in the report that they are using 1KB ro and

Re: HELP with bulk loading

2017-03-09 Thread Stefania Alborghetti
When I tested cqlsh COPY FROM for CASSANDRA-11053, I was able to import about 20 GB in under 4 minutes on a cluster with 8 nodes u

Re: HELP with bulk loading

2017-03-09 Thread Ryan Svihla
I suggest using cassandra loader https://github.com/brianmhess/cassandra-loader On Mar 9, 2017 5:30 PM, "Artur R" wrote: > Hello all! > > There are ~500gb of CSV files and I am trying to find the way how to > upload them to C* table (new empty C* cluster of 3 nodes, replication > factor 2) with

Re: scylladb

2017-03-09 Thread Kant Kodali
I don't think ScyllaDB's performance is because of C++. The design decisions in ScyllaDB are indeed different from Cassandra's, such as getting rid of SEDA and moving to TPC and so on. If someone thinks it is because of C++, then just show the benchmarks that prove it is indeed C++ which gave 10X pe

HELP with bulk loading

2017-03-09 Thread Artur R
Hello all! There are ~500 GB of CSV files, and I am trying to find a way to upload them to a C* table (a new, empty C* cluster of 3 nodes, replication factor 2) within a reasonable time (say, 10 hours using 3-4 instances of c3.8xlarge EC2 nodes). My first impulse was to use CQLSSTableWriter, but it i

Re: scylladb

2017-03-09 Thread Richard L. Burton III
They spend an enormous amount of time focusing on performance. You can expect them to continue on with their optimization and keep crushing it. P.S., I don't work for ScyllaDB. On Thu, Mar 9, 2017 at 6:02 PM, Rakesh Kumar wrote: > In all of their presentation they keep harping on the fact that

Re: scylladb

2017-03-09 Thread Rakesh Kumar
In all of their presentations they keep harping on the fact that ScyllaDB is written in C++ and does not carry the overhead of Java. Still, the difference looks staggering. From: daemeon reiydelle Sent: Thursday, March 9, 2017 14:21 To: user@cassandra.apac

Re: autoscaling

2017-03-09 Thread Jonathan Haddad
I wouldn't say it's not possible. It's just not very straightforward. The needs of each cluster will be different. For instance, some clusters will need to scale based on disk usage and ingestion rate, while others will need to scale based on latency and throughput. A p90 SLA looks a lot differ

Re: autoscaling

2017-03-09 Thread Matija Gobec
Hi, Autoscaling is not possible with a Cassandra cluster. Any topology change triggers a series of streaming operations and data shuffling around the cluster. Scaling the cluster up or down is an operational challenge that is usually planned in production because of the performance impact it can have. Matija

Re: A Single Dropped Node Fails Entire Read Queries

2017-03-09 Thread Ryan Svihla
What are your keyspace replication settings, and what's your query? On Thu, Mar 9, 2017 at 9:32 AM, Shalom Sagges wrote: > Hi Cassandra Users, > > I hope someone could help me understand the following scenario: > > Version: 3.0.9 > 3 nodes per DC > 3 DCs in the cluster. > Consistency Local_Quorum. >

A Single Dropped Node Fails Entire Read Queries

2017-03-09 Thread Shalom Sagges
Hi Cassandra Users, I hope someone could help me understand the following scenario: Version: 3.0.9 3 nodes per DC 3 DCs in the cluster. Consistency Local_Quorum. I did a small resiliency test and dropped a node to check the availability of the data. What I assumed would happen is nothing at all.

autoscaling

2017-03-09 Thread D. Salvatore
Hi, I am using Cassandra 3 with a single DC. I would like to know if there is any tool available for scaling up and down Cassandra automatically. Thanks