lots of DigestMismatchException in cassandra3

2016-11-17 Thread Adeline.Pan
Hi, We are using Cassandra 3.7 and we have a single datacenter of 4 nodes, RF =3, and the consistency level is quorum. I found lots of DigestMismatchException in debug.log, as below: DEBUG [ReadRepairStage:11] 2016-11-17 08:24:57,066 ReadCallback.java:235 - Digest mismatch: org.apache.cassandra.

Re: lots of DigestMismatchException in cassandra3

2016-11-17 Thread Tommy Stendahl
Hi, Could it be CASSANDRA-12090 ? Regards, /Tommy On 2016-11-17 09:47, adeline@thomsonreuters.com wrote: Hi, We are using Cassandra 3.7 and we have a single datacenter of 4 nodes, RF =3, and the consistency level is quorum. I fo

RE: Some questions to updating and tombstone

2016-11-17 Thread Lu, Boying
Very appreciate to all of you, I’ll study the blog. From: Alain RODRIGUEZ [mailto:arodr...@gmail.com] Sent: 2016年11月16日 23:26 To: user@cassandra.apache.org Cc: Fabrice Facorat Subject: Re: Some questions to updating and tombstone Hi Boying, Old value is not tombstone, but remains until compactio

Re: [RELEASE] Apache Cassandra 3.0.10 released

2016-11-17 Thread Oleksandr Shulgin
On Wed, Nov 16, 2016 at 9:17 PM, Michael Shuler wrote: > > The Cassandra team is pleased to announce the release of Apache > Cassandra version 3.0.10. > > Apache Cassandra is a fully distributed database. It is the right choice > when you need scalability and high availability without compromising

Re: [RELEASE] Apache Cassandra 3.0.10 released

2016-11-17 Thread Vladimir Yudovin
Hi, >Does this mean that offheap_objects is still available or that there is no longer support for offheap memtables in version 3.0? If you set offheap_buffers in cassandra.yaml in 3.0.10, you'll get exception offheap_buffers are not available in 3.0. They will be re-introduced in a future

Re: Can nodes in c* cluster run different versions ?

2016-11-17 Thread Fabrice Facorat
As said already by Alain you should make this as short as possible: - streaming operations won't work (repair, bootstrap) - Hinted Handoff won't work as 2 differents major version of cassandra can't shared the same schema version - So no DDL operations (CREATE/ALTER) as you change won't be propagat

Re: [RELEASE] Apache Cassandra 3.0.10 released

2016-11-17 Thread Oleksandr Shulgin
On Thu, Nov 17, 2016 at 1:05 PM, Vladimir Yudovin wrote: > Hi, > >Does this mean that offheap_objects is still available or that there is > no longer support for offheap memtables in version 3.0? > > If you set *offheap_buffers* in cassandra.yaml in 3.0.10, you'll get > exception > > offheap_buf

Re: Schema Changes

2016-11-17 Thread Fabrice Facorat
Schema are propagated by GOSSIP you can check schema propagation cluster wide with nodetool describecluster or "nodetool gossipinfo | grep SCHEMA | cut -f3 -d: | sort | uniq -c" You'd better send your DDL instruction to only one node (for example by using the whitelist load balancing policy with

Re: [RELEASE] Apache Cassandra 3.0.10 released

2016-11-17 Thread Vladimir Yudovin
>My question was about a different option named "offheap_objects". Sorry. Best regards, Vladimir Yudovin, Winguzone - Hosted Cloud Cassandra Launch your cluster in minutes. On Thu, 17 Nov 2016 07:56:10 -0500Oleksandr Shulgin wrote On Thu, N

Any Bulk Load on Large Data Set Advice?

2016-11-17 Thread Joe Olson
I received a grant to do some analysis on netflow data (Local IP address, Local Port, Remote IP address, Remote Port, time, # of packets, etc) using Cassandra and Spark. The de-normalized data set is about 13TB out the door. I plan on using 9 Cassandra nodes (replication factor=3) to store the d

WriteTimeoutExceptions from Storm Topology

2016-11-17 Thread Shalom Sagges
Hi Everyone, We're seeing a lot of WriteTimeoutExceptions coming from our Storm Topology that's writing to Cassandra. The write fails due to failed consistency requirements, however, the Cassandra cluster is working properly and writes shouldn't fail on consistency. We use a RF of 3, but sometimes

Data migration from Oracle to Cassandra

2016-11-17 Thread Shashidhar Rao
Hi, Has anyone done data migration from Oracle to Cassandra taking care of Change data capture. Kindly share the experience about the tools used. Golden Gate, IBM CDC or any tools. Recommendation of any Open Source tools would be highly useful. I need to constantly capture the commits from Oracl

Re: Data migration from Oracle to Cassandra

2016-11-17 Thread Chidambaran Subramanian
More curious than answering the question. Would it be possible to even design something generic here? Would it not depend on the schema? On Thu, Nov 17, 2016 at 8:21 PM, Shashidhar Rao wrote: > Hi, > > Has anyone done data migration from Oracle to Cassandra taking care of > Change data capture.

Re: Any Bulk Load on Large Data Set Advice?

2016-11-17 Thread Jonathan Haddad
If you're only doing this for spark, you'll be much better off using parquet and HDFS or S3. While you *can* do analytics with cassandra, it's not all that great at it. On Thu, Nov 17, 2016 at 6:05 AM Joe Olson wrote: > I received a grant to do some analysis on netflow data (Local IP address, > L

Re: Re : Generic keystore when enabling SSL

2016-11-17 Thread sai krishnam raju potturi
hi Jacob; I would suggest you create your own Certificate Authority, and create a generic keystore and trustore. Cassandra by default does not implement HostName Verification in it's code. All it does is to check if it's peer certificate is signed by the trusted authority ( the root CA i

Bulkloading using CqlBulkOutputFormat Format

2016-11-17 Thread Muhammad Afzal
Hi, I could not find any documentation or help on how to use CqlBulkOutputFormat for bulk loading data into Cassandra. Could anyone please share some guidelines on how to write MR job to bulkload data into Cassandra using CqlBulkOutputFormat. I tried something like shown below, which failed with a

Re: Any Bulk Load on Large Data Set Advice?

2016-11-17 Thread Ben Bromhead
+1 on parquet and S3. Combined with spark running on spot instances your grant money will go much further! On Thu, 17 Nov 2016 at 07:21 Jonathan Haddad wrote: > If you're only doing this for spark, you'll be much better off using > parquet and HDFS or S3. While you *can* do analytics with cassa

Re: Bulkloading using CqlBulkOutputFormat Format

2016-11-17 Thread Muhammad Afzal
Hi, I have tried enabling client mode by Config.setClientMode(true); but it had no effect. Getting the same exception with reducer as well Regards Afzal On Thu, Nov 17, 2016 at 9:54 PM, Muhammad Afzal wrote: > Hi, > I could not find any documentation or help on how to use > CqlBulkO

Re: Any Bulk Load on Large Data Set Advice?

2016-11-17 Thread Jeff Jirsa
Other people are commenting on the appropriateness of Cassandra – they may have a point you should consider, but I’m going to answer the question. 1) Yes, you can generate the sstables in parallel 2) If you use sstable bulk loader interface (sstableloader), it’ll stream to all a

User creation issue

2016-11-17 Thread Jai Bheemsen Rao Dhanwada
Hello, I have a 5*5 node cluster which has Authentication enabled. I created a user and did a list user, but I don't see the user created command: CREATE USER test WITH PASSWORD 'xyz' SUPERUSER; replication : NetworkTopology dc1:5 and dc2:5 for the system_auth keyspace. C* version : 3.0.8 Enabl

Re: User creation issue

2016-11-17 Thread Aoi Kadoya
Did you try to run nodetool repair for system_auth keyspace? 2016-11-17 13:45 GMT-08:00 Jai Bheemsen Rao Dhanwada : > Hello, > > I have a 5*5 node cluster which has Authentication enabled. > > I created a user and did a list user, but I don't see the user created > > command: CREATE USER test WITH

Re: User creation issue

2016-11-17 Thread Jai Bheemsen Rao Dhanwada
Thanks Aoi, The actual issue is because of the Cassandra upgrade. http://www.datastax.com/dev/blog/role-based-access-control-in-cassandra -> upgrade section helped me fix the issue On Thu, Nov 17, 2016 at 2:17 PM, Aoi Kadoya wrote: > Did you try to run nodetool repair for system_auth keyspace?