Re: openjdk for cassandra production cluster

2018-10-10 Thread Christophe Schmitz
It is fixed in 3.11.2 https://issues.apache.org/jira/browse/CASSANDRA-13916 On Thu, 11 Oct 2018 at 02:10 prachirath72 wrote: > Thanks Jonathan, > Is there a ticket/bugid to remove this openjdk WARN. > Want to have a look . > > > Original message > From: Jonathan Haddad > Dat

Re: SSTableMetadata Util

2018-10-01 Thread Christophe Schmitz
://github.com/instaclustr/cassandra-sstable-tools which will provided you with the desired info, plus more info you might find useful. Christophe Schmitz - Instaclustr <https://www.instaclustr.com/> - Cassandra | Kafka | Spark Consulting On Tue, 2 Oct 2018 at 11:31 Pranay akula wrote: >

Re: Cassandra loading data from another table

2018-10-01 Thread Christophe Schmitz
Have a look at using Spark on Cassandra. It's commonly used for data movement / data migration / reconciliation (on top of analytics). You will get much better performances. Christophe Schmitz - Instaclustr <https://www.instaclustr.com/> - Cassandra | Kafka | Spark Consulting On

Re: [EXTERNAL] Re: Rolling back Cassandra upgrades (tarball)

2018-10-01 Thread Christophe Schmitz
ication on the secondary DC. Once you are happy, you can decommission the old version DC (check carefully your application endpoint configuration, local_dc configuration) Hope this helps. Christophe Schmitz - Instaclustr <https://www.instaclustr.com/> - Cassandra | Kafka | Spark Cons

Re: Cassandra loading data from another table

2018-10-01 Thread Christophe Schmitz
Table B during this process. Please test first :) Cheers, Christophe Christophe Schmitz - Instaclustr <https://www.instaclustr.com/> - Cassandra | Kafka | Spark Consulting On Tue, 2 Oct 2018 at 09:18 Richard Xin wrote: > I have a tableA with about a few ten millions record, and I ha

Re: Cassandra Storage per node

2018-09-27 Thread Christophe Schmitz
STCS. Hope this help! Cheers, Christophe Christophe Schmitz - Instaclustr <https://www.instaclustr.com/> - Cassandra | Kafka | Spark Consulting On Fri, 28 Sep 2018 at 14:43 Suresh Rajagopal wrote: > Hi, > > I am new to Cassandra. Is there any recommended maximum data size

Re: GUI clients for Cassandra

2018-05-04 Thread Christophe Schmitz
MV yes, SASI not sure, I would guess yes. On 2 May 2018 at 18:00, Hannu Kröger wrote: > Ah, you are correct! > > However, it’s not being updated anymore AFAIK. Do you know if it support > the latest 3.x features? SASI, MV, etc. ? > > Hannu > > > On 24 Apr 2018, a

Re: Version Upgrade

2018-04-25 Thread Christophe Schmitz
les?? > > I knew when we are doing a major upgrade we have to run upgrade sstables > so that sstables will be re-written to newer version with additional meta > data. > > But do we need to run upgrade sstables for upgrading from let's say 3.0.15 > to 3.0.16 or 3.0.y to 3.11.

Re: GUI clients for Cassandra

2018-04-23 Thread Christophe Schmitz
Hi Hannu ;) > > I have been asked many times that what is a good GUI client for Cassandra. >> DevCenter is not available anymore and DataStax has a DevStudio but that’s >> for DSE only. >> > DevCenter is still available, I just downloaded it. Cheers, Christophe

Re: A Cassandra Storage Estimation Mechanism

2018-04-18 Thread Christophe Schmitz
, i.e. using cassandra-stress. The goal is usually to confirm the throughput / latency. As a side effect, this also gives the disk usage. Hope it helps! Cheers, Christophe > > Thanks in advance > > Sent using Zoho Mail <https://www.zoho.com/mail/> > > > -- *Christophe Schmitz - **VP Consulting* AU: +61 4 03751980 / FR: +33 7 82022899

Re: Why nodetool cleanup should be run sequentially after node joined a cluster

2018-04-10 Thread Christophe Schmitz
ning nodetool cleanup on the next node. > > I had added a new node to the cluster, and running nodetool cleanup > according to this recommendation - but it takes near 10 days to complete on > a single node. Is it safe to start it on all nodes? > -- *Christophe Schmitz - **VP

Re: copy from one table to another

2018-04-08 Thread Christophe Schmitz
ent from my iPhone > > On Apr 8, 2018, at 4:15 AM, onmstester onmstester > wrote: > > Is there any way to copy some part of a table to another table in > cassandra? A large amount of data should be copied so i don't want to fetch > data to client and stream it bac

Re: Apache Cassandra start up script

2018-03-27 Thread Christophe Schmitz
718-844-3841 > PA,USA > > Sent from my iPhone > - > To unsubscribe, e-mail: user-unsubscr...@cassandra.apache.org > For additional commands, e-mail: user-h...@cassandra.apache.org > > -- *Christophe Schmitz - **VP Consulting* AU: +61 4 03751980 / FR: +33 7 82

Re: Execute an external program

2018-03-27 Thread Christophe Schmitz
the edge here, but is there a way to execute an external > program if a new row is added or if an existing row is updated on a table? > > Cheers, > Earl > > -- > There are seven words in this sentence. > -- *Christophe Schmitz - **VP Consulting* AU: +61 4 0375

Re: high latency on one node after replacement

2018-03-27 Thread Christophe Schmitz
first thought was that this is related > to the EBS volumes, but it seems too consistent to be actually caused by > that. The problem is consistent across multiple replacements, and multiple > EC2 regions. > > I appreciate any suggestions! > > - Mike > -- *Christophe Schmi

Re: Measuring eventual consistency latency

2018-03-25 Thread Christophe Schmitz
one > could point me the direction, it will be of great help. > > Thanks ! > -- *Christophe Schmitz - **VP Consulting* AU: +61 4 03751980 / FR: +33 7 82022899 <https://www.facebook.com/instaclustr> <https://twitter.com/instaclustr> <https://www.linkedin.com/compa

Re: Deserialize Map[Int, UDT] to a case class from Spark Connector

2018-03-25 Thread Christophe Schmitz
lass. > targetTypeName(TypeConverter.scala:36) > at com.datastax.spark.connector.types.TypeConverter$ > CollectionConverter.targetTypeName(TypeConverter.scala:682) > at com.datastax.spark.connector.rdd.reader. > GettableDataToMappedTypeConverter.tryConvert( > GettableDataToMappedTypeConverter.scala:156) &g

Re: Cassandra CF Level Metrics (Read, Write Count and Latency)

2017-08-31 Thread Christophe Schmitz
ument and tried setting up the metric as following, > does this is not matching with the total read requests. I am using > "ReadLatency_OneMinuteRate" > > /org.apache.cassandra.metrics:type=ColumnFamily,keyspace=*, > scope=*,name=ReadLatency > > On Thu, Aug 31, 2017 at

Re: Cassandra CF Level Metrics (Read, Write Count and Latency)

2017-08-31 Thread Christophe Schmitz
Hello Jai, Did you have a look at the following page: http://cassandra.apache.org/doc/latest/operating/metrics.html In your case, you would want the following MBeans: org.apache.cassandra.metrics:type=Table keyspace= scope= name= With MetricName set to ReadLatency and WriteLatency Cheers, Chris

Re: Getting all unique keys

2017-08-21 Thread Christophe Schmitz
i > > On Mon, Aug 21, 2017 at 3:43 AM, Christophe Schmitz < > christo...@instaclustr.com> wrote: > >> Hi Avi, >> >> Have you thought of using Spark for that work? If you collocate the spark >> workers on each Cassandra nodes, the spark-cassandra connector wi

Re: Getting all unique keys

2017-08-20 Thread Christophe Schmitz
efficient but how about the approach presented here >>> <http://www.scylladb.com/2017/02/13/efficient-full-table-scans-with-scylla-1-6/> >>> sending queries in parallel and using the token >>> >>> *SELECT DISTINCT id FROM my_table WHERE token(id) >= -9204925292781066255 >>> AND token(id) <= -9223372036854775808; * >>> >>> *or I can just maintain another table with the unique keys * >>> >>> *CREATE TABLE id_only ( id text, >>> >>> PRIMARY KEY (id) )* >>> >>> but I tend not to since it is error prone and will enforce other procedures >>> to maintain data integrity between those two tables . >>> >>> any ideas ? >>> >>> Thanks >>> >>> Avi >>> >>> > -- *Christophe Schmitz* *Director of consulting EMEA*

Re: Cassandra Writes Duplicated/Concatenated List Data

2017-08-16 Thread Christophe Schmitz
Hi Nathan, The code may occasionally write to the same row multiple times. > > Can you run a test using IF NOT EXISTS in your inserts to see if that makes a difference? That shouldn't make a difference, but I don't see what the problem might be at the moment. -- *Ch

Re: Large tombstones creation

2017-08-13 Thread Christophe Schmitz
hold)* > > How it can happen? I have several SASI indexes for this table, can this be > a reason? > > Regards, Vlad > -- *Christophe Schmitz* *Director of consulting EMEA*AU: +61 4 03751980 / FR: +33 7 82022899 <https://www.instaclustr.com> <https://www.facebook.com

DateTieredCompactionStrategy DTCS sometimes stop dropping SSTables

2015-07-20 Thread Christophe Schmitz
Hi there, I am running a 6 node cluster on 2.1.7 with a table using DTCS to store time series data, for up to 12 hours (using ttl of 12h). Data are written as they arrive, without any update or active delete. During the first 12h, the cluster gets filled with data. And a bit later, the amount of d