Installing a Cassandra cluster with multiple Linux OSs (Ubuntu+CentOS)

2018-10-14 Thread Eyal Bar
Hi all,

Did anyone installed a Cassandra cluster with mixed Linux OSs where some of
the nodes were ubuntu 12\14\16 and some of the nodes where CentOS7?

Will it work without issues?

*Rational*: We have a 40 servers cluster which was originally installed
only with Ubuntu servers. Now we want to move to CentOS 7 but the effort of
reinstalling the entire cluster + migration to CentOS 7 is not simple. So
we thought about adding new CentOS 7 nodes to the existing cluster and
gradually remove the Ubuntu ones.

Would love to read your thoughts.

Best,

-- 
Eyal Bar
Big Data Ops Team Lead | Data Platform and Monitoring  | Kenshoo
*Office* +972 (3) 746-6500 *473
*Mobile* +972 (52) 458-6100
www.Kenshoo.com 
[image: Transform your marketing. Grow your business.]


-- 
This e-mail, as well as any attached document, may contain material which 
is confidential and privileged and may include trademark, copyright and 
other intellectual property rights that are proprietary to Kenshoo Ltd,  
its subsidiaries or affiliates ("Kenshoo"). This e-mail and its attachments 
may be read, copied and used only by the addressee for the purpose(s) for 
which it was disclosed herein. If you have received it in error, please 
destroy the message and any attachment, and contact us immediately. If you 
are not the intended recipient, be aware that any review, reliance, 
disclosure, copying, distribution or use of the contents of this message 
without Kenshoo's express permission is strictly prohibited.


Re: Installing a Cassandra cluster with multiple Linux OSs (Ubuntu+CentOS)

2018-10-14 Thread Jeff Jirsa
Should be fine, just get the java and kernel versions and kernel tuning params 
as close as possible



-- 
Jeff Jirsa


> On Oct 14, 2018, at 5:09 PM, Eyal Bar  wrote:
> 
> Hi all,
> 
> Did anyone installed a Cassandra cluster with mixed Linux OSs where some of 
> the nodes were ubuntu 12\14\16 and some of the nodes where CentOS7?
> 
> Will it work without issues?
> 
> Rational: We have a 40 servers cluster which was originally installed only 
> with Ubuntu servers. Now we want to move to CentOS 7 but the effort of 
> reinstalling the entire cluster + migration to CentOS 7 is not simple. So we 
> thought about adding new CentOS 7 nodes to the existing cluster and gradually 
> remove the Ubuntu ones. 
> 
> Would love to read your thoughts.
> 
> Best,
> 
> -- 
> Eyal Bar
> Big Data Ops Team Lead | Data Platform and Monitoring  | Kenshoo
> Office +972 (3) 746-6500 *473
> Mobile +972 (52) 458-6100
> www.Kenshoo.com
> 
> 
> This e-mail, as well as any attached document, may contain material which is 
> confidential and privileged and may include trademark, copyright and other 
> intellectual property rights that are proprietary to Kenshoo Ltd,  its 
> subsidiaries or affiliates ("Kenshoo"). This e-mail and its attachments may 
> be read, copied and used only by the addressee for the purpose(s) for which 
> it was disclosed herein. If you have received it in error, please destroy the 
> message and any attachment, and contact us immediately. If you are not the 
> intended recipient, be aware that any review, reliance, disclosure, copying, 
> distribution or use of the contents of this message without Kenshoo's express 
> permission is strictly prohibited.


Re: Cassandra: Inconsistent data on reads (LOCAL_QUORUM)

2018-10-14 Thread Jeff Jirsa
If this is 2.1 AND you do deletes AND you have a non-zero number of failed 
writes (timeouts), it’s possibly short reads

3.0 fixes this ( https://issues.apache.org/jira/browse/CASSANDRA-12872 ), it 
won’t be backported to 2.1 because it’s a significant change to how reads are 
executed


-- 
Jeff Jirsa


> On Oct 13, 2018, at 7:24 PM, Naik, Ninad  wrote:
> 
> Thanks Maitrayee. I should have mentioned this as one of the things we 
> verified. The clocks on cassandra nodes are in sync. 
> From: maitrayee shah 
> Sent: Friday, October 12, 2018 6:40:25 PM
> To: user@cassandra.apache.org
> Subject: Re: Cassandra: Inconsistent data on reads (LOCAL_QUORUM)
>  
> [ This email has been sent from a source external to Epsilon. Please use 
> caution when clicking links or opening attachments. ]
> 
> We have seen inconsistent read if the clock on the nodes are not in sync. 
> 
> 
> Thank you 
> 
> Sent from my iPhone
> 
> On Oct 12, 2018, at 1:50 PM, Naik, Ninad  wrote:
> 
>> Hello,
>> 
>> We're seeing inconsistent data while doing reads on cassandra. Here are the 
>> details:
>> 
>> It's is a wide column table. The columns can be added my multiple machines, 
>> and read by multiple machines. The time between writes and reads are in 
>> minutes, but sometimes can be in seconds. Writes happen every 2 minutes.
>> 
>> Now, while reading we're seeing the following cases of inconsistent reads:
>> 
>> One column was added. If a read was done after the column was added (20 secs 
>> to 2 minutes after the write), Cassandra returns no data. As if the key 
>> doesn't exist. If the application retries, it gets the data.
>> A few columns exist for a row key. And a new column 'n' was added. Again, a 
>> read happens a few minutes after the write. This time, only the latest 
>> column 'n' is returned. In this case the app doesn't know that the data is 
>> incomplete so it doesn't retry. If we manually retry, we see all the columns.
>> A few columns exist for a row key. And a new column 'n' is added. When a 
>> read happens after the write, all columns but 'n' are returned.
>> Here's what we've verified:
>> 
>> Both writes and reads are using 'LOCAL_QUORUM' consistency level.
>> The replication is within local data center. No remote data center is 
>> involved in the read or write.
>> During the inconsistent reads, none of the nodes are undergoing GC pauses
>> There are no errors in cassandra logs
>> Reads always happen after the writes.
>> A few other details: Cassandra version: 2.1.9 DataStax java driver version: 
>> 2.1.10.2 Replication Factor: 3
>> 
>> We don't see this problem in lower environments. We have seen this happen 
>> once or twice last year, but since last few days it's happening quite 
>> frequently. On an average 2 inconsistent reads every minute.
>> 
>> Here's how the table definition looks like:
>> 
>> CREATE TABLE "MY_TABLE" (
>>   key text,
>>   sub_key text,
>>   value text,
>>   PRIMARY KEY ((key), sub_key)
>> ) WITH
>>   bloom_filter_fp_chance=0.01 AND
>>   caching='{"keys":"ALL", "rows_per_partition":"NONE"}' AND
>>   comment='' AND
>>   dclocal_read_repair_chance=0.10 AND
>>   gc_grace_seconds=864000 AND
>>   read_repair_chance=0.00 AND
>>   default_time_to_live=0 AND
>>   speculative_retry='ALWAYS' AND
>>   memtable_flush_period_in_ms=0 AND
>>   compaction={'class': 'SizeTieredCompactionStrategy'} AND
>>   compression={'sstable_compression': 'LZ4Compressor'};
>> Please point us in the right direction. Thanks !
>> 
>>  
>> 
>> The information contained in this e-mail message and any attachments may be 
>> privileged and confidential. If the reader of this message is not the 
>> intended recipient or an agent responsible for delivering it to the intended 
>> recipient, you are hereby notified that any review, dissemination, 
>> distribution or copying of this communication is strictly prohibited. If you 
>> have received this communication in error, please notify the sender 
>> immediately by replying to this e-mail and delete the message and any 
>> attachments from your computer.
>> 
> 
>  
> 
> The information contained in this e-mail message and any attachments may be 
> privileged and confidential. If the reader of this message is not the 
> intended recipient or an agent responsible for delivering it to the intended 
> recipient, you are hereby notified that any review, dissemination, 
> distribution or copying of this communication is strictly prohibited. If you 
> have received this communication in error, please notify the sender 
> immediately by replying to this e-mail and delete the message and any 
> attachments from your computer.