Ordering between partitions.

2014-10-10 Thread Aleksandar Stojadinovic

Hello,

I've been searching for this for some time and I don't know if I'm just 
not looking for the right thing or it is just something not to be done.


We are analyzing Cassandra as a time series storage for some sensory 
data uploaded from a mobile phone. Of course, we would use the wide row 
technique partitioned by some parameter. We are not sure should it be a 
(value type, rounded time stamp) pair or just a rounded time stamp or a 
programmatic concatenation of those two so we can take an advice about that.


But the main issue is how can we get values stored in different 
partitions in an ordered manner. Using a Java client it looks like it is 
not really possible. Should we order the values in code or select all 
the primary keys with an IN keyword (but it seems we lose the pagination 
option then)? Is there a common pattern for this situation? All in all, 
this one bugs me a lot.


Best regards,
Aleksandar


Internode latency requirements

2014-10-10 Thread Siddharth Karandikar
Hello Everyone!

I am going over multiple cassandra documents to understand network
requirements of deploying cassandra cluster. Doacuments mention that
it is recommended to have 1gbps (or more) bandwidth between nodes and
its better to have multiple interfaces. I did not see any specific
latency numbers mentioned there.

Lets say I want to deploy ~200 cassandra nodes and each of them
running at different geographic location. Distance between any 2 nodes
will range from 5ms to 40ms in terms of RTT. I plan to configure nodes
close to each other in one datacenter. And such multiple datacenters
will be connected to each other.

I know this is a very specific requirement. So wanted to know if
cassandra is a right fit there.


Any inputs/suggestions/questions are welcome!


Thanks,
Siddharth


Re: Disabling compaction

2014-10-10 Thread Parag Shah
Cassandra Version: 2.0.7

In my application, I am using Cassandra Java Driver 2.0.2

Thanks
Parag

From: Marcus Eriksson mailto:krum...@gmail.com>>
Reply-To: "user@cassandra.apache.org" 
mailto:user@cassandra.apache.org>>
Date: Thursday, October 9, 2014 at 11:56 PM
To: "user@cassandra.apache.org" 
mailto:user@cassandra.apache.org>>
Subject: Re: Disabling compaction

what version are you on?

On Thu, Oct 9, 2014 at 10:33 PM, Parag Shah 
mailto:ps...@proofpoint.com>> wrote:
Hi all,

 I am trying to disable compaction for a few select tables. Here is a 
definition of one such table:

CREATE TABLE blob_2014_12_31 (
  blob_id uuid,
  blob_index int,
  blob_chunk blob,
  PRIMARY KEY (blob_id, blob_index)
) WITH
  bloom_filter_fp_chance=0.01 AND
  caching='KEYS_ONLY' AND
  comment='' AND
  dclocal_read_repair_chance=0.00 AND
  gc_grace_seconds=864000 AND
  index_interval=128 AND
  read_repair_chance=0.10 AND
  replicate_on_write='true' AND
  populate_io_cache_on_flush='false' AND
  default_time_to_live=0 AND
  speculative_retry='99.0PERCENTILE' AND
  memtable_flush_period_in_ms=0 AND
  compaction={'enabled': 'false', 'class': 'SizeTieredCompactionStrategy'} AND
  compression={'sstable_compression': 'LZ4Compressor’};

I have set compaction ‘enabled’ : ‘false’ on the above table.

However, I do see compactions being run for this node:

-bash-3.2$ nodetool compactionstats
pending tasks: 55
  compaction typekeyspace   table   completed   
total  unit  progress
   Compaction ids_high_awslab blob_2014_11_15 
18122816990 35814893020 bytes50.60%
   Compaction ids_high_awslab blob_2014_12_31 
18576750966 34242866468 bytes54.25%
   Compaction ids_high_awslab blob_2014_12_15 19213914904 
35956698600 bytes53.44%
Active compaction remaining time :   0h49m46s

Can you someone tell me why this is happening? Do I need to set the compaction 
threshold  to 0 0?

Regards
Parag



Re: Internode latency requirements

2014-10-10 Thread Robert Coli
On Fri, Oct 10, 2014 at 1:56 AM, Siddharth Karandikar <
siddharth.karandi...@gmail.com> wrote:

> Lets say I want to deploy ~200 cassandra nodes and each of them
> running at different geographic location. Distance between any 2 nodes
> will range from 5ms to 40ms in terms of RTT. I plan to configure nodes
> close to each other in one datacenter. And such multiple datacenters
> will be connected to each other.
>

That's fine?

People run multi-DC over much slower WAN links than that.

=Rob
http://twitter.com/rcolidba


Re: Disabling compaction

2014-10-10 Thread Marcus Eriksson
this is fixed in 2.0.8; https://issues.apache.org/jira/browse/CASSANDRA-7187

/Marcus

On Fri, Oct 10, 2014 at 7:11 PM, Parag Shah  wrote:

>  Cassandra Version: 2.0.7
>
>  In my application, I am using Cassandra Java Driver 2.0.2
>
>  Thanks
> Parag
>
>   From: Marcus Eriksson 
> Reply-To: "user@cassandra.apache.org" 
> Date: Thursday, October 9, 2014 at 11:56 PM
> To: "user@cassandra.apache.org" 
> Subject: Re: Disabling compaction
>
>   what version are you on?
>
> On Thu, Oct 9, 2014 at 10:33 PM, Parag Shah  wrote:
>
>>  Hi all,
>>
>>   I am trying to disable compaction for a few select tables. Here is
>> a definition of one such table:
>>
>>  CREATE TABLE blob_2014_12_31 (
>>   blob_id uuid,
>>   blob_index int,
>>   blob_chunk blob,
>>   PRIMARY KEY (blob_id, blob_index)
>> ) WITH
>>   bloom_filter_fp_chance=0.01 AND
>>   caching='KEYS_ONLY' AND
>>   comment='' AND
>>   dclocal_read_repair_chance=0.00 AND
>>   gc_grace_seconds=864000 AND
>>   index_interval=128 AND
>>   read_repair_chance=0.10 AND
>>   replicate_on_write='true' AND
>>   populate_io_cache_on_flush='false' AND
>>   default_time_to_live=0 AND
>>   speculative_retry='99.0PERCENTILE' AND
>>   memtable_flush_period_in_ms=0 AND
>>   compaction={'enabled': 'false', 'class':
>> 'SizeTieredCompactionStrategy'} AND
>>   compression={'sstable_compression': 'LZ4Compressor’};
>>
>>  I have set compaction ‘enabled’ : ‘false’ on the above table.
>>
>>  However, I do see compactions being run for this node:
>>
>>  -bash-3.2$ nodetool compactionstats
>> pending tasks: 55
>>   compaction typekeyspace   table   completed
>>   total  unit  progress
>>Compaction ids_high_awslab blob_2014_11_15 18122816990
>> 35814893020 bytes50.60%
>>Compaction ids_high_awslab blob_2014_12_31 18576750966
>> 34242866468 bytes54.25%
>>Compaction ids_high_awslab blob_2014_12_15 19213914904
>> 35956698600 bytes53.44%
>> Active compaction remaining time :   0h49m46s
>>
>>  Can you someone tell me why this is happening? Do I need to set the
>> compaction threshold  to 0 0?
>>
>>  Regards
>>  Parag
>>
>
>


Re: Disabling compaction

2014-10-10 Thread Parag Shah
Thanks, Marcus.

Regards
Parag

From: Marcus Eriksson mailto:krum...@gmail.com>>
Reply-To: "user@cassandra.apache.org" 
mailto:user@cassandra.apache.org>>
Date: Friday, October 10, 2014 at 10:59 AM
To: "user@cassandra.apache.org" 
mailto:user@cassandra.apache.org>>
Subject: Re: Disabling compaction

this is fixed in 2.0.8; 
https://issues.apache.org/jira/browse/CASSANDRA-7187

/Marcus

On Fri, Oct 10, 2014 at 7:11 PM, Parag Shah 
mailto:ps...@proofpoint.com>> wrote:
Cassandra Version: 2.0.7

In my application, I am using Cassandra Java Driver 2.0.2

Thanks
Parag

From: Marcus Eriksson mailto:krum...@gmail.com>>
Reply-To: "user@cassandra.apache.org" 
mailto:user@cassandra.apache.org>>
Date: Thursday, October 9, 2014 at 11:56 PM
To: "user@cassandra.apache.org" 
mailto:user@cassandra.apache.org>>
Subject: Re: Disabling compaction

what version are you on?

On Thu, Oct 9, 2014 at 10:33 PM, Parag Shah 
mailto:ps...@proofpoint.com>> wrote:
Hi all,

 I am trying to disable compaction for a few select tables. Here is a 
definition of one such table:

CREATE TABLE blob_2014_12_31 (
  blob_id uuid,
  blob_index int,
  blob_chunk blob,
  PRIMARY KEY (blob_id, blob_index)
) WITH
  bloom_filter_fp_chance=0.01 AND
  caching='KEYS_ONLY' AND
  comment='' AND
  dclocal_read_repair_chance=0.00 AND
  gc_grace_seconds=864000 AND
  index_interval=128 AND
  read_repair_chance=0.10 AND
  replicate_on_write='true' AND
  populate_io_cache_on_flush='false' AND
  default_time_to_live=0 AND
  speculative_retry='99.0PERCENTILE' AND
  memtable_flush_period_in_ms=0 AND
  compaction={'enabled': 'false', 'class': 'SizeTieredCompactionStrategy'} AND
  compression={'sstable_compression': 'LZ4Compressor’};

I have set compaction ‘enabled’ : ‘false’ on the above table.

However, I do see compactions being run for this node:

-bash-3.2$ nodetool compactionstats
pending tasks: 55
  compaction typekeyspace   table   completed   
total  unit  progress
   Compaction ids_high_awslab blob_2014_11_15 
18122816990 35814893020 bytes50.60%
   Compaction ids_high_awslab blob_2014_12_31 
18576750966 34242866468 bytes54.25%
   Compaction ids_high_awslab blob_2014_12_15 19213914904 
35956698600 bytes53.44%
Active compaction remaining time :   0h49m46s

Can you someone tell me why this is happening? Do I need to set the compaction 
threshold  to 0 0?

Regards
Parag