Re: autoscaling cassandra cluster

2014-05-21 Thread James Horey
You normally don't (ferry auto-generates the IP addresses). Let's move this conversation to the ferry-user google group so that we don't pollute this mailing list... James Sent from my iPhone > On May 21, 2014, at 3:15 PM, Jabbar Azam wrote: > > Hello James, > > How do you alter your cassan

RE: Can SSTables overlap with SizeTieredCompactionStrategy?

2014-05-21 Thread DuyHai Doan
Are you sure there is no TTL set on your data? It might explain the shrink in sstable size after compaction. Le 21 mai 2014 23:17, "Phil Luckhurst" a écrit : > We based the estimate on a previous controlled observation. We generated a > year's worth of one minute data for a single identifier and

RE: Can SSTables overlap with SizeTieredCompactionStrategy?

2014-05-21 Thread Phil Luckhurst
We based the estimate on a previous controlled observation. We generated a year's worth of one minute data for a single identifier and recorded the size of the resulting sstable. By adding the data one month at a time we observed that there was a linear predictable increase in the sstable size. Usi

Re: Is the tarball for a given release in a Maven repository somewhere?

2014-05-21 Thread Clint Kelly
Thanks, Lewis. I created a ticket here: https://issues.apache.org/jira/browse/CASSANDRA-7283 For now I just copied the "cassandra" and "cassandra.in.sh" scripts into my project, along with custom configuration files. We already have all of the necessary JARs in our project's "lib" directory, si

Re: Cassandra pre 2.1 vs 2.1 counter implementation

2014-05-21 Thread Nate McCall
Jonathan covered the changes in some detail at one of our recent meetups (at about 36 minutes in, give or take): http://capitalfactory.lifesize.com/videos/video/309/?access_token=shr0003098845257289498283770596639066969 From: http://www.meetup.com/Austin-Cassandra-Users/events/158857962/ tl;d

Fwd: Cassandra pre 2.1 vs 2.1 counter implementation

2014-05-21 Thread Localhost shell
Hey All, I am new to C* community. We are planning to use Datastax C* (pre 2.1) in production. We heavily use counters and it is mostly what we do apart from storing the few months raw logs in C*. I have gone through the excellent Sylvain Lesbresne ppt

Re: Is the tarball for a given release in a Maven repository somewhere?

2014-05-21 Thread Lewis John Mcgibbney
Hi Clint, On Wed, May 21, 2014 at 5:29 AM, wrote: > > Is the tarball for a given release in a Maven repository somewhere? > > Hi all, > > ...snip > I poked around online and could not find what I was looking for. Any > help would be appreciated! > A Maven repos? No. Currently tar.gz builds ar

Re: autoscaling cassandra cluster

2014-05-21 Thread Jabbar Azam
Hello Ben, I''m looking forward to reading the netflix links. Thanks :) Thanks Jabbar Azam On 21 May 2014 18:08, Ben Bromhead wrote: > The mechanics for it are simple compared to figuring out when to scale, > especially when you want to be scaling before peak load on your cluster > (adding

Re: autoscaling cassandra cluster

2014-05-21 Thread Jabbar Azam
Hello James, How do you alter your cassandra.yaml file with each nodes IP address? I want to use the scaling software(which I've not got yet) to create and destroy the GCE instances. I want to use fleet to deploy and undeploy the cassandra nodes inside the docker instances. I do realise I will ha

Re: How to enable a Cassandra node to participate in multiple cluster

2014-05-21 Thread Jabbar Azam
Hello Salih, As far as I'm aware a node can't be in two clusters. In the casdandra.yaml file you can only specify one cluster. The storage system and all the protocols would have to be modified so information about multiple clusters is passed around. I'm sure somebody else could give you more and

How to enable a Cassandra node to participate in multiple cluster

2014-05-21 Thread Salih Kardan
Hello everyone, I want to use Cassandra cluster for some specific purpose across data centers. What I want to figure out is how can I enable a single Cassandra node to participate in multiple clusters at the same time? I googled it, however I could not find any use case of Cassandra as I mentioned

Re: autoscaling cassandra cluster

2014-05-21 Thread Ben Bromhead
The mechanics for it are simple compared to figuring out when to scale, especially when you want to be scaling before peak load on your cluster (adding and removing nodes puts additional load on your cluster). We are currently building our own in-house solution for this for our customers. If yo

Re: ownership not equally distributed

2014-05-21 Thread Rameez Thonnakkal
This issue is resolved. Don't know the exact root cause though. Did a re-image of the server which was taking less token ownership and done the configuration through chef. Thanks, Rameez On Sat, May 17, 2014 at 1:06 AM, Rameez Thonnakkal wrote: > Hello > > I am having a 4 node cluster where 2

RE: Can SSTables overlap with SizeTieredCompactionStrategy?

2014-05-21 Thread Andreas Finke
Hi Phil, there is no dump question ;) What is your size estimation based on e.g. what size is a column in your calculation? From: Phil Luckhurst [phil.luckhu...@powerassure.com] Sent: Wednesday, May 21, 2014 5:42 PM To: cassandra-u...@incubator.apache.org

Re: Can SSTables overlap with SizeTieredCompactionStrategy?

2014-05-21 Thread Prem Yadav
I would think its because of the index and filter files. Also the additional data which gets added because of serialization. Also, since SStables are only deleted after the compaction us finished, it might be possible that when you checked, the intermediate SSTables were not yet deleted. However,

Data locality with cash

2014-05-21 Thread Jens Rantil
Hi, I've had a look at the Hive plugin for Cassandra[1]. Does anyone know if it supports data locality if I install task trackers and job trackers on my Cassandra instances? [1] https://github.com/tuplejump/cash Thanks, Jens

Re: Can SSTables overlap with SizeTieredCompactionStrategy?

2014-05-21 Thread Phil Luckhurst
I'm wondering if the lack of response to this means it was a dumb question however I've searched the documentation again but I still can't find an answer :-( Phil -- View this message in context: http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/Can-SSTables-overlap-with-SizeTie

Re: autoscaling cassandra cluster

2014-05-21 Thread James Horey
If you're interested and/or need some Cassandra docker images let me know I'll shoot you a link. James Sent from my iPhone > On May 21, 2014, at 10:19 AM, Jabbar Azam wrote: > > That sounds interesting. I was thinking of using coreos with docker > containers for the business logic, fronten

Re: autoscaling cassandra cluster

2014-05-21 Thread Jabbar Azam
That sounds interesting. I was thinking of using coreos with docker containers for the business logic, frontend and Cassandra. I'll also have a look at cassandra-mesos Thanks Jabbar Azam On 21 May 2014 14:04, "Panagiotis Garefalakis" wrote: > I agree with Prem, but recently a guy send this pr

Re: autoscaling cassandra cluster

2014-05-21 Thread Panagiotis Garefalakis
I agree with Prem, but recently a guy send this promising project called Mesos in this list. https://github.com/mesosphere/cassandra-mesos One of its goals is to make scaling easier. I don’t have any personal opinion yet but maybe you could give it a try. Regards, Panagiotis On Wed, May 21, 201

Re: autoscaling cassandra cluster

2014-05-21 Thread Jabbar Azam
Hello Prem, I'm trying to find out whether people are autoscaling up and down automatically, not manually. I'm also interested in whether they are using a cloud based solution and creating and destroying instances. I've found the following regarding GCE https://cloud.google.com/developers/article

Re: CqlStorage can't perform INSERTs with Pig?

2014-05-21 Thread James Schappet
In CQL Updates and Inserts are the same thing. You need to convert your insert statements to UPDATE Here is a quick example loading from a JSON file, into two cassandra tables Notice the the output query is URL Encoded. a = load 'barcode_uuid_mapping_current.json' using JsonLoader('u

Re: autoscaling cassandra cluster

2014-05-21 Thread Prem Yadav
Hi Jabbar, with vnodes, scaling up should not be a problem. You could just add a machines with the cluster/seed/datacenter conf and it should join the cluster. Scaling down has to be manual where you drain the node and decommission it. thanks, Prem On Wed, May 21, 2014 at 12:35 PM, Jabbar Azam

autoscaling cassandra cluster

2014-05-21 Thread Jabbar Azam
Hello, Has anybody got a cassandra cluster which autoscales depending on load or times of the day? I've seen the documentation on the datastax website and that only mentioned adding and removing nodes, unless I've missed something. I want to know how to do this for the google compute engine. Thi