Re: [ANN] pithos is cassandra-backed S3 compatible object store

2014-03-27 Thread Pierre-Yves Ritschard
Pierre-Yves Ritschard wrote on 27/03/2014 08:12:15: > > > From: Pierre-Yves Ritschard > > To: user , > > Date: 27/03/2014 08:12 > > Subject: [ANN] pithos is cassandra-backed S3 compatible object store > > > > Hi, > > > > If you're alread

[ANN] pithos is cassandra-backed S3 compatible object store

2014-03-27 Thread Pierre-Yves Ritschard
Hi, If you're already using Cassandra for storing your data, you might be interested in http://pithos.io which provides S3 compatibility. The underlying schema splits files into several blocks, which are themselves split into chunks. I'm looking forward to all your comments on the schema, code and of cou

Re: Cassandra source code explained

2013-01-23 Thread Pierre-Yves Ritschard
I found this talk to be a good starting point to dive into the code: http://blip.tv/datastax/cassandra-internals-5491363 On Wed, Jan 23, 2013 at 2:33 AM, Manu Zhang wrote: > On Wed 23 Jan 2013 01:10:58 AM CST, Radek Gruchalski wrote: > >> Thank you. I found this but was hoping that there's anyt

Re: Last Modified Time Series in cassandra

2013-01-03 Thread Pierre-Yves Ritschard
You can use an approach with two CFs. The first one would be:

ExpiredCF
 |-- File = Key
 |-- Reversed(TimeUUID) (representing) last change

In this CF, each entry is expired (after a day, an hour, whatever).

EventualCF
 |-- File = Key
 |-- String

Storing a file update for a key 'K' at a time

Re: CQL3 Compound Primary Keys - Do I have the right idea?

2012-12-28 Thread Pierre-Yves Ritschard
OK, so great news: it is now possible to do this in CQL with the following syntax, as per CASSANDRA-4179:

    CREATE TABLE foo (
      host text,
      service text,
      metric int,
      PRIMARY KEY ((host, service))
    );

(Note the double parentheses.) This will effectively create a CF whose row key is a composite type.
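As a rough illustration of what a composite row key means for lookups, here is a minimal in-memory sketch in Python. It is not Cassandra code and does not use any driver; the `table` dict, the `upsert` and `select` helpers, and the sample values are all hypothetical. It assumes only what the message states: the row key is the pair (host, service) and `metric` is a regular column.

```python
# Hypothetical in-memory model, not Cassandra itself: the row (partition)
# key is the tuple (host, service) and metric is an ordinary column.
table = {}


def upsert(host, service, metric):
    """Write a row; a second write with the same composite key replaces it."""
    table[(host, service)] = {"metric": metric}


def select(host, service):
    """Look a row up by its full composite key; None if it does not exist."""
    return table.get((host, service))


upsert("web1", "http", 200)
upsert("web1", "ssh", 22)
upsert("web1", "http", 404)  # same (host, service): overwrites the first row

print(select("web1", "http"))
print(len(table))
```

The point of the sketch is that both parts of the key are needed to locate a row: ("web1", "http") and ("web1", "ssh") are distinct rows, whereas a plain `PRIMARY KEY (host, service)` would instead partition by host alone.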

Re: CQL3 Compound Primary Keys - Do I have the right idea?

2012-12-23 Thread Pierre-Yves Ritschard
only ONE internal column / cell, the shipname > * the internal column / cell "shipname" is a composite of the *value* of > time_seen. e.g. > > Hope that helps. > > > - > Aaron Morton > Freelance Cassandra Developer > New Zealand > >

Re: CQL3 Compound Primary Keys - Do I have the right idea?

2012-12-22 Thread Pierre-Yves Ritschard
Is there still a way to have composite row keys? There are times when you want to partition wide rows by a tuple instead of pushing the composites into column names. Lists could do the trick but would not allow multiple types and aren't allowed as primary keys anyhow. At some point I remember se

Re: RF on per column family basis ?

2012-07-28 Thread Pierre-Yves Ritschard
On Sat, Jul 28, 2012 at 10:46 PM, Ertio Lew wrote: > Is it possible to set Replication Factor on per column family basis so that > I can avoid replicating large text data or other not-so-important data in > CFs with low RF? > > I use hector API. Replication factor is keyspace bound, not CF bound.

Re: restoring a counter

2012-07-26 Thread Pierre-Yves Ritschard
writes: >> restoring a counter column family. I have two questions related to >> this: >> >> a) how does that setting affect C* in a non-restoring start? renew_counter_id regenerates a new "NodeId" for the cassandra VM which is used to keep track of the counter shards the node holds. If you

Re: going back in time

2012-07-24 Thread Pierre-Yves Ritschard
writes: > De : Pierre-Yves Ritschard [mailto:p...@spootnik.org] >> Snapshot and restores are great for point in time recovery. There's no >> particular side-effect if you're willing to accept the downtime. > > Are you sure? The system KS has no book-keeping abo

Re: going back in time

2012-07-24 Thread Pierre-Yves Ritschard
writes: > One of the scenarios I have to have in account for a small Cassandra > cluster (N=4) is > restoring the data back in time. I will have full backups for 15 days, and > it's possible > that I will need to restore, let's say, the data from 10 days ago (don't ask, > I'm not > going in

Attaching a virtual DC

2012-03-21 Thread Pierre-Yves Ritschard
Hi, I need to attach a virtual DC to an existing cluster. My existing cluster uses the Ec2Snitch as endpoint_snitch in cassandra.yaml. Since both my clusters will live in the same AZ in AWS, it seems as though my best bet is to switch to a property file snitch to divide. A couple of questions: *

Re: What linux distro for the Cassandra nodes ?

2012-02-21 Thread Pierre-Yves Ritschard
The FAQ page on ubuntu has valid pointers: http://wiki.apache.org/cassandra/FAQ#ubuntu_hangs On Tue, Feb 21, 2012 at 9:10 PM, Aditya Gupta wrote: > @Edward: That's nice to know, but I also heard that there are some bugs in > some recent Ubuntu version that bubble up with Cassandra specifically, s

Re: cassandra on ec2 lock-ups

2012-02-17 Thread Pierre-Yves Ritschard
sorry for not doing my homework properly: http://wiki.apache.org/cassandra/FAQ#ubuntu_hangs On Fri, Feb 17, 2012 at 2:00 PM, Pierre-Yves Ritschard wrote: > Hi, > > I've experienced several node lock-ups on EC2 instances. I'm running > with the following set-up: > > h

cassandra on ec2 lock-ups

2012-02-17 Thread Pierre-Yves Ritschard
Hi, I've experienced several node lock-ups on EC2 instances. I'm running with the following set-up: heap-new: 800M max-heap: 8G instance type: m2.xlarge java is java version "1.6.0_26" Java(TM) SE Runtime Environment (build 1.6.0_26-b03) Java HotSpot(TM) 64-Bit Server VM (build 20.1-b02, mixed m

Re: Cassandra - row range and column slice

2012-02-17 Thread Pierre-Yves Ritschard
In this case, one query predicate operates on a much smaller range (years), so you could use it as the row key and issue a multigetslicequery in which you set all row keys and specify the slice you're interested in (here: 2001, 2002, 2003, then >= "A", < "D"). On Fri, Feb 17, 2012 at 11:46 AM,
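The suggestion above can be sketched in plain Python. This is only a model of the access pattern, not a call into Hector or any Cassandra client: the `rows` data, the `multiget_slice` helper, and the sample column names are hypothetical. It assumes year strings as row keys and column names kept in sorted order within each row, and it uses the half-open bounds given in the message (>= "A", < "D").

```python
from bisect import bisect_left

# Hypothetical stand-in for a column family: row key (year) -> column names,
# kept sorted the way columns are ordered within a row.
rows = {
    "2001": ["Adams", "Baker", "Dunn"],
    "2002": ["Clark", "Evans"],
    "2003": ["Abbot", "Cole", "Davis"],
    "2004": ["Allen"],
}


def multiget_slice(rows, keys, start, finish):
    """For each requested row key, return columns with start <= name < finish."""
    result = {}
    for key in keys:
        columns = rows.get(key, [])
        lo = bisect_left(columns, start)   # first column >= start
        hi = bisect_left(columns, finish)  # first column >= finish (excluded)
        result[key] = columns[lo:hi]
    return result


selected = multiget_slice(rows, ["2001", "2002", "2003"], "A", "D")
for year, names in selected.items():
    print(year, names)
```

Because the year predicate covers only a handful of values, listing the row keys explicitly is cheap, and the sorted column order makes the name range a contiguous slice within each row.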

Re: how stable is 1.0 these days?

2012-01-26 Thread Pierre-Yves Ritschard
Can you elaborate on the composite types instabilities? Is this specific to hector, as radim's posts suggest? These one-liner answers are quite stressful :) On Thu, Jan 26, 2012 at 1:28 PM, Carlo Pires wrote: > If you need to use composite types and create/drop column families on the > fly

Re: Handling topology changes

2011-12-21 Thread Pierre-Yves Ritschard
A couple of nodes per month, but with peaks. I will test the nodetool move based scenario then. Cheers, - pyr On Wed, Dec 21, 2011 at 10:10 PM, aaron morton wrote: > How often is "relatively often" ? > > * having a fixed amount of nodes with initial tokens and letting new >  ones auto bootstr

Re: Cassandra 0.8 Counters Inverted Index?

2011-10-03 Thread Pierre-Yves Ritschard
Unfortunately there's no way to do this in Cassandra right now, except by using another row as index, like you're doing right now. Of course you could also store by source_id.date and have a batch job iterate over all sources to compute the top 100. It would not be real time any more though. -

Re: [RELEASE] Apache Cassandra 0.8.0 beta1

2011-04-29 Thread Pierre-Yves Ritschard
wse/CASSANDRA-2549 is open to fix this > > On Tue, Apr 26, 2011 at 9:41 AM, Pierre-Yves Ritschard > wrote: > > > > On Fri, 2011-04-22 at 16:49 -0500, Eric Evans wrote: > >> I am pleased to announce the release of Apache Cassandra 0.8.0 beta1. > >> > > >

Re: [RELEASE] Apache Cassandra 0.8.0 beta1

2011-04-27 Thread Pierre-Yves Ritschard
Thanks Jonathan, Should I repackage myself, or do you think updated Debian packages will be made available shortly? Regards, - pyr On Tue, 2011-04-26 at 11:47 -0500, Jonathan Ellis wrote: > https://issues.apache.org/jira/browse/CASSANDRA-2549 is open to fix this >

Re: [RELEASE] Apache Cassandra 0.8.0 beta1

2011-04-26 Thread Pierre-Yves Ritschard
On Fri, 2011-04-22 at 16:49 -0500, Eric Evans wrote: > I am pleased to announce the release of Apache Cassandra 0.8.0 beta1. > Hi, First of all, thanks for this release. Here are a few annoyances I spotted while trying out the published Debian packages: The cassandra-env.sh is run by /bin/