Re: Cassandra Certification

2011-08-14 Thread Joe Stein
Certification is good when a community gets to the point that proverbial management cannot easily discern between posers and those that know what they are talking about. I hope one day Cassandra and it's community grows to that point but as of now there is enough transparency in my opinion. I wou

Re: Cassandra Certification

2011-08-14 Thread samal
Does it really make sense? If yes, I think Apache Cassandra Project (ASF) should offer Open Certification. Other entity can offer courses, training materials.

Cassandra Certification

2011-08-14 Thread Sal Fuentes
Are there any organizations planning on offering certification for Cassandra? -- Salvador Fuentes Jr.

Re: Merged counter shard with a count != 0

2011-08-14 Thread Jonathan Ellis
Did you try what it says to do first? "You need to restart this node with -Dcassandra.renew_counter_id=true to fix." On Sun, Aug 14, 2011 at 12:28 PM, Philippe wrote: > Hi I'm getting the following at startup on one of the nodes on my 3 node > cluster with RF=3. > I have 6 keyspaces each with 10

Re: node restart taking too long

2011-08-14 Thread Jonathan Ellis
Or leave row cache enabled but disable cache saving (and remove the one already on disk). On Sun, Aug 14, 2011 at 5:05 PM, aaron morton wrote: >  INFO [main] 2011-08-14 09:24:52,198 ColumnFamilyStore.java (line 547) > completed loading (1744370 ms; 20 keys) row cache for COMMENT > > It's taki

Re: Best practices when deploying & upgrading a cassandra cluster

2011-08-14 Thread aaron morton
There can be bugs in the trunk. Stuff can be put in and the taken out. The tagged and packages releases are considered stable and supported. You can run trunk in prod if you want to, it's good to get tested, but you will need to pay attention to whats happening with the code base and be prepared

Re: Scalability question

2011-08-14 Thread aaron morton
Multi threaded compaction helps there https://issues.apache.org/jira/browse/CASSANDRA-2191 It's another reason to avoid major / manual compactions which create a single big SSTable. Minor compactions keep things in buckets which means newer SSTable can be compacted needing to read the bigger

Re: Merged counter shard with a count != 0

2011-08-14 Thread aaron morton
It looks like the error was thrown during a minor compaction. There should be a log line from the CompactionManager before hand that says "Compacting…" and lists the SSTables it is going to compact. Check that it's from the same thread, i.e. [CompactionExecutor:6] in the example below With the

Re: node restart taking too long

2011-08-14 Thread aaron morton
> INFO [main] 2011-08-14 09:24:52,198 ColumnFamilyStore.java (line 547) > completed loading (1744370 ms; 20 keys) row cache for COMMENT It's taking 29 minutes to load 200,000 rows in the row cache. Thats a pretty big row cache, I would suggest reducing or disabling it. Background http://w

Re: Unable to repair a node

2011-08-14 Thread Philippe
@Teijo : thanks for the procedure, I hope I won't have to do that Peter, I'll answer inline. Thanks for the detailed answer. > > the number of SSTables for some keyspaces goes dramatically up (from 3 or > 4 > > to several dozens). > > Typically with a long running compaction, such as that trigge

Re: Unable to repair a node

2011-08-14 Thread Teijo Holzer
Forgot to mention, you want to check the following in cassandra.yaml on the node that you bootstrap before you initiate the bootstrap: * Ensure that the initial_token is set to the correct value (see nodetool) * Ensure that the seeds list doesn't contain the IP of the node you are trying to boo

Re: Unable to repair a node

2011-08-14 Thread Peter Schuller
> oh i know you can run rf 3 on a 3 node cluster. more i thought that if you > have one fail you have less nodes than the rf, so the cluster is at less > than rf, and writes might be disabled or something like that, while at 4 you > still have met the rf... A node failing is independent of RF. *De

Re: Unable to repair a node

2011-08-14 Thread Peter Schuller
Sorry about the lack of response to your actual issue. I'm afraid I don't have an exhaustive analysis, but some quick notes: > balanced ring but the other nodes are at 60GB. Each repair basically > generates thousands of pending compactions of various types (SSTable build, > minor, major & validat

Re: Unable to repair a node

2011-08-14 Thread Teijo Holzer
Hi, I took the following steps to get a node that refused to repair back under control. WARNING: This resulted in some data loss for us, YMMV with your replication factor * Turn off all row & key caches via cassandra-cli * Set "disk_access_mode: standard" in cassandra.yaml * Kill Cassandra on

Re: thrift c++ insert Exception [Column value is required]

2011-08-14 Thread Jeremiah Jordan
You can checkout libcassandra for a C++ client built on top of thrift. It is not feature complete, but it is pretty good. https://github.com/matkor/libcassandra On Aug 14, 2011, at 3:59 AM, Konstantinos Chasapis wrote: > Hi, > Thank you for your answer. Is there any documentation that describ

Re: Best practices when deploying & upgrading a cassandra cluster

2011-08-14 Thread Eldad Yamin
Is there any good reason why shouldn't we build the latest version from source? Thanks! On Fri, Aug 12, 2011 at 12:18 AM, aaron morton wrote: > In a non dev system it's a lot easier to use the packages > http://wiki.apache.org/cassandra/DebianPackaging > http://www.datastax.com/docs/0.8/install/p

Scalability question

2011-08-14 Thread Philippe
Hi, As on-disk SSTables become bigger and bigger because more data is added in the ring, compactions take longer and longer because each file is becoming bigger. Isn't there a time where compacting will take so long that compaction just can't keep up with the amount of data ? It looks to me like t

Re: Unable to repair a node

2011-08-14 Thread Philippe
No it depends on the consistency level. It's different : for example, QUORUM = 2 for RF=3 Anyway, anyone have an answer to my real issue ? Thanks 2011/8/14 Stephen Connolly > oh i know you can run rf 3 on a 3 node cluster. more i thought that if you > have one fail you have less nodes than the

Merged counter shard with a count != 0

2011-08-14 Thread Philippe
Hi I'm getting the following at startup on one of the nodes on my 3 node cluster with RF=3. I have 6 keyspaces each with 10 column families that contain supercolumns that contain only counter columns. Looking at http://www.datastax.com/dev/blog/whats-new-in-cassandra-0-8-part-2-countersI see that

Re: Unable to repair a node

2011-08-14 Thread Stephen Connolly
oh i know you can run rf 3 on a 3 node cluster. more i thought that if you have one fail you have less nodes than the rf, so the cluster is at less than rf, and writes might be disabled or something like that, while at 4 you still have met the rf... - Stephen --- Sent from my Android phone, so ra

node restart taking too long

2011-08-14 Thread Yan Chunlu
I got 3 nodes and RF=3, when I repairing ndoe3, it seems alot data generated. and server can not afford the load then crashed. after come back, node 3 can not return for more than 96 hours for 34GB data, the node 2 could restart and back online within 1 hour. I am not sure what's wrong with node

Re: Unable to repair a node

2011-08-14 Thread Philippe
5 hours later, the number of pending compactions host up to 8k as usual, the number of SST tables for another keyspace shot up to 160 (from 4). At 4pm, a daily cron job that runs repair starts on that same node and all of a sudden, the number of pending compactions went down to 4k and to number of

Re: CQL: No indexed column error when < or <= in WHERE clause

2011-08-14 Thread Jens Hartung
Thanks for all the explanations. It's odd, that pure range queries are not supported at this time. But I think I'll find a solution for my problem. Greetings Original-Nachricht > Datum: Sun, 14 Aug 2011 07:09:55 -0700 > Von: Jason Rutherglen > An: user@cassandra.apache.org >

Re: Unable to repair a node

2011-08-14 Thread Peter Schuller
> i am always wondering why people run clusters with number of nodes == rf > > i thought you needed to have number of nodes > rf ti gave any sensible > behaviour... but i am no expert at all No. The only requirement is that the number of nodes be >= RF, since clearly in a cluster with fewer nodes

Re: thrift c++ insert Exception [Column value is required]

2011-08-14 Thread Jonathan Ellis
https://svn.apache.org/repos/asf/cassandra/branches/cassandra-0.8/interface/cassandra.thrift On Sun, Aug 14, 2011 at 1:59 AM, Konstantinos Chasapis wrote: > Hi, > Thank you for your answer. Is there any documentation that describes all > this values that I have to set? > > Konstantinos Chasapis >

Re: Unable to repair a node

2011-08-14 Thread Stephen Connolly
i am always wondering why people run clusters with number of nodes == rf i thought you needed to have number of nodes > rf ti gave any sensible behaviour... but i am no expert at all - Stephen --- Sent from my Android phone, so random spelling mistakes, random nonsense words and other nonsense a

Re: CQL: No indexed column error when < or <= in WHERE clause

2011-08-14 Thread Jason Rutherglen
This will be fully solved with CASSANDRA-2915 which will use Lucene as a the secondary index type implementation. Lucene has extremely fast range queries built in. On Sun, Aug 14, 2011 at 6:38 AM, Sal Fuentes wrote: > The important piece that is mentioned in Jonathan's link is this: > "One conse

Re: CQL: No indexed column error when < or <= in WHERE clause

2011-08-14 Thread Sal Fuentes
The important piece that is mentioned in Jonathan's link is this: "One consequence of the KEYS index type being more like a hash index than a btree is shown here: even though birth_date is indexed, Cassandra couldn’t perform the range query “> 1970″ against it." hash index vs a btree index. Basic

Re: CQL: No indexed column error when < or <= in WHERE clause

2011-08-14 Thread Martin von Zweigbergk
Hi Jens, I have never used CQL myself and I have barely used Cassandra, but I think I've seen it mentioned before on this list that you need to use compare for equality on at least one column (as indicated by "No indexed columns present in by-columns clause with "equals" operator"). The lookup wil

Re: Chronologic

2011-08-14 Thread Sal Fuentes
Looks like Chronologic is out now. Much thanks to the folks at Gowalla for putting this out. https://github.com/gowalla/chronologic http://speakerdeck.com/u/therealadam/p/chronologic-you-put-your-feeds-in-it http://lanyrd.com/2011/lone-star-ruby-conference/sgptd/ On Mon, Aug 8, 2011 at 1:30 PM,

Re: CQL: No indexed column error when < or <= in WHERE clause

2011-08-14 Thread Jens Hartung
I had indexed the number column in station column family. Do I also have to index another column? What I'm wondering, when I type "get station where number = 8210;" all works fine, but when I type "get station where number < 8210;" I'll get mentioned exception. Is there something, that I misun

Unable to repair a node

2011-08-14 Thread Philippe
Hello, I've been fighting with my cluster for a couple days now... Running 0.8.1.3, using Hector and loadblancing requests across all nodes. My question is : how do I get my node back under control so that it runs like the other two nodes. It's a 3 node, RF=3 cluster with reads & writes at LC=QUO

Re: thrift c++ insert Exception [Column value is required]

2011-08-14 Thread Konstantinos Chasapis
Hi, Thank you for your answer. Is there any documentation that describes all this values that I have to set? Konstantinos Chasapis On Aug 14, 2011, at 6:28 AM, Jonathan Ellis wrote: In C++ you need to set .__isset.fieldname on optional fields (e.g. .__isset.value). 2011/8/13 Hassapis Const