4/20 nodes get disproportionate amount of mutations

2011-08-22 Thread Jeremy Hanna
We've been having issues where as soon as we start doing heavy writes (via hadoop) recently, it really hammers 4 nodes out of 20. We're using random partitioner and we've set the initial tokens for our 20 nodes according to the general spacing formula, except for a few token offsets as we've re

Re: get mycf['rowkey']['column_name'] return 'Value was not found' in cassandra-cli

2011-08-22 Thread Yan Chunlu
thanks a lot! On Mon, Aug 22, 2011 at 10:14 PM, Edward Capriolo wrote: > > > On Mon, Aug 22, 2011 at 1:08 AM, Yan Chunlu wrote: > >> connect to cassandra-cli and issue the list my cf I got >> >> RowKey: comments_62559 >> => (column=76616c7565, >> value=28286c70310a4c3236373632334c0a614c323637373

RE: Completely removing a node from the cluster

2011-08-22 Thread Bryce Godfrey
Could this ghost node be causing my hints column family to grow to this size? I also crash after about 24 hours due to commit logs growth taking up all the drive space. A manual nodetool flush keeps it under control though. Column Family: HintsColumnFamily SSTa

Re: Recover from startup problems

2011-08-22 Thread Jonathan Ellis
Oh, right: my plan didn't work, because schema changes aren't really commitlog-ified. (So, commitlog replayed, but the schema didn't get recreated to match its earlier state.) Can you reproduce the old-fashioned way by sending the create + insert commands, after nuking things? On Mon, Aug 22, 20

Re: Recover from startup problems

2011-08-22 Thread Dave Brosius
0.8.4 seems to fail to load in the same way. When i deleted the data directory, it appears to start up correctly, I see INFO 23:30:44,067 JNA not found. Native methods will be disabled. INFO 23:30:44,100 Loading settings from file:/home/dave/apache-cassandra-0.8.4/conf/cassandra.yaml INFO 23:

Re: Recover from startup problems

2011-08-22 Thread Jonathan Ellis
It's erroring out trying to load the schema itself, though, which isn't supposed to happen. On Mon, Aug 22, 2011 at 10:14 PM, Dave Brosius wrote: > Sure i'll try that, but I'm pretty sure it was creating a column family > without any column meta data (types), then, client.insert'ing a ByteBuffer

Re: Recover from startup problems

2011-08-22 Thread Dave Brosius
Sure i'll try that, but I'm pretty sure it was creating a column family without any column meta data (types), then, client.insert'ing a ByteBuffer that wasn't based on bytes from a String.getBytes call. On 08/22/2011 11:09 AM, Jonathan Ellis wrote: Yes, you can blow away both the data and co

Re: CQL count(*) counts deleted records?

2011-08-22 Thread Jonathan Ellis
Yes. On Mon, Aug 22, 2011 at 6:58 PM, Hefeng Yuan wrote: > We're using 0.8.1, and found the CQL count(*) is counting deleted columns, > CLI doesn't have this issue. Is it a bug? > > cqlsh> select * from CF where key = 'foo'; > u'foo' > > cqlsh> select count(*) from CF where key = 'foo'; > (3,)

CQL count(*) counts deleted records?

2011-08-22 Thread Hefeng Yuan
We're using 0.8.1, and found the CQL count(*) is counting deleted columns, CLI doesn't have this issue. Is it a bug? cqlsh> select * from CF where key = 'foo'; u'foo' cqlsh> select count(*) from CF where key = 'foo'; (3,)

Re: Avoid Simultaneous Minor Compactions?

2011-08-22 Thread Hefeng Yuan
Shall I lower this or increase it? Or probably ask in this way, do we suggest to let it run longer while using less CPU, or should we let it finish faster with more CPU usage? The problem we're facing is, with the default setting, they run slow and also eat a lot of CPU in the meanwhile. I'm th

run Cassandra tutorial example

2011-08-22 Thread Alvin UW
Hello, I'd like to try the cassandra tutorial example: https://github.com/zznate/cassandra-tutorial by following the readme. After typing mvn -e exec:java -Dexec.args="get" -Dexec.mainClass="com.datastax.tutorial.TutorialRunner" I got the following errors. Should I do something before the above

Re: unsubscribe

2011-08-22 Thread aaron morton
http://wiki.apache.org/cassandra/FAQ#unsubscribe How do I unsubscribe from the email list? Send an email to user-unsubscr...@cassandra.apache.org - Aaron Morton Freelance Cassandra Developer @aaronmorton http://www.thelastpickle.com On 23/08/2011, at 3:18 AM, Rodrigo Hjort wrote

Re: help creating data model

2011-08-22 Thread aaron morton
Lets start with something quick and simple, all standard Column Families… Visitant CF key: id column name: property name column value: property value Visitant Sessions CF key: visitant id column name: session id column value: none Session CF key: session_id column_name: property value colum

Re: Cluster key distribution wrong after upgrading to 0.8.4

2011-08-22 Thread aaron morton
Not sure why it's different for the nodes at the end of the ring. But I'm going to assume Quorum is working as expected and it's an artifact of the way the ring ownership is calculated. Cheers - Aaron Morton Freelance Cassandra Developer @aaronmorton http://www.thelastpickle.co

Re: nodetool repair caused high disk space usage

2011-08-22 Thread Huy Le
After having done so many tries, I am not sure which log entries correspond to what. However, there were many of this type: WARN [CompactionExecutor:14] 2011-08-18 18:47:00,596 CompactionManager.java (line 730) Index file contained a different key or row size; using key from data file And ther

Re: Occasionally getting old data back with ConsistencyLevel.ALL

2011-08-22 Thread Kyle Gibson
Thanks for the reply. On Mon, Aug 22, 2011 at 1:11 PM, Dominic Williams wrote: > Hi there, here's my tuppence! > 1. Something to look at first: > > If you write two different values to the same column quickly in succession, > if both writes go out with the same timestamp, then it is indeterminate

Re: Occasionally getting old data back with ConsistencyLevel.ALL

2011-08-22 Thread Dominic Williams
Hi there, here's my tuppence! 1. Something to look at first: If you write two different values to the same column quickly in succession, if both writes go out with the same timestamp, then it is indeterminate which one wins i.e. write order doesn't necessarily matter. 2. Don't trust PayPal (anyo

Re: Occasionally getting old data back with ConsistencyLevel.ALL

2011-08-22 Thread Kyle Gibson
I made some changes to my code base that uses cassandra. I went back to using the "processed" column, but instead of using "0" or "1" I decided to use "no" and "yes" You can view the code here: http://pastebin.com/gRBC16e7 As you can see from the code, I perform an insert, get, check the result,

unsubscribe

2011-08-22 Thread Rodrigo Hjort
unsubscribe

Re: Avoid Simultaneous Minor Compactions?

2011-08-22 Thread Jonathan Ellis
Specifically, look at compaction_throughput_mb_per_sec in cassandra.yaml On Mon, Aug 22, 2011 at 12:39 AM, Ryan King wrote: > You should throttle your compactions to a sustainable level. > > -ryan > > On Sun, Aug 21, 2011 at 10:22 PM, Hefeng Yuan wrote: >> We just noticed that at one time, 4 nod

Re: Recover from startup problems

2011-08-22 Thread Jonathan Ellis
Yes, you can blow away both the data and commitlog directories and restart, but can you try these first to troubleshoot? 1. make a copy of the commitlog directory 2. downgrade to 0.8 with no other changes, to see if it's something on the new read path 2a. if 0.8 starts up then we will fix the read

Re: get mycf['rowkey']['column_name'] return 'Value was not found' in cassandra-cli

2011-08-22 Thread Edward Capriolo
On Mon, Aug 22, 2011 at 1:08 AM, Yan Chunlu wrote: > connect to cassandra-cli and issue the list my cf I got > > RowKey: comments_62559 > => (column=76616c7565, > value=28286c70310a4c3236373632334c0a614c3236373733304c0a614c3236373737304c0a614c3236373932324c0a614c3236373934364c0a614c3236383137314c

Re: Cassandra Cluster Admin - phpMyAdmin for Cassandra

2011-08-22 Thread André Fiedler
Hey, that´s awesome great! Keep on that work! I love it! :o) 2011/8/22 SebWajam > And thank you for your feedback! :) > > -- > View this message in context: > http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/Cassandra-Cluster-Admin-phpMyAdmin-for-Cassandra-tp6709930p6711556.html

Re: Cassandra Cluster Admin - phpMyAdmin for Cassandra

2011-08-22 Thread SebWajam
And thank you for your feedback! :) -- View this message in context: http://cassandra-user-incubator-apache-org.3065146.n2.nabble.com/Cassandra-Cluster-Admin-phpMyAdmin-for-Cassandra-tp6709930p6711556.html Sent from the cassandra-u...@incubator.apache.org mailing list archive at Nabble.com.

help creating data model

2011-08-22 Thread Helder Oliveira
Hello all, i have a SQL structure like this: Visitant ( has several properties ) Visitant has many Sessions Sessions ( has several properties ) Sessions has many Requests ( has several properties ) Sessions has many Events ( has several properties ) i have read a lot and still confused how to p

Re: Cluster key distribution wrong after upgrading to 0.8.4

2011-08-22 Thread Thibaut Britz
Hi, Thanks for explaining: As I understood each node now only displays it's local view of the the data it cotains, and not the global view anymore. One more question: Why do the nodes at the end of the ring only show the % load from 2 nodes and not from 3? We are always writing with quorum, so th

Re: get mycf['rowkey']['column_name'] return 'Value was not found' in cassandra-cli

2011-08-22 Thread Yan Chunlu
the cassandra-cli version I am using is shipped with the cassandra 0.7.4 package; but I could get results by the column name "14np_20nl": get mycf2[14np][14np_20nl]; On Mon, Aug 22, 2011 at 1:20 PM, Jonathan Ellis wrote: > My guess: you're using an old version of the cli that isn't dealing >