1.1 not removing commit log files?

2012-05-21 Thread Bryce Godfrey
The commit log drives on my nodes keep slowly filling up. I don't see any errors in my logs that I can map to this issue. Is this how 1.1 is supposed to work now? Previous versions seemed to keep usage on this drive to a minimum as they flushed. /dev/mapper/mpathf 25G

endless hinted handoff with 1.1

2012-05-21 Thread Arend-Jan Wijtzes
Hi, We are running a small test cluster and recently installed Cassandra 1.1 and started with a new clean database. We keep seeing these messages in the log on just one of our nodes: INFO [HintedHandoff:1] 2012-05-21 09:49:56,757 HintedHandOffManager.java (line 294) Started hinted handoff for t

RE: 1.1 not removing commit log files?

2012-05-21 Thread Pieter Callewaert
Hi, In 1.1 the commitlog files are pre-allocated in segments of 128MB. (https://issues.apache.org/jira/browse/CASSANDRA-3411) However, the total should not exceed the commitlog size set in cassandra.yaml: commitlog_total_space_in_mb: 4096 Kind regards, Pieter Callewaert From: Bryce Godfrey [mailto:bryc
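
A minimal sketch, in Python, of how one might check whether a commit log directory has grown past the configured cap. Only the setting name `commitlog_total_space_in_mb` and the value 4096 come from the thread; the helper names and the idea of summing file sizes on disk are assumptions made for illustration, not something Cassandra itself provides.

```python
import os


def commitlog_usage_mb(directory):
    """Return the total size, in MB, of the regular files directly under directory."""
    total_bytes = 0
    for entry in os.scandir(directory):
        if entry.is_file():
            total_bytes += entry.stat().st_size
    return total_bytes / (1024 * 1024)


def exceeds_cap(directory, cap_mb=4096):
    """Return True when the directory holds more data than cap_mb.

    cap_mb mirrors commitlog_total_space_in_mb from the thread; pass the
    value that is actually set (and uncommented) in cassandra.yaml.
    """
    return commitlog_usage_mb(directory) > cap_mb
```

One would call `exceeds_cap(path)` with the node's own commit log directory. Because segments are pre-allocated at 128MB each, a directory that looks large right after startup is not necessarily a sign of a problem; sustained growth past the cap is.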

Re: unable to nodetool to remote EC2

2012-05-21 Thread Tamar Fraenkel
Hi! I am trying the tunnel and it fails. Will be grateful for some hints: I defined - proxy_host = ubuntu@my_ec2_cassandra_node_public_ip - proxy_port = 22 I do: *ssh -N -f -i /c/Users/tamar/.ssh/Amazon/tokey.openssh -D22 ubuntu@my_ec2_cassandra_node_public_ip* I put some debug prints an

Re: 1.1 not removing commit log files?

2012-05-21 Thread Alain RODRIGUEZ
commitlog_total_space_in_mb: 4096 By default this line is commented out in 1.0.x, if I remember correctly. I guess it is the same in 1.1. You really should uncomment it, or your commit logs will entirely fill up your disk, as happened to me a while ago. Alain 2012/5/21 Pieter Callewaert : > Hi, >

Re: unsubscribe

2012-05-21 Thread Dave Brosius
On 05/21/2012 02:44 AM, Qingyan(Evan) Liu wrote: send to user-unsubscr...@cassandra.apache.org

Re: Tuning cassandra (compactions overall)

2012-05-21 Thread Alain RODRIGUEZ
Hi Aaron. I wanted to try the new config. After doing a rolling restart, all my counters are off, with wrong values. I stopped my servers with the following: nodetool -h localhost disablegossip nodetool -h localhost disablethrift nodetool -h localhost drain kill cassandra sigterm (15) via ht

Wrong data after rolling restart

2012-05-21 Thread Alain RODRIGUEZ
Hi, I am reposting this here because it is a new subject, far from my initial tuning questions. I wanted to try a new config. After doing a rolling restart, all my counters are off, with wrong values. I stopped my servers with the following: nodetool -h localhost disablegossip nodetool -h loca
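
The shutdown sequence quoted above can be sketched in Python as follows. The three `nodetool` subcommands and their order are taken from the message; the function names, the `runner` parameter, and the choice of `subprocess.check_call` are assumptions made for illustration. The final SIGTERM step is left out because the original message is cut off at that point.

```python
import subprocess

# Order matters: stop gossip first, then client traffic, then flush and drain.
SHUTDOWN_STEPS = ("disablegossip", "disablethrift", "drain")


def shutdown_commands(host="localhost"):
    """Build the nodetool command lines for a clean pre-restart shutdown."""
    return [["nodetool", "-h", host, step] for step in SHUTDOWN_STEPS]


def run_shutdown(host="localhost", runner=subprocess.check_call):
    """Run each step in order; check_call raises if a step exits non-zero."""
    for command in shutdown_commands(host):
        runner(command)
```

Passing a different `runner` (for example `print`) gives a dry run, which may be useful before applying the sequence to each node of a rolling restart.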

Re: Wrong data after rolling restart

2012-05-21 Thread Alain RODRIGUEZ
Here are the startup logs of my 2 nodes, I hope they can help... https://gist.github.com/2762493 https://gist.github.com/2762495 Alain 2012/5/21 Alain RODRIGUEZ : > Hi, I re-post this here because it's a new subject far away from my > initial tuning questions. > > I wanted to try a new config. After doin

Ordering counters in Cassandra

2012-05-21 Thread Filippo Diotalevi
Hi, I'm trying to understand what the best design is for a simple "ranking" use case. I have, in a row, a good number (10k - a few 100K) of counters; each one counts the occurrences of an event. At the end of the day, I want to create a ranking of the most frequent events. What's the best appro

RE Ordering counters in Cassandra

2012-05-21 Thread Romain HARDOUIN
If I understand correctly, you've got a data model which looks like this: CF Events: "row1": { "event1": 1050, "event2": 1200, "event3": 830, ... } You can't query on column values, but you can build a ranking every day in a dedicated CF by iterating over events: create column family Ranking with c
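
The "iterate over events and build a ranking" step can be sketched client-side in Python. The event names and counts are the ones shown in the message; `build_ranking` and its `top_n` parameter are hypothetical, and the sketch assumes the counter row has already been read into a dictionary rather than showing any Cassandra client call.

```python
def build_ranking(counters, top_n=None):
    """Return (event, count) pairs ordered by count, highest first.

    Ties are broken by event name so the result is deterministic.
    """
    ranked = sorted(counters.items(), key=lambda item: (-item[1], item[0]))
    return ranked if top_n is None else ranked[:top_n]


# The row from the example data model above.
row1 = {"event1": 1050, "event2": 1200, "event3": 830}
daily_ranking = build_ranking(row1)
# daily_ranking == [("event2", 1200), ("event1", 1050), ("event3", 830)]
```

Each pair in the result would then be written as one column of the dedicated ranking column family, so that reading the ranking later is a single ordered slice.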

Re: Counters and replication factor

2012-05-21 Thread Radim Kolar
On 26.3.2012 19:17, aaron morton wrote: Can you describe the situations where counter updates are lost or go backwards ? Do you ever get TimedOutExceptions when performing counter updates ? We get a few timeouts per day, but not many, fewer than 10. I do not think that timeouts will be root c

Re: RE Ordering counters in Cassandra

2012-05-21 Thread Filippo Diotalevi
Hi Romain, thanks for your suggestion. When you say "build every day a ranking in a dedicated CF by iterating over events:" do you mean - load all the columns for the specified row key - iterate over each column, and write a new column in the inverted index ? That's my current approach, but s

Re: RE Ordering counters in Cassandra

2012-05-21 Thread Tamar Fraenkel
I also had a similar problem. I have a temporary solution, which is not the best, but may be of help. I have the counter CF to count events, but apart from that I keep a leaders CF: leaders = { // key is time bucket // values are composites(rank, event) ordered by // descending order of the rank
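
A rough in-memory sketch, in Python, of the leaders layout described above: one row per time bucket, holding (rank, event) composites kept in descending rank order. The class and method names are hypothetical. The sketch also assumes that stale composites for an event are never deleted and are skipped at read time, and that counts only increase; neither assumption is spelled out in the message.

```python
import bisect
from collections import defaultdict


class Leaders:
    """Time-bucketed rows of (rank, event) composites, highest rank first."""

    def __init__(self):
        # bucket -> sorted list of (-count, event); negating the count makes
        # ascending tuple order equal to descending rank order.
        self._rows = defaultdict(list)

    def record(self, bucket, event, count):
        """Add a composite for the event's new count; older ones stay in place."""
        bisect.insort(self._rows[bucket], (-count, event))

    def top(self, bucket, n):
        """Return up to n (event, count) pairs, using each event's highest count."""
        seen = set()
        result = []
        for neg_count, event in self._rows[bucket]:
            if event in seen:
                continue  # stale composite left behind by an earlier update
            seen.add(event)
            result.append((event, -neg_count))
            if len(result) == n:
                break
        return result
```

Skipping stale entries on read avoids a delete racing with a concurrent write on the same counter, at the cost of rows that grow with every update; that trade-off is only reasonable while the time bucket rows stay small.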

Re: restoring from snapshot - missing data

2012-05-21 Thread Tyler Hobbs
On Mon, May 21, 2012 at 12:01 AM, Tamar Fraenkel wrote: > If I am putting the snapshots on a clean ring, I need to first create the > data model? Yes. -- Tyler Hobbs DataStax

Re: RE Ordering counters in Cassandra

2012-05-21 Thread Filippo Diotalevi
Hi Tamar, the solution you propose is indeed a "temporary solution", but it might be the best one. Which approach did you follow? I'm a bit concerned about the deletion approach, since in case of concurrent writes on the same counter you might "lo

Number of keyspaces

2012-05-21 Thread Luís Ferreira
Hi, Does the number of keyspaces affect the overall cassandra performance? Cumprimentos, Luís Ferreira

Re: RE Ordering counters in Cassandra

2012-05-21 Thread Tamar Fraenkel
Indeed, I took the no-delete approach. If time bucket rows are not that big, this is a good temporary solution. I just finished implementation and testing now on a small staging environment. So far so good. Tamar Sent from my iPod On May 21, 2012, at 9:11 PM, Filippo Diotalevi wrote: > Hi Tam

Re: restoring from snapshot - missing data

2012-05-21 Thread Tamar Fraenkel
Thanks. After creating the data model and matching the correct snapshot with the correct new node (same token) all worked fine! Tamar Fraenkel Senior Software Engineer, TOK Media ta...@tok-media.com Tel: +972 2 6409736 Mob: +972 54 8356490 Fax: +972 2 5612956

RE: 1.1 not removing commit log files?

2012-05-21 Thread Bryce Godfrey
Thanks, I'll give it a try. -Original Message- From: Alain RODRIGUEZ [mailto:arodr...@gmail.com] Sent: Monday, May 21, 2012 2:12 AM To: user@cassandra.apache.org Subject: Re: 1.1 not removing commit log files? commitlog_total_space_in_mb: 4096 By default this line is commented in 1.0.x

Re: unable to nodetool to remote EC2

2012-05-21 Thread ramesh
On 05/21/2012 03:55 AM, Tamar Fraenkel wrote: Hi! I am trying the tunnel and it fails. Will be grateful for some hints: I defined proxy_host = ubuntu@my_ec2_cassandra_node_public

Re: how can we get (a lot) more performance from cassandra

2012-05-21 Thread Yiming Sun
Hi Aaron, I don't know if you could elaborate a bit more on each of the points you suggested. Thanks. -- Y. On Sun, May 20, 2012 at 7:29 PM, aaron morton wrote: > I would look into the problems you are having with GC... > > The server log shows the GC ParNew frequently gets longer than 200ms,

Re: Couldn't detect any schema definitions in local storage - after handling schema disagreement according to FAQ

2012-05-21 Thread aaron morton
> 1) What did I wrong? - why cassandra was throwing exceptions on first startup? In 1.0.X the history of schema changes was replayed to the node when it rejoined the cluster. If the node is receiving traffic while this is going on it will log those errors until the schema mutation that created 10

Re: unable to nodetool to remote EC2

2012-05-21 Thread Tamar Fraenkel
Thanks for the response. But it still does not work. I am running the script from a git bash on my Windows 7. Adding some debug prints, this is what I am running: ssh -i key.pem -N -f -D8123 ubuntu@ec2-*.amazonaws.com ssh pid = 11616 /c/PROGRA~2/Java/jdk1.7.0_02/bin/jconsole.exe -J-DsocksProxyHost=l

Re: Number of keyspaces

2012-05-21 Thread R. Verlangen
Yes, it does. However, there's no real answer as to what the limit is: it depends on your hardware and cluster configuration. You might even want to search the archives of this mailing list; I remember this has been asked before. Cheers! 2012/5/21 Luís Ferreira > Hi, > > Does the number of keyspaces af