Cassandra 0.8 & schematool

2011-08-29 Thread Jenny
Hi I notice that schematool was removed from the release of Cassandra 0.8. I would like to know the reason of doing that and how i can load the schema from configure file(like .yaml) automatically. Best Regards! Jenny

Re: Updates lost

2011-08-29 Thread Tyler Hobbs
On Mon, Aug 29, 2011 at 4:56 PM, Peter Schuller wrote: > > If the client sleeps for a few ms at each loop, the success rate > > increases. At 15 ms, the script always succeeds so far. Interestingly, > > the problem seems to be sensitive to alphabetical order. Updating the > > value from 'aaa' to

Re: cassandra auto create snapshots?

2011-08-29 Thread Yan Chunlu
thanks for the help. have you tried use those snapshot to recover a node? I have not found anything related to those auto-created snapshots in the wiki page, then dont even have a timestamp, not sure how to use those files On Tue, Aug 30, 2011 at 10:27 AM, Jonathan Ellis wrote: > No. > >

Re: Disk usage for CommitLog

2011-08-29 Thread Derek Andree
Thanks Dan, good info. > First off, what version of Cassandra are you using? Sorry my bad, 0.8.4 > Provided you are using a recent Cassandra version (late 0.7 or 0.8.x) I doubt > the commit log is your problem. My experience using Cassandra as a time > series data store (with a full 30 days of

Re: Cassandra 0.8 CLI: Inconsistent treatment of literals for keys/columns and values

2011-08-29 Thread Jonathan Ellis
I answered this earlier today, so I added it to the FAQ: http://wiki.apache.org/cassandra/FAQ#cli_keys On Mon, Aug 29, 2011 at 9:31 PM, Oleg Proudnikov wrote: > Hi, > > After installing Cassandra 0.8 I discovered that my app stopped working. The > issue is that the app is now unable to read a row

Cassandra 0.8 CLI: Inconsistent treatment of literals for keys/columns and values

2011-08-29 Thread Oleg Proudnikov
Hi, After installing Cassandra 0.8 I discovered that my app stopped working. The issue is that the app is now unable to read a row that was inserted by a CLI set command with a numeric string key. CLI in Cassandra 0.8 seems to be treating literals inconsistently. Please let me know if I am missin

Re: cassandra auto create snapshots?

2011-08-29 Thread Jonathan Ellis
No. On Mon, Aug 29, 2011 at 8:15 PM, Yan Chunlu wrote: > so it was useless?   I didn't drop any CF/KS,  could "nodetool move", > "nodetool repair" cause the problem? > > On Tue, Aug 30, 2011 at 5:23 AM, Jonathan Ellis wrote: >> >> Perhaps you are seeing auto-snapshots before destructive events s

Re: Disk usage for CommitLog

2011-08-29 Thread Jonathan Ellis
On Mon, Aug 29, 2011 at 8:04 PM, Derek Andree wrote: > Is there an easy way to keep commit log size down without killing performance? There is in 1.0: commitlog_total_space_in_mb, added in https://issues.apache.org/jira/browse/CASSANDRA-2427. Until then you'll have to make sure that you're flush

Re: Disk usage for CommitLog

2011-08-29 Thread Dan Hendry
First off, what version of Cassandra are you using? > We've noticed that when we restart cassandra disk utilization decreases dramatically Presumably you mean 'utilization' as in free space. Specifically on a restart, this type of behavior is likely due to Cassandra deleting compacted SSTables. C

Re: cassandra auto create snapshots?

2011-08-29 Thread Yan Chunlu
so it was useless? I didn't drop any CF/KS, could "nodetool move", "nodetool repair" cause the problem? On Tue, Aug 30, 2011 at 5:23 AM, Jonathan Ellis wrote: > Perhaps you are seeing auto-snapshots before destructive events such > as truncate or drop CF/KS. > > On Mon, Aug 29, 2011 at 4:19 P

Disk usage for CommitLog

2011-08-29 Thread Derek Andree
I run a single node cassandra instance, and we have lots of overwrites on a hot CF and disk utilization seems to grow pretty fast. We've noticed that when we restart cassandra disk utilization decreases dramatically (dramatic being something close to 50%). Most of this growth seems to be in th

Re: Updates lost

2011-08-29 Thread Peter Schuller
> If the client sleeps for a few ms at each loop, the success rate > increases. At 15 ms, the script always succeeds so far. Interestingly, > the problem seems to be sensitive to alphabetical order. Updating the > value from 'aaa' to 'bbb' never has problem. No pause needed. Is it possible the ver

Re: Querying a composite key with cassandra-cli

2011-08-29 Thread Anthony Ikeda
Yeah I figured out what happened. I inadvertently set the keys to utf8 on the column family: assume ColumnFamily keys as utf8; which broke whichever default mechanism was in place to perform the colon separated values for the Composite key. When I restarted cassandra-cli, the query worked again.

Re: cassandra auto create snapshots?

2011-08-29 Thread Jonathan Ellis
Perhaps you are seeing auto-snapshots before destructive events such as truncate or drop CF/KS. On Mon, Aug 29, 2011 at 4:19 PM, Yan Chunlu wrote: > just found the data dir consume a lot of space, which is because there was > many snapshots in it. > but I have set snapshot_before_compaction: fals

Re: Querying a composite key with cassandra-cli

2011-08-29 Thread Jonathan Ellis
Sounds like you need to specify a key_validation_class so the cli knows how to encode the key. (It used to assume keys were always ascii.) On Mon, Aug 29, 2011 at 4:02 PM, Anthony Ikeda wrote: > Is there currently a way to query a composite key in cassandra-cli? > I was sure I used to be able to

cassandra auto create snapshots?

2011-08-29 Thread Yan Chunlu
just found the data dir consume a lot of space, which is because there was many snapshots in it. but I have set snapshot_before_compaction: false. is that possible that cassandra create those snapshot automatically? could I delete them? the dir names is strange(normally it should contain date i

Updates lost

2011-08-29 Thread Jiang Chen
Hi, Just started developing using Cassandra (0.8.4). I noticed when updating the same row and column repeatedly, say, in a test case, updates may get lost. I found it in a Java client but the following python script also exhibits the same problem. *

Querying a composite key with cassandra-cli

2011-08-29 Thread Anthony Ikeda
Is there currently a way to query a composite key in cassandra-cli? I was sure I used to be able to call: get ColumnFamily['comp1:comp2'] But this has recently stopped working. Anthony

Recommendations on moving to Hadoop/Hive with Cassandra + RDBMS

2011-08-29 Thread Tharindu Mathew
Hi, I have an already running system where I define a simple data flow (using a simple custom data flow language) and configure jobs to run against stored data. I use quartz to schedule and run these jobs and the data exists on various data stores (mainly Cassandra but some data exists in RDBMS li

Re: The way to query a CF with "start > 10 and end < 100"

2011-08-29 Thread Benoit Perroud
queries start > 10 and end < 100 is not straight forward to modelize, you should use the value of start as column name, and check on client side the second condition. Just for comparison, modeling 10 < value < 100 is rather much easier if you set your values as column name, or using CompositeType

Re: hw requirements

2011-08-29 Thread Richard Low
Hi, The hardware you choose depends a bit on your workload - writes vs reads, amount of cacheable data, latency requirements, etc.. What sort of workload do you expect? See http://wiki.apache.org/cassandra/CassandraHardware for some general advice. People typically have 8-24 GB RAM per node wit

can't keep node running.

2011-08-29 Thread Ernst D Schoen-René
Hi, I've written about this before, but I can't find the original thread. I have one node on my 16 node cassandra cluster (running 8.1) that will not stay running. All nodes were configured the same way, but this one kept going down, so I changed these lines in my cassandra-env.sh: JVM_OPT

Matt Dennis' presentation on Cassandra best practices on EC2

2011-08-29 Thread Jeremy Hanna
Just wanted to let people know about a great presentation that Matt Dennis did here at the Cassandra Austin meetup. It's on Cassandra best practices on EC2. We found the presentation extremely helpful. http://www.slideshare.net/mattdennis/cassandra-on-ec2

AW: flushwriter all time blocked

2011-08-29 Thread Roland Gude
Hi, This still leaves me puzzled. Is it a bad thing? Why is it happening? And what does "blocked before being accepted" mean? Does it mean Cassandra did not even try to put the task into a queue? Thanks for enlightening me, roland -Ursprüngliche Nachricht- Von: Jonathan Ellis [mail

Suggest to change Thrift status string

2011-08-29 Thread Rodrigo Ferreira
Hello, We are putting on some cassandra servers and using a loadbalancer to an app connecting at the cluster. My suggest is to change string that are returned from thrift status, because some loadbalancers have a false positive status on we are expecting "running" on healthcheck. I suggest to

hw requirements

2011-08-29 Thread Helder Oliveira
Hello guys, What is the type of profile of a cassandra server. Are SSD an option ? Does cassandra needs better CPU ou lots of memory ? Are SATA II disks ok ? I am making some tests, and i started evaluating the possible hardware. If someone already has conclusions about it, please share :D Than

Re: flushwriter all time blocked

2011-08-29 Thread Jonathan Ellis
the javadoc for the mbeans explains: /** * Get the number of tasks that had blocked before being accepted (or * rejected). */ public int getTotalBlockedTasks(); /** * Get the number of tasks currently blocked, waiting to be accepted by * the executor (because

Re:

2011-08-29 Thread Stanislav Vodetskyi
Thanks On Mon, Aug 29, 2011 at 2:38 PM, Joris van der Wel wrote: > send a message to > > On Mon, Aug 29, 2011 at 1:23 PM, Stanislav Vodetskyi > wrote: > > unsubscribe >

Re: inconsistent counter value?

2011-08-29 Thread Sylvain Lebresne
On Sat, Aug 13, 2011 at 9:38 AM, Boris Yen wrote: > I posted a comment for Cassandra-3006 after 0.8.4 is released, but it seems > not be noticed there, so I re-post here, wondering if anyone could help. > --- > Follow the same steps posted on Cassandra-3006, after step

Re:

2011-08-29 Thread Joris van der Wel
send a message to On Mon, Aug 29, 2011 at 1:23 PM, Stanislav Vodetskyi wrote: > unsubscribe

[no subject]

2011-08-29 Thread Stanislav Vodetskyi
unsubscribe

Re: Scaling Out / Replication Factor too?

2011-08-29 Thread Boris Yen
I am not sure, but I think the problem might be "order preserving partitioners" you used. When using "order preserving partitioners" data might be skewed meaning most data only stay in a few servers, so that might create a few heavy load servers. On Mon, Aug 29, 2011 at 7:24 AM, Ryan Lowe wrote:

unsubscribe

2011-08-29 Thread Stanislav Vodetskyi

The way to query a CF with "start > 10 and end < 100"

2011-08-29 Thread Guofeng Zhang
Hi, I have a standard CF that has column "start" and "end". I need to query its rows using condition "start>10 and end<100". Is there any better way to do it? Using native secondary index or creating a specific CF for the search. I do not know which one is better. If the late is preferred to, h

Re: Using 5-6 bytes for cassandra timestamps vs 8…

2011-08-29 Thread Terje Marthinussen
I have a patch for trunk which I just have to get time to test a bit before I submit. It is for super columns and will use the super columns timestamp as the base and only store variant encoded offsets in the underlying columns. If the timestamp equals that of the SC, it will store nothing (ju

flushwriter all time blocked

2011-08-29 Thread Roland Gude
Hi all, On a 0.7.8 cluster In tpstats i can see flushwriter stage having several tasks in state all-time-blocked (immendiatly after node restart its 8 but grows over time to around 300). What does it mean (or how can I find out) and what can I do about it? -- YOOCHOOSE GmbH Roland Gude Softwa

Re: Using 5-6 bytes for cassandra timestamps vs 8…

2011-08-29 Thread Radim Kolar
> Using 4 bytes and 100ms resolution your can fit in 13 years of timestamps if you use the time you deploy the cassandra DB (aka 'now') as epoch. In our app we will be fine with this. 100ms is good enough. we can probably do some garbage collection on timestamps like we do on deletes. If times

Re: What are the things to watch out for with big nodes

2011-08-29 Thread Peter Schuller
> If I have a cluster with 15-20T nodes, somethings that I know will be a > potential problem are > > Compactions taking longer > Higher read latencies > Long time for adding/removing nodes > > What are other things that can be problematic with big nodes? http://wiki.apache.org/cassandra/LargeData