CORDA Connection with Cassandra

2011-07-28 Thread Nilabja Banerjee
Hi All, I am trying to connect Cassandra with Corda with following detals. *Driver: org.apache.cassandra.cql.jdbc.CassandraDriver URL: jdbc:cassandra:/@localhost:9160/nilabja* *nilabja: keyspace name* I am using no authentication. I am not able to connect itgetting thid following error

Replace multiple dead nodes at the same time. Is it possible?

2011-07-28 Thread Haryadi Gunawi
Hello, I want to run the procedure of Handling Failure option #1 in http://wiki.apache.org/cassandra/Operations (i.e. the (Recommended approach) Bring up the replacement node with a new IP address ...) Say, I have 10 dead nodes, and I want to replace it with 10 new nodes. My questions: Is it poss

Re: how to solve one node is in heavy load in unbalanced cluster

2011-07-28 Thread Yan Chunlu
and by the way, my RF=3 and the other two nodes have much more capacity, why does they always routed the request to node3? coud I do a rebalance now? before node repair? On Fri, Jul 29, 2011 at 12:01 PM, Yan Chunlu wrote: > add new nodes seems added more pressure to the cluster? how about you

Re: how to solve one node is in heavy load in unbalanced cluster

2011-07-28 Thread Yan Chunlu
add new nodes seems added more pressure to the cluster? how about your data size? On Fri, Jul 29, 2011 at 4:16 AM, Frank Duan wrote: > "Dropped read message" might be an indicator of capacity issue. We > experienced the similar issue with 0.7.6. > > We ended up adding two extra nodes and physi

Re: Cassandra 0.6.8 snapshot problem?

2011-07-28 Thread Zhu Han
On Thu, Jul 28, 2011 at 10:47 PM, Jian Fang wrote: > Hi, > > We have an old production Cassandra 0.6.8 instance without replica, i.e., > the replication factor is 1. Recently, we noticed that > the snapshot data we took from this instance are inconsistent with the > running instance data. For exam

Ordering by timestamp on optional field

2011-07-28 Thread Mike Gaffney
I have a rating system where users rate items. the item is the row key, the user is the column key. The value is the rating on a 5 point scale. People can only have one rating per item. I'm looking to add the ability to optionally add a text comment to the rating record. If the text comment is not

Re: memory_locking_policy parameter in cassandra.yaml for disabling swap - has this variable been renamed?

2011-07-28 Thread Teijo Holzer
Hi, yes I was looking for this config as well. This is really simple to achieve: Put the following line into /etc/security/limits.conf cassandra- memlock 32 Then, start Cassandra as the user cassandra, not as root (note there is never a need to run Cassandra as root, al

Re: Cassandra 0.6.8 snapshot problem?

2011-07-28 Thread Jonathan Ellis
Doesn't ring a bell. But I'd say if you upgrade and it's still a problem, then (a) you're not _worse_ off than you are now, and (b) it's a lot more likely to get fixed in modern version. On Thu, Jul 28, 2011 at 9:47 AM, Jian Fang wrote: > Hi, > > We have an old production Cassandra 0.6.8 instanc

Re: Questions over the use of CQL

2011-07-28 Thread Jonathan Ellis
So to summarize, yes, you can do that with CQL, but it's a little more of a pain if your comparator is BytesType since you'll need to convert to hex. On Thu, Jul 28, 2011 at 9:48 AM, Ikeda Anthony wrote: > Thanks Jonathan, > > I just had one of our devs playing around with it and he said he had p

Re: memory_locking_policy parameter in cassandra.yaml for disabling swap - has this variable been renamed?

2011-07-28 Thread Terje Marthinussen
Benchmarks was done with up to 96GB memory, much more caching than most people will ever have. The point anyway is that you are talking I/O in 10's or at best, a few hundred MB/sec before cassandra will eat all your CPU (with dual CPU 6 cores in our case). The memcopy involved here deep inside

Re: NotFoundException thrown for get(), but not get_slice() with a column_names predicate

2011-07-28 Thread David Allsopp
On 28 July 2011 16:23, Sylvain Lebresne wrote: > To be honest, collecting the names that were missing in the first name > query and doing a new name query for those (if there is any) is so simple > that I think it is a bit dishonest to say that "it pushes work to the > clients". > > It seems simp

Re: memory_locking_policy parameter in cassandra.yaml for disabling swap - has this variable been renamed?

2011-07-28 Thread Peter Schuller
> I would love to understand how people got to this conclusion however and try > to find out why we seem to see differences! I won't make any claims with Cassandra because I have never bothered benchmarking the different in CPU usage since all my use-cases have been more focused on I/O efficiency

The video from the Cassandra SF 2011 Data Modeling workshop is now online

2011-07-28 Thread Lynn Bender
Video: Matt Dennis' Data Modeling Workshop: http://www.datastax.com/2011/07/video-data-modeling-workshop-from-cassandra-sf-2011 A list of available videos are now available on the Cassandra SF 2011 presentation page: http://www.datastax.com/events/cassandrasf2011/presentations -- -Lynn Bender ht

Re: Cassandra timeout exception when works with hadoop

2011-07-28 Thread Jian Fang
I did not set the consistency level because I didn't find this option in the ConfigHelper class. I guess it should use level one by default. Actually, I only twisted the word count example a bit. Here is the code snippet, getConf().set(CONF_COLUMN_NAME, columnName); Job job = new

Re: memory_locking_policy parameter in cassandra.yaml for disabling swap - has this variable been renamed?

2011-07-28 Thread Jonathan Ellis
If you're actually hitting disk for most or even many of your reads then mmap doesn't matter since the extra copy to a Java buffer is negligible compared to the i/o itself (even on ssds). On Jul 28, 2011 9:04 AM, "Terje Marthinussen" wrote: > > On Jul 28, 2011, at 9:52 PM, Jonathan Ellis wrote: >

Re: how to solve one node is in heavy load in unbalanced cluster

2011-07-28 Thread Frank Duan
"Dropped read message" might be an indicator of capacity issue. We experienced the similar issue with 0.7.6. We ended up adding two extra nodes and physically rebooted the offending node(s). The entire cluster then calmed down. On Thu, Jul 28, 2011 at 2:24 PM, Yan Chunlu wrote: > I have three

Re: CQL Driver status

2011-07-28 Thread Jonathan Ellis
There is no cursor support. Moving away from thrift is still in the "hopefully someday" stage. On Jul 28, 2011 9:03 AM, "Nicholas Neuberger" wrote: > I had a few questions that I couldn't easily answer by looking through many > JIRA tickets and the wiki. We are currently developing an application

Re: Aggregation and Co-Processors

2011-07-28 Thread Ryan King
On Thu, Jul 28, 2011 at 12:08 PM, Stephen Pope wrote: > I just finished watching the video by Eric Evans on “CQL – Not just NoSQL. > It’s MoSQL”, and I heard mention of aggregation queries. He said there’s > been some talk about it, and that you guys were calling it “co-processors”. > Can somebody

Re: Cassandra timeout exception when works with hadoop

2011-07-28 Thread Jeremy Hanna
Just wondering - what consistency level are you using for hadoop reads? Also, do you have task trackers running on the cassandra nodes so that reads will be local? On Jul 28, 2011, at 2:46 PM, Jian Fang wrote: > I changed the rpc_timeout_in_ms to 3 and 4, then changed the > cassandra.

Re: Cassandra timeout exception when works with hadoop

2011-07-28 Thread Jian Fang
I changed the rpc_timeout_in_ms to 3 and 4, then changed the *cassandra.range.batch.size from 4096 to 1024, but still 40% tasks got timeout exceptions. * Not sure if this is caused by Cassandra speed performance (8G heap size for about 100G of data) or the way how the Cassandra-hadoop integ

Aggregation and Co-Processors

2011-07-28 Thread Stephen Pope
I just finished watching the video by Eric Evans on "CQL - Not just NoSQL. It's MoSQL", and I heard mention of aggregation queries. He said there's been some talk about it, and that you guys were calling it "co-processors". Can somebody give me the gist of what that's all about? I couldn't find

how to solve one node is in heavy load in unbalanced cluster

2011-07-28 Thread Yan Chunlu
I have three nodes and RF=3.here is the current ring: Address Status State Load Owns Token 84944475733633104818662955375549269696 node1 Up Normal 15.32 GB 81.09% 52773518586096316348543097376923124102 node2 Up Normal 22.51 GB 10.48% 70597222385644499881390884416714081360 node3 Up Normal 56.1 GB

Re: Cassandra timeout exception when works with hadoop

2011-07-28 Thread Jian Fang
My current setting is 1. I will try 3. Thanks, John On Thu, Jul 28, 2011 at 12:39 PM, Jeremy Hanna wrote: > See http://wiki.apache.org/cassandra/HadoopSupport#Troubleshooting - I > would probably start with setting your rpc_timeout_in_ms to something like > 3. > > On Jul 28, 2011, at

Re: Cassandra timeout exception when works with hadoop

2011-07-28 Thread Jeremy Hanna
See http://wiki.apache.org/cassandra/HadoopSupport#Troubleshooting - I would probably start with setting your rpc_timeout_in_ms to something like 3. On Jul 28, 2011, at 11:09 AM, Jian Fang wrote: > Hi, > > I run Cassandra 0.8.2 and hadoop 0.20.2 on three nodes, each node includes a > Cassa

Cassandra timeout exception when works with hadoop

2011-07-28 Thread Jian Fang
Hi, I run Cassandra 0.8.2 and hadoop 0.20.2 on three nodes, each node includes a Cassandra instance and a hadoop data node. I created a simple hadoop job to scan a Cassandra column value in a column family and write it to a file system if it meets some conditions. I keep getting the following time

Re: Changing the CLI, not a great idea!

2011-07-28 Thread Edward Capriolo
On Thu, Jul 28, 2011 at 10:59 AM, Sylvain Lebresne wrote: > On Thu, Jul 28, 2011 at 4:00 PM, Edward Capriolo > wrote: > > > > > > On Thu, Jul 28, 2011 at 9:35 AM, Jonathan Ellis > wrote: > >> > >> I'm talking about data compatibility, which is more important than cli > >> statement compatibility

Re: NotFoundException thrown for get(), but not get_slice() with a column_names predicate

2011-07-28 Thread Sylvain Lebresne
To be honest, collecting the names that were missing in the first name query and doing a new name query for those (if there is any) is so simple that I think it is a bit dishonest to say that "it pushes work to the clients". It seems simple enough at least that it does not sound like a good idea t

Re: Changing the CLI, not a great idea!

2011-07-28 Thread Sylvain Lebresne
On Thu, Jul 28, 2011 at 4:00 PM, Edward Capriolo wrote: > > > On Thu, Jul 28, 2011 at 9:35 AM, Jonathan Ellis wrote: >> >> I'm talking about data compatibility, which is more important than cli >> statement compatibility. >> >> Consider someone with a python program that creates a CF with the >>

Re: NotFoundException thrown for get(), but not get_slice() with a column_names predicate

2011-07-28 Thread David Allsopp
I understand and agree for the case where the slice predicate is a range, but I'd expect the semantics to be different where the predicate is a list of column names (even if it's implemented using a range operation under the hood?) If I ask for columns "foo" and "bar", then usually I'm not trying

Re: Questions over the use of CQL

2011-07-28 Thread Ikeda Anthony
Thanks Jonathan, I just had one of our devs playing around with it and he said he had problems with some of the column names of which we delimit using a dash (-) using the JDBC drivers e.g. SELECT m--hash(value) FROM column_family….. If this is not a problem then I have my questions answered.

Cassandra 0.6.8 snapshot problem?

2011-07-28 Thread Jian Fang
Hi, We have an old production Cassandra 0.6.8 instance without replica, i.e., the replication factor is 1. Recently, we noticed that the snapshot data we took from this instance are inconsistent with the running instance data. For example, we took snapshot in early July 2011. From the running inst

Using Cassandra for transaction logging, good idea?

2011-07-28 Thread Kent Narling
Hi! I am considering to use cassandra for clustered transaction logging in a project. What I need are in principal 3 functions: 1 - Log transaction with a unique (but possibly non-sequential) id 2 - Fetch transaction with a specific id 3 - Fetch X new transactions "after" a specific cursor/trans

Re: Changing the CLI, not a great idea!

2011-07-28 Thread Nicholas Knight
On Jul 28, 2011, at 6:35, Jonathan Ellis wrote: > I'm talking about data compatibility, which is more important than cli > statement compatibility. > > Consider someone with a python program that creates a CF with the > default settings and inserts some (say) uuid columns and long data. > > If

Re: memory_locking_policy parameter in cassandra.yaml for disabling swap - has this variable been renamed?

2011-07-28 Thread Terje Marthinussen
On Jul 28, 2011, at 9:52 PM, Jonathan Ellis wrote: > This is not advisable in general, since non-mmap'd I/O is substantially > slower. I see this again and again as a claim here, but it is actually close to 10 years since I saw mmap'd I/O have any substantial performance benefits on any real

CQL Driver status

2011-07-28 Thread Nicholas Neuberger
I had a few questions that I couldn't easily answer by looking through many JIRA tickets and the wiki. We are currently developing an application based on Cassandra 0.8.x with the CQL driver. Does the CQL driver currently support cursor / resultsets? I'd like to implement a pagination feature.

Re: Changing the CLI, not a great idea!

2011-07-28 Thread Edward Capriolo
On Thu, Jul 28, 2011 at 9:35 AM, Jonathan Ellis wrote: > I'm talking about data compatibility, which is more important than cli > statement compatibility. > > Consider someone with a python program that creates a CF with the > default settings and inserts some (say) uuid columns and long data. >

Re: Changing the CLI, not a great idea!

2011-07-28 Thread Jonathan Ellis
I'm talking about data compatibility, which is more important than cli statement compatibility. Consider someone with a python program that creates a CF with the default settings and inserts some (say) uuid columns and long data. If we changed CF creation to default to ascii we would break this p

Re: Changing the CLI, not a great idea!

2011-07-28 Thread Edward Capriolo
On Thu, Jul 28, 2011 at 8:46 AM, Jonathan Ellis wrote: > It defaults to hex because that is how bytestype is represented. The > default remains bytestype to provide the kind of backwards > compatibility you are complaining about. :) > > On Thu, Jul 28, 2011 at 6:56 AM, Edward Capriolo > wrote:

Re: Expanding 0.6.x cluster to multiple datacenters

2011-07-28 Thread Ashley Martens
Okay. So what happens when I try to add a third DC? Since we would be using RAS any new node would get the entire dataset. On Jul 28, 2011, at 5:49, Jonathan Ellis wrote: > The steps are the same for RUS -> RAS no matter what version of > Cassandra you are on, but 0.7 introduced NTS which is d

Re: Problems using Thrift API in C

2011-07-28 Thread Eric Tamme
On 07/28/2011 05:29 AM, Aleksandrs Saveljevs wrote: essentially a rewrite of the first part of the C++ example given at http://wiki.apache.org/cassandra/ThriftExamples#C.2B-.2B- . If we run it under strace, we see that it hangs on the call to recv() when setting keyspace: $ strace -s 64 ./tes

Re: Questions over the use of CQL

2011-07-28 Thread Jonathan Ellis
You can quote CQL column names to allow any column name that Thrift would allow (suitably encoded for ascii). For instance, CQL knows that UUIDs are represented as strings like 12345678-1234-5678-1234-567812345678 and will parse them correctly. If you mean the official CompositeType, that should

Re: memory_locking_policy parameter in cassandra.yaml for disabling swap - has this variable been renamed?

2011-07-28 Thread Jonathan Ellis
I don't think there's ever been a "memory_locking_policy" variable. Cassandra will call mlockall if JNA is present, no further steps required. On Thu, Jul 28, 2011 at 5:17 AM, Stephen Henderson wrote: > Hi, > > > > We’ve started having problems with cassandra and memory swapping on linux > which

Re: memory_locking_policy parameter in cassandra.yaml for disabling swap - has this variable been renamed?

2011-07-28 Thread Jonathan Ellis
This is not advisable in general, since non-mmap'd I/O is substantially slower. The OP is correct that it is best to disable swap entirely, and second-best to enable JNA for mlockall. On Thu, Jul 28, 2011 at 7:05 AM, Adi wrote: >> Hi, >> >> >> >> We’ve started having problems with cassandra and

Re: Expanding 0.6.x cluster to multiple datacenters

2011-07-28 Thread Jonathan Ellis
The steps are the same for RUS -> RAS no matter what version of Cassandra you are on, but 0.7 introduced NTS which is designed to make adding datacenters easier. On Thu, Jul 28, 2011 at 6:57 AM, Ashley Martens wrote: > Thank you. For 0.7 are the steps similar? > > > > On Jul 27, 2011, at 19:56, J

Re: Changing the CLI, not a great idea!

2011-07-28 Thread Jonathan Ellis
It defaults to hex because that is how bytestype is represented. The default remains bytestype to provide the kind of backwards compatibility you are complaining about. :) On Thu, Jul 28, 2011 at 6:56 AM, Edward Capriolo wrote: > > > On Thursday, July 28, 2011, Sasha Dolgy wrote: >> Unfortunate

Re: NotFoundException thrown for get(), but not get_slice() with a column_names predicate

2011-07-28 Thread Jonathan Ellis
No, the slice semantics are "give me whatever happens to exist between start and end." It's valid for the answer to be "nothing." On Thu, Jul 28, 2011 at 6:55 AM, David Allsopp wrote: > If I try to retrieve a column that is not present, using get(), then I'll > get a NotFoundException. > > If (f

Re: memory_locking_policy parameter in cassandra.yaml for disabling swap - has this variable been renamed?

2011-07-28 Thread Adi
> > Hi, > > ** ** > > We’ve started having problems with cassandra and memory swapping on linux > which seems to be a fairly common issue (in our particular case after about > a week all swap space will have been used up and we have to restart the > process). > > ** ** > > It sounds like t

Re: Expanding 0.6.x cluster to multiple datacenters

2011-07-28 Thread Ashley Martens
Thank you. For 0.7 are the steps similar? On Jul 27, 2011, at 19:56, Jonathan Ellis wrote: > As you know, with 0.6 adding a datacenter is not as easy as 0.7 with > NetworkTopologyStrategy. With 0.6 there is a right way that will work > with some manual effort, and a wrong way that can cause y

Re: Changing the CLI, not a great idea!

2011-07-28 Thread Edward Capriolo
On Thursday, July 28, 2011, Sasha Dolgy wrote: > Unfortunately, the perception that I have as a business consumer and > night-time hack, is that more importance and effort is placed on > ensuring information is up to date and correct on the > http://www.datastax.com/docs/0.8/index website and less

NotFoundException thrown for get(), but not get_slice() with a column_names predicate

2011-07-28 Thread David Allsopp
If I try to retrieve a column that is not present, using get(), then I'll get a NotFoundException. If (for efficiency's sake) I try to retrieve several named columns using get_slice, with a column_names predicate (i.e. a list of columns) then I won't get the exception if one of those columns is mi

Problems using Thrift API in C

2011-07-28 Thread Aleksandrs Saveljevs
Dear all, We are considering using Cassandra for storing gathered data in Zabbix (see https://support.zabbix.com/browse/ZBXNEXT-844 for more details). Because Zabbix is written in C, we are considering using Thrift API in C, too. However, we are running into problems trying to get even the b

Re: Changing the CLI, not a great idea!

2011-07-28 Thread Sasha Dolgy
Unfortunately, the perception that I have as a business consumer and night-time hack, is that more importance and effort is placed on ensuring information is up to date and correct on the http://www.datastax.com/docs/0.8/index website and less on keeping the wiki up to date or relevant... which for

AW: results of index slice query

2011-07-28 Thread Roland Gude
Created https://issues.apache.org/jira/browse/CASSANDRA-2964 -Ursprüngliche Nachricht- Von: Jonathan Ellis [mailto:jbel...@gmail.com] Gesendet: Mittwoch, 27. Juli 2011 17:35 An: user@cassandra.apache.org Betreff: Re: results of index slice query Sounds like a Cassandra bug to me. On Wed

Re: Changing the CLI, not a great idea!

2011-07-28 Thread David Boxenhorn
This is part of a much bigger problem, one which has many parts, among them: 1. Cassandra is complex. Getting a gestalt understanding of it makes me think I understand how Alzheimer's patients must feel. 2. There is no official documentation. Perhaps everything is out there somewhere, who knows? 3

Re: Changing the CLI, not a great idea!

2011-07-28 Thread Hartog C. de Mik
On Wed, Jul 27, 2011 at 11:01:17PM -0500, Jonathan Ellis wrote: > On Wed, Jul 27, 2011 at 10:53 PM, Edward Capriolo > wrote: > > You can not even put two statements on the same line. So the ';' is semi > > useless syntax. > > Nobody ever asked for that, but lots of people asked to allow > statem

Re: No Transactions: An Example

2011-07-28 Thread Jeremy Sevellec
Hi All, Making transaction is my actual preoccupation of the moment. My need is : - update data in column family #1 - insert data in column family #2 My need is to see thes opérations in a single "transaction" because the data is tightly coupled. I use zookeeper/cage to make distributed lock t