Re: when are keyspace dirs removed?

2013-01-03 Thread aaron morton
> Are they never removed in fear of removing snapshots? Aye. Their should be shapshots in there https://github.com/apache/cassandra/blob/trunk/conf/cassandra.yaml#L402 Cheers - Aaron Morton Freelance Cassandra Developer New Zealand @aaronmorton http://www.thelastpickle.com On

Re: Cassandra supported Avro data types

2013-01-03 Thread aaron morton
Collections might be what you are looking for http://www.datastax.com/dev/blog/cql3_collections Cheers - Aaron Morton Freelance Cassandra Developer New Zealand @aaronmorton http://www.thelastpickle.com On 4/01/2013, at 11:35 AM, Renato Marroquín Mogrovejo wrote: > Thanks Aar

Re: AWS EMR <-> Cassandra

2013-01-03 Thread aaron morton
> Instead, I get an error from CassandraStorage that the initial address isn't > set (on the slave, the master is ok). Can you post the full error ? Cheers - Aaron Morton Freelance Cassandra Developer New Zealand @aaronmorton http://www.thelastpickle.com On 4/01/2013, at 11:15

Re: Cassandra supported Avro data types

2013-01-03 Thread Renato Marroquín Mogrovejo
Thanks Aaron! What I am trying to map into Cassandra is an avro model like the one that follows: { "type": "record", "name": "WebPage", "namespace": "org.bytegolem.webpage.generated", "fields" : [ {"name": "url", "type": "string"}, {"name": "content", "type": ["null","bytes"]}] }

AWS EMR <-> Cassandra

2013-01-03 Thread William Oberman
Anyone ever try to read or write directly between EMR <-> Cassandra? I'm running various Cassandra resources in Ec2, so the "physical connection" part is pretty easy using security groups. But, I'm having some configuration issues. I have managed to get Cassandra + Hadoop working in the past usi

Re: Error after 1.2.0 upgrade

2013-01-03 Thread Edward Capriolo
There is a danger here disablethrift and disablegossip do not stop the fat client. On Thu, Jan 3, 2013 at 3:07 PM, aaron morton wrote: > This is what I do to shutdown. Disabling thrift and gossip will stop > incoming requests, but it wont stop existing streams. However these do not > go through t

Re: Error after 1.2.0 upgrade

2013-01-03 Thread aaron morton
This is what I do to shutdown. Disabling thrift and gossip will stop incoming requests, but it wont stop existing streams. However these do not go through the commit log. echo "Disabling thrift and gossip..." nodetool -h localhost disablethrift; nodetool -h localhost disablegossip; echo "Sleep

Re: Error after 1.2.0 upgrade

2013-01-03 Thread Edward Capriolo
The only true drain is 1) turn on ip tables to stop all incoming traffic 2) flush 3) wait 4) delete files 5) upgrade 6) restart On Thu, Jan 3, 2013 at 2:59 PM, Michael Kjellman wrote: > That's why I didn’t create a ticket as I knew there was one. But, I > thought this had been fixed in 1.1.7 ??

Re: Cassandra supported Avro data types

2013-01-03 Thread aaron morton
> I am not sure which > Avro data types Cassandra support None. Any mention to avro internally is from a dark time in the past. Cassandra presents it's own data model, which can be used to store pretty much anything. So you could serialise the avro types and store them as byte streams if you

Re: Error after 1.2.0 upgrade

2013-01-03 Thread Michael Kjellman
That's why I didn’t create a ticket as I knew there was one. But, I thought this had been fixed in 1.1.7 ?? From: Edward Capriolo mailto:edlinuxg...@gmail.com>> Reply-To: "user@cassandra.apache.org" mailto:user@cassandra.apache.org>> Date: Thursday, January 3, 2

Re: Error after 1.2.0 upgrade

2013-01-03 Thread aaron morton
Alain, Can you create a ticket on https://issues.apache.org/jira/browse/CASSANDRA ? Please include the full JVM vendor and version and the log output from /var/log/cassadra/system.log as it includes information. Can you also mention the client you are using. Thanks.

Re: Error after 1.2.0 upgrade

2013-01-03 Thread Edward Capriolo
There is a bug on this, drain has been in a weird state for a long time. In 1.0 it did not work labeled as a known limitation. https://issues.apache.org/jira/browse/CASSANDRA-4446 On Thu, Jan 3, 2013 at 2:49 PM, Michael Kjellman wrote: > Another thing: for those that use counters this might be

Re: Error after 1.2.0 upgrade

2013-01-03 Thread Michael Kjellman
Another thing: for those that use counters this might be a problem. I always do a nodetool drain before upgrading a node (as is good practice btw). However, in every case on every one of my nodes, the commit log was replayed on each node and mutations were created. Could lead to double counting

Re: Error after 1.2.0 upgrade

2013-01-03 Thread Michael Kjellman
Tracking Issues: https://issues.apache.org/jira/browse/CASSANDRA-5101 https://issues.apache.org/jira/browse/CASSANDRA-5104 which was created because of https://issues.apache.org/jira/browse/CASSANDRA-5103 https://issues.apache.org/jira/browse/CASSANDRA-5102 Also friendly reminder to all that cql

Re: Error after 1.2.0 upgrade

2013-01-03 Thread aaron morton
Michael, Could you share some of your problems ? May be of help for others. Cheers - Aaron Morton Freelance Cassandra Developer New Zealand @aaronmorton http://www.thelastpickle.com On 4/01/2013, at 5:45 AM, Michael Kjellman wrote: > I'm having huge upgrade issues fr

Re: Multi threads updating single row

2013-01-03 Thread Hiller, Dean
My bad, I meant exactly that, there could be performance issues if you have multiple nodes hitting the same row since that row work items are serialised. Dean From: aaron morton mailto:aa...@thelastpickle.com>> Reply-To: "user@cassandra.apache.org" mailto:user@

Re: Row cache and counters

2013-01-03 Thread aaron morton
The first thing I look for with timeouts like that is a flush storm causing blocking in the write path (due to the internal "switch lock"). Take a look in the logs, for a number of messages such as "enqueuing CF…" and "writing cf..". Look for a pattern of enqueuing cf messages that occur immed

Re: Multi threads updating single row

2013-01-03 Thread aaron morton
> Multiple nodes could be a problem Not sure what you mean here Dean. There are no issues with multiple clients, from multiple threads, processes or nodes inserting / updating to the same row. There are potential performance issues though due to row level isolation used in the write path. The

Re: UUID - Java

2013-01-03 Thread aaron morton
There are several classes of UUID, so it depends a little on what the call to com.eaio.uuid.UUID() is doing. In general you will be ok. In general you can assume it's unique, however version 1 time UUID's may not be. As they contain a time component and the machine mac address, which is sometim

Re: sliding window problem

2013-01-03 Thread aaron morton
Without knowing any details of your problem, try using a Time To Live on the column. You will get a better reply if you ask a more specific question. Cheers - Aaron Morton Freelance Cassandra Developer New Zealand @aaronmorton http://www.thelastpickle.com On 3/01/2013, at 5:

Re: Error after 1.2.0 upgrade

2013-01-03 Thread Edward Capriolo
Just a shot in the dark, but I would try setting -Xss higher then the default. It's probably like 180, but I cant even start at that level, bumped it up to 256 for JDK 7. On Thu, Jan 3, 2013 at 12:02 PM, Michael Kjellman wrote: > :) yes, I'm crazy > > The assertion appears to be compiled code whi

Re: RandomPartitioner to Murmur3Partitioner

2013-01-03 Thread Edward Capriolo
By the way "10% faster" does not necessarily mean 10% more requests. https://issues.apache.org/jira/browse/CASSANDRA-2975 https://issues.apache.org/jira/browse/CASSANDRA-3772 Also if you follow the tickets "My tests show that Murmur3Partitioner actually is worse than MD5 with high cardinalit

Cassandra supported Avro data types

2013-01-03 Thread Renato Marroquín Mogrovejo
Hi all, I have started playing around with Cassandra and I am not sure which Avro data types Cassandra supports. Is there any type of documentation or could anyone please help me on this doubt of mine? Thanks! Renato M.

Re: Error after 1.2.0 upgrade

2013-01-03 Thread Michael Kjellman
:) yes, I'm crazy The assertion appears to be compiled code which is why I was guessing jna. Biggest issue right now is that upgraded 1.2.0 nodes only see other 1.2.0 nodes in the ring. 1.1.7 nodes don't see the 1.2.0 nodes.. Upgrading every node to 1.2.0 now lists all nodes in the ring... On

Re: Error after 1.2.0 upgrade

2013-01-03 Thread Alain RODRIGUEZ
Wow, so you're going live with 1.2.0, good luck with that. When it's done, whould you mind letting me know if everything went fine or if you have some advice or feedback? "This looks related to JNA?" Does it ? The only thing logged about JNA is the following : "JNA mlockall successful". What doe

Re: Error after 1.2.0 upgrade

2013-01-03 Thread Michael Kjellman
I'm having huge upgrade issues from 1.1.7 -> 1.2.0 atm but in a 12 node cluster which I am slowly massaging into a good state I haven't seen this in 15+ hours of operation… This looks related to JNA? From: Alain RODRIGUEZ mailto:arodr...@gmail.com>> Reply-To: "user@cassandra.apache.org

Re: Cassandra read throughput with little/no caching.

2013-01-03 Thread Tyler Hobbs
> > Your description above was much better :-) I'm more interested in docs for > the raw metrics provided in JMX. I don't think there are any good docs for what is exposed directly through JMX. Most of the OpsCenter metrics map closely to one exposed JMX item, so that's a start. Other than that

Re: Row cache and counters

2013-01-03 Thread André Cruz
Does anyone see anything wrong in these settings? Anything to account for a 8s timeout during a counter increment? Thanks, André On 31/12/2012, at 14:35, André Cruz wrote: > On Dec 29, 2012, at 8:53 PM, Mohit Anchlia wrote: > >> Can you post gc settings? Also check logs and see what it says

RE: Force data to a specific node

2013-01-03 Thread DE VITO Dominique
Hi Sávio, There is no definitive response: it depends on your business model ;-) I just guess here it should be something like the id of some data root. Take also a look at http://www.datastax.com/dev/blog/schema-in-cassandra-1-1 and look for "partition key", if you want to go through CQL. De

Re: Force data to a specific node

2013-01-03 Thread Sávio Teles
Hi Dominique, I have the same problem! I would like to place an object in a specific node because I'm working in a spatial application. How should I choose the K1 part to forcing a given object to go to a node? 2013/1/3 DE VITO Dominique > Hi Everton, > > ** ** > > AFAIK, the pb is not f

Re: Multi threads updating single row

2013-01-03 Thread Hiller, Dean
Multiple nodes could be a problem but multiple threads is probably just fine. If you have two threads write to the same column, the last one wins though so I hope your timestamps are unique even across threads so you don't lose data ;). Dean From: Jay Svc mailto:jaytechg...@gmail.com>> Reply-T

Re: UUID - Java

2013-01-03 Thread Hiller, Dean
Another option is PlayOrm UniqueKeyGenerator.generateKey(). While it is not returning a UUID, it does return a unique key for a specific cluster that is nice and short too. Dean From: Baskar Sikkayan mailto:techba...@gmail.com>> Reply-To: "user@cassandra.apache.org

RE: Force data to a specific node

2013-01-03 Thread DE VITO Dominique
Hi Everton, AFAIK, the pb is not forcing data to a specific node, but forcing some kind of data locality. There is things into cql to do it: you define a composite key (K1, K2), and K1 part is used as a rowkey and K2 is used within column name. So, all rows with same K1 are on the same node.

Re: Last Modified Time Series in cassandra

2013-01-03 Thread Pierre-Yves Ritschard
You can use an approach with two CFs The first one would be ExpiredCF | -- File = Key |-- Reversed(TimeUUID) (representing) last change In this CF, each entry is expired (after a day, an hour, whatever) EventualCF |-- File = Key |-- String Storing a file update for a key 'K' at a time

UUID - Java

2013-01-03 Thread Baskar Sikkayan
Hi, I use the following code to generate the primary key. public static java.util.UUID getTimeUUID() { return java.util.UUID.fromString(new com.eaio.uuid.UUID().toString()); } Is this the correct approach? Do let me know if there i any better approach which guarantees no conflict

Re: RandomPartitioner to Murmur3Partitioner

2013-01-03 Thread Sylvain Lebresne
On Thu, Jan 3, 2013 at 10:21 AM, Alain RODRIGUEZ wrote: > > Does this mean that there absolutely no way to switch to the new > partitioner for people that are already using Cassandra ? > Yes, that is what this means. -- Sylvain