Re: Nodetool ring and multiple dc

2012-02-09 Thread Ravikumar Govindarajan
Thanks David, for the clarification. I feel it would be better if nodetool ring reports per-dc token space ownerships to correctly reflect what cassandra is internally doing, instead of global token space ownership. - Ravi On Fri, Feb 10, 2012 at 12:42 PM, David Schairer wrote: > nodetool ring

Re: Lots of 0 Bytes tmp Data/Index files remain in data folder

2012-02-09 Thread Janne Jalkanen
Yup, that's exactly it. You can get rid of those either by restarting the node or upgrading to 1.0.7. /Janne On Feb 10, 2012, at 02:49 , Roshan wrote: > I have deployed 2 node Cassandra 1.0.6 cluster in production and it running > almost t weeks without any issue. But I can see lots of (more t

Re: Nodetool ring and multiple dc

2012-02-09 Thread David Schairer
nodetool ring is, IMHO, quite confusing in the case of multiple datacenters. Might be easier to think of it as two rings: in your DC1 ring you have two nodes, and since the tokens are balanced, assuming your rows are randomly distributed you'll have half the data on each, since your replicatio
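
The "two rings" way of reading the output can be sketched in a few lines of Python. This is only an illustration of that mental model, not what nodetool itself computes. It assumes RandomPartitioner (a token space of 2**127), and the function name, node addresses and tokens are hypothetical.

```python
# Assumption: RandomPartitioner, whose token space is 0 .. 2**127.
RING_SIZE = 2 ** 127


def per_dc_ownership(nodes):
    """Return {address: fraction of the ring owned within the node's own DC}.

    nodes is a list of (address, dc, token) tuples. Each datacenter is
    treated as a separate ring: a node owns the range from the previous
    token in the same DC up to its own token.
    """
    by_dc = {}
    for address, dc, token in nodes:
        by_dc.setdefault(dc, []).append((token, address))

    ownership = {}
    for members in by_dc.values():
        members.sort()
        for i, (token, address) in enumerate(members):
            # Index -1 wraps around to the last node when i == 0.
            prev_token = members[i - 1][0]
            span = (token - prev_token) % RING_SIZE
            if span == 0:
                # A single node in a DC owns that DC's whole ring.
                span = RING_SIZE
            ownership[address] = span / RING_SIZE
    return ownership


# Hypothetical four-node layout: two balanced nodes per datacenter.
example_nodes = [
    ("node-a", "DC1", 0),
    ("node-b", "DC1", 2 ** 126),
    ("node-c", "DC2", 1),
    ("node-d", "DC2", 2 ** 126 + 1),
]
example_ownership = per_dc_ownership(example_nodes)
```

With this layout every node gets 0.5 of its own datacenter's ring. A global view of the same tokens would instead show two nodes with almost no ownership, because the DC2 tokens sit one step after the DC1 tokens.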

Nodetool ring and multiple dc

2012-02-09 Thread Ravikumar Govindarajan
Hi, I was trying to set up a backup DC from an existing DC. State of the existing DC with SimpleStrategy & rep_factor=1: ./nodetool -h localhost ring Address DC Rack Status State Load Owns Token 85070591730234615865843651857942052864 XXX.YYY DC1

Lots of 0 Bytes tmp Data/Index files remain in data folder

2012-02-09 Thread Roshan
I have deployed 2 node Cassandra 1.0.6 cluster in production and it running almost t weeks without any issue. But I can see lots of (more than 90) 0 bytes tmp data and index files in the data directory. So far this is not an issue for me, but want to know why is that. Seems like this data/index tm

Re: Tips for using OrderedPartitioner

2012-02-09 Thread Tharindu Mathew
That sounds like writing a DB... indexing the index row :) By "making the keys uniform", do you mean keeping the initial X characters the same, or the last Y the same? Could you elaborate, please? Also, if there's a hot spot, is there any way out of it other than restarting from scratch...

Re: Cassandra 1.0.6 multi data center question

2012-02-09 Thread Jeremiah Jordan
No, not an issue. The nodes in DC2 know that they aren't supposed to have data, so they go ask the nodes in DC1 for the data to return to you. -Jeremiah On 02/09/2012 05:28 AM, Roshan Pradeep wrote: Thanks Peter for the replies. Previously it was a typing mistake and it should be "getting".

Flume and Cassandra

2012-02-09 Thread Alain RODRIGUEZ
Hi, 1 - I would like to generate some statistics and store some raw events from log files tailed with Flume. I saw some plugins giving Cassandra sinks but I would like to store data in a custom way, storing raw data but also incrementing counters to get near real-time statistics. How to do it? Do

Re: Cassandra 1.0.6 multi data center question

2012-02-09 Thread Roshan Pradeep
Thanks Peter for the replies. Previously it was a typing mistake and it should be "getting". I checked DC2 (which has replica 0) and noticed that there are no SSTables created. I use a Java Hector sample program to insert data into the keyspace. After I insert a data item, I 1) Login to one of n

Re: internode communication using multiple network interfaces

2012-02-09 Thread aaron morton
> I have 3 Cassandra nodes in one data center all on the same local network, > which needs to replicate from an off site data center. Only 1 of the 3 > nodes, called dw01, is externally accessible. If you want to run a multi data centre cluster, all the nodes in both data centers need to be ab