Re: Cassandra 1.0 hangs during GC

2012-07-23 Thread Nikolay Kоvshov
 21th I have mirgated to cassandra 1.1.2 but see no improvement  cat /var/log/cassandra/Earth1.log | grep "GC for" INFO [ScheduledTasks:1] 2012-05-22 17:42:48,445 GCInspector.java (line 123) GC for ParNew: 345 ms for 1 collections, 82451888 used; max is 8464105472 INFO [ScheduledTasks:1] 2012-05-

Re: Unreachable node, not in nodetool ring

2012-07-23 Thread Alain RODRIGUEZ
Does anyone knows how to totally remove a dead node that only appears when doing a "describe cluster" from the cli ? I still got this issue in my production cluster. Alain 2012/7/20 Alain RODRIGUEZ : > Hi Aaron, > > I have repaired and cleanup both nodes already and I did it after any > change o

R: Re: Counters values are less than expected [1.0.6 - Java/Pelops]

2012-07-23 Thread cbert...@libero.it
Cannot reproduce ...Written in CL Quorum, RF = 3, cluster of 5 nodes ... I suppose it's an issue with the client since it's not the first "strange behaviour" with CounterColumns ... Messaggio originale Da: aa...@thelastpickle.com Data: 20/07/2012 11.12 A: Ogg: Re: Counters value

Re: Schema advice: (Single row or multiple row!?) How do I store millions of columns when I need to read a set of around 500 columns at a single read query using column names ?

2012-07-23 Thread rohit bhatia
You should probably try to break the one row scheme to 2*Number_of_nodes rows scheme.. This should ensure proper distribution of rows and still allow u to query from a few fixed number of rows. How u do it depends on how are u gonna choose ur 200-500 columns during reading (try having them in the s

nodetool move causes summary load to grow

2012-07-23 Thread Nikolay Kоvshov
I have a testing cluster cassandra 1.1.2 with default memory and cache settings, 1 CF, 1 KS, RF = 2 This is an empty cluster 10.111.1.141datacenter1 rack1 Up Normal 43.04 KB100.00% 0 10.111.1.142datacenter1 rack1

Re: Cassandra 1.0 hangs during GC

2012-07-23 Thread Joost van de Wijgerd
Howmuch memory do you have on the machine. Seems like you have 8G reserved for the Cassandra java process, If this is all the memory on the machine you might be swapping. Also which jvm do you use? kind regards Joost On Mon, Jul 23, 2012 at 10:07 AM, Nikolay Kоvshov wrote: > 21th I have mirgat

Re: Schema advice: (Single row or multiple row!?) How do I store millions of columns when I need to read a set of around 500 columns at a single read query using column names ?

2012-07-23 Thread Eldad Yamin
in addition, if you don't know how many rows will be needed - in each row, you can store the key of the next one. Just like in a linked list. OR have 1 row that will hold all the keys that combining your other rows. 1st select the main row (with the keys), then select the other rows. On Mon, J

Migrating data from a 0.8.8 -> 1.1.2 ring

2012-07-23 Thread Mike Heffner
Hi, We are migrating from a 0.8.8 ring to a 1.1.2 ring and we are noticing missing data post-migration. We use pre-built/configured AMIs so our preferred route is to leave our existing production 0.8.8 untouched and bring up a parallel 1.1.2 ring and migrate data into it. Data is written to the ri

Re: Schema advice: (Single row or multiple row!?) How do I store millions of columns when I need to read a set of around 500 columns at a single read query using column names ?

2012-07-23 Thread Mohit Anchlia
On Mon, Jul 23, 2012 at 10:07 AM, Ertio Lew wrote: > My major concern is that is it too bad retrieving 300-500 rows (each for a > single column) in a single read query that I should store all these(around > a hundred million) columns in a single row? You could create multiple rows and each row

Re: Schema advice: (Single row or multiple row!?) How do I store millions of columns when I need to read a set of around 500 columns at a single read query using column names ?

2012-07-23 Thread Ertio Lew
Actually these columns are 1 for each entity in my application & I need to query at any time columns for a list of 300-500 entities in one go.

Re: Schema advice: (Single row or multiple row!?) How do I store millions of columns when I need to read a set of around 500 columns at a single read query using column names ?

2012-07-23 Thread Mohit Anchlia
On Mon, Jul 23, 2012 at 10:53 AM, Ertio Lew wrote: > Actually these columns are 1 for each entity in my application & I need to > query at any time columns for a list of 300-500 entities in one go. Can you describe your situation with small example?

Re: Schema advice: (Single row or multiple row!?) How do I store millions of columns when I need to read a set of around 500 columns at a single read query using column names ?

2012-07-23 Thread Ertio Lew
For each user in my application, I want to store a *value* that is queried by using the userId. So there is going to be one column for each user (userId as col Name & *value* as col Value). Now I want to store these columns such that can efficiently read columns for atleast 300-500 users in a sin

Re: Schema advice: (Single row or multiple row!?) How do I store millions of columns when I need to read a set of around 500 columns at a single read query using column names ?

2012-07-23 Thread Mohit Anchlia
On Mon, Jul 23, 2012 at 11:00 AM, Ertio Lew wrote: > For each user in my application, I want to store a *value* that is queried > by using the userId. So there is going to be one column for each user > (userId as col Name & *value* as col Value). Now I want to store these > columns such that can

Re: Schema advice: (Single row or multiple row!?) How do I store millions of columns when I need to read a set of around 500 columns at a single read query using column names ?

2012-07-23 Thread Ertio Lew
I want to read columns for a randomly selected list of userIds(completely random). I fetch the data using userIds(which would be used as column names in case of single row or as rowkeys incase of 1 row for each user) for a selected list of users. Assume that the application knows the list of userId

Re: Schema advice: (Single row or multiple row!?) How do I store millions of columns when I need to read a set of around 500 columns at a single read query using column names ?

2012-07-23 Thread Mohit Anchlia
On Mon, Jul 23, 2012 at 11:16 AM, Ertio Lew wrote: > I want to read columns for a randomly selected list of userIds(completely > random). I fetch the data using userIds(which would be used as column names > in case of single row or as rowkeys incase of 1 row for each user) for a > selected list o

CQL3 and column slices

2012-07-23 Thread Josep Blanquer
Hi, I am confused as to what is the way to specify column slices for composite type CFs using CQL3. I first thought that the way to do so was to use the very ugly and unintuitive syntax of constructing the PK prefix with equalities, except the last part of the composite type. But, now, after see

Re: Is it possible to design queries to retrieve columns which match acronyms of col names ?

2012-07-23 Thread Philip O'Toole
On Tue, Jul 24, 2012 at 03:13:40AM +0530, Ertio Lew wrote: > Hi all, > > I wanted to know if it is somehow possible to design queries which could > fetch all columns from a row whose acronyms(first letter from each word) > would match a particular string ? Do you know the "particular" strings ahe

Bringing a dead node back up after fixing hardware issues

2012-07-23 Thread Eran Chinthaka Withana
Hi, In my cluster, one of the nodes went down (due to a hardware failure). We managed to get it fixed in couple of days. But it seems its harder to bring this same node back into cluster without creating read misses. Here is what I did. Method 1: I copied the data from all the nodes in that data

Re: Bringing a dead node back up after fixing hardware issues

2012-07-23 Thread Brandon Williams
On Mon, Jul 23, 2012 at 6:26 PM, Eran Chinthaka Withana wrote: > Method 1: I copied the data from all the nodes in that data center, into the > repaired node, and brought it back up. But because of the rate of updates > happening, the read misses started going up. That's not really a good method

Re: Bringing a dead node back up after fixing hardware issues

2012-07-23 Thread Eran Chinthaka Withana
Thanks Brandon for the answer (and I didn't know driftx = Brandon Williams. Thanks for your awesome support in Cassandra IRC) Increasing CL is tricky for us for now, as our RF on that datacenter is 2 and CL is set to ONE. If we make the CL to be LOCAL_QUORUM, then, if a node goes down we will have

Re: Cassandra 1.0 hangs during GC

2012-07-23 Thread Wojciech Meler
Can you provide output from sar command for the time period when long GC occurred ? Regards, Wojciech Meler