RE: mmap segment underflow

2011-04-05 Thread Or Yanay
I am considering treating the node as a dead node, delete it's data files and bootstrap from scratch. Is that a reasonable approach? Is there a way for me to identify the bad file/s and remove it/them? From: Or Yanay [mailto:o...@peer39.com] Sent: Monday, April 04, 2011 7:19 PM To:

mmap segment underflow

2011-04-04 Thread Or Yanay
Hi All, I have upgraded from 0.7.0 to 0.7.4, and while running scrub I get the following exception quite a lot: java.lang.AssertionError: mmap segment underflow; remaining is 73936639 but 1970430821 requested at org.apache.cassandra.io.util.MappedFileDataInput.readBytes(MappedF

RE: Requests stuck on production cluster

2011-03-31 Thread Or Yanay
I am using Cassandra 0.7.0 and Random Partitioner. From: Or Yanay [mailto:o...@peer39.com] Sent: Thursday, March 31, 2011 12:20 PM To: user@cassandra.apache.org Subject: Requests stuck on production cluster Hi all, My production cluster reads got stuck. The ring gives: Address Status

Requests stuck on production cluster

2011-03-31 Thread Or Yanay
Hi all, My production cluster reads got stuck. The ring gives: Address Status State LoadOwnsToken 146231632500721020374621781629360107476 10.39.21.7 Up Normal 118.86 GB 18.15% 696879268146680791533

RE: newbie question: how do I know the total number of rows of a cf?

2011-03-28 Thread Or Yanay
I use one of two ways to achieve that: 1. run a map reduce. Pig is really helpful in these cases. Make sure you run your MR using Hadoop task tracker on your nodes - or your performance will take a hit. 2. dump all keys using sstablekeys script from relevant files on all machines and count u

RE: Map-Reduce on top of cassandra

2011-03-24 Thread Or Yanay
o the hadoop cluster prior to processing. They're going to try after installing task trackers on the nodes. On Mar 14, 2011, at 10:06 AM, Or Yanay wrote: > Hi All, > > I am trying to write some map-reduce tasks so I can find out stuff like - how > many records have X status? >

Column TTL Hook

2011-03-24 Thread Or Yanay
Hi all, Is there any way for me to save or pass the columns retired by the column TTL? I understand that compaction takes care of removing expired columns, I would like to know the keys for which columns have expired. Thanks, -Orr

Map-Reduce on top of cassandra

2011-03-14 Thread Or Yanay
Hi All, I am trying to write some map-reduce tasks so I can find out stuff like - how many records have X status? I am using 0.7.0 and have 5 nodes with ~100G of data on each node. I have written the code based on the word_count example and the map-reduce is running successfully BUT is extremel

map-reduce failure

2011-01-11 Thread Or Yanay
Hi all, I am using 0.6.8 across 5 machines with ~30G of data on each machine. I am trying to run a map-reduce query (Both with my own Java code and Pig) and failing after about 30 minutes (see stack trace and details below). I have followed this wiki page