Re: Throttling ColumnFamilyRecordReader

2010-10-20 Thread Michael Moores
n't overlap. > > On Tue, Oct 19, 2010 at 3:22 PM, Michael Moores wrote: >> Does it make sense to add some kind of throttle capability on the >> ColumnFamilyRecordReader for Hadoop? >> >> If I have 60 or so Map tasks running at the same time when the cluster is

ColumnFamilyRecordReader issue

2010-10-14 Thread Michael Moores
My Hadoop TaskTracker is using the Cassandra CplumnFamilyInputFormat, and appears to be finding records (the data is serialized below in the log output), but the cassandra Column class is throwing a validation exception indicating "Required field 'clock' was not present!". My Cassandra cluster v

Re: 0.7.0-beta2 and Hadoop

2010-10-14 Thread Michael Moores
I SOLVED the problem. It was my misunderstanding of how the cassandra connection is being used for calling getSlices(). On Oct 14, 2010, at 10:06 AM, Michael Moores wrote: Ok I moved back to hadoop 20.2 and the WordCount example is doing better. But I am still seeing a problem, that may be due

Re: 0.7.0-beta2 and Hadoop

2010-10-14 Thread Michael Moores
Method) Should I expect my job to be executed on the TaskTracker nodes? On Oct 13, 2010, at 5:39 PM, Michael Moores wrote: What version of hadoop should i be using with cassandra 0.7.0-beta2? I am using the latest version 21.0. Just running a modified version of the WordCount example: https:/

0.7.0-beta2 and Hadoop

2010-10-13 Thread Michael Moores
What version of hadoop should i be using with cassandra 0.7.0-beta2? I am using the latest version 21.0. Just running a modified version of the WordCount example: https://svn.apache.org/repos/asf/cassandra/trunk/contrib/word_count/src/ I get a linkage error thrown from the getSplits method. Exce

Re: 0.7 beta2 - Fatal exception from RowMutationVerbHandler

2010-10-13 Thread Michael Moores
I created CASSANDRA-1617 On Oct 12, 2010, at 1:51 PM, Michael Moores wrote: > I have a cluster of 8 nodes with a replication factor of 3 and consistency of > QUORUM. > When I stop one node in the cluster I end up with socket read timeouts to >

0.7 beta2 - Fatal exception from RowMutationVerbHandler

2010-10-12 Thread Michael Moores
I have a cluster of 8 nodes with a replication factor of 3 and consistency of QUORUM. When I stop one node in the cluster I end up with socket read timeouts to other nodes: org.apache.thrift.transport.TTransportException: java.net.SocketTimeoutException: Read timed out The timeout is set to 4 se