With the trunk version and given patches I am now getting following exception:
10/09/08 22:39:14 WARN mapred.LocalJobRunner: job_local_0001 java.lang.ClassCastException: [B cannot be cast to java.nio.ByteBuffer at org.apache.cassandra.hadoop.ColumnFamilyRecordWriter.write(ColumnFamilyRecordWriter.java:68) at org.apache.hadoop.mapred.ReduceTask$NewTrackingRecordWriter.write(ReduceTask.java:508) at org.apache.hadoop.mapreduce.TaskInputOutputContext.write(TaskInputOutputContext.java:80) at cassandratest.Main$TReducer.reduce(Main.java:132) at cassandratest.Main$TReducer.reduce(Main.java:113) at org.apache.hadoop.mapreduce.Reducer.run(Reducer.java:176) at org.apache.hadoop.mapred.ReduceTask.runNewReducer(ReduceTask.java:566) at org.apache.hadoop.mapred.ReduceTask.run(ReduceTask.java:408) at org.apache.hadoop.mapred.LocalJobRunner$Job.run(LocalJobRunner.java:216) 10/09/08 22:39:14 INFO mapred.JobClient: map 100% reduce 0% Alex. ________________________________________ From: Jonathan Ellis [jbel...@gmail.com] Sent: Wednesday, September 08, 2010 2:26 PM To: user@cassandra.apache.org Subject: Re: ColumnFamilyOutputFormat and mapreduce.output.columnfamilyoutputformat.batch.threshold 0.70beta1 Try the patches on https://issues.apache.org/jira/browse/CASSANDRA-1434 (or wait until they're committed to trunk, then try a nightly build) On Wed, Sep 8, 2010 at 4:18 PM, Alex Burkoff <aburk...@clearpathnet.com> wrote: > Guys, > > I was testing ColumnFamilyOutputFormat and found that only columns from the > last Reduce > invocation get stored when > mapreduce.output.columnfamilyoutputformat.batch.threshold has > the default value. Setting it to 1 changes the behavior, and all data is > stored then. Is it the > intended behavior, or am I missing something ? > > Best regards, > > Alex Burkoff > -- Jonathan Ellis Project Chair, Apache Cassandra co-founder of Riptano, the source for professional Cassandra support http://riptano.com