Ad 2: it works with -x local, so there can't be an issue between Pig and the DB (Cassandra).
I'm using pig-0.8 from the official site + hadoop-0.20.2 from the official site.

Thanks

2011/4/20 aaron morton <aa...@thelastpickle.com>

> Am guessing but here goes. Looks like the cassandra RPC port is not set,
> did you follow these steps in contrib/pig/README.txt
>
> Finally, set the following as environment variables (uppercase,
> underscored), or as Hadoop configuration variables (lowercase, dotted):
> * PIG_RPC_PORT or cassandra.thrift.port : the port thrift is listening on
> * PIG_INITIAL_ADDRESS or cassandra.thrift.address : initial address to
>   connect to
> * PIG_PARTITIONER or cassandra.partitioner.class : cluster partitioner
>
> Hope that helps.
> Aaron
>
> On 20 Apr 2011, at 11:28, pob wrote:
>
> > Hello,
> >
> > I configured the cluster following
> > http://wiki.apache.org/cassandra/HadoopSupport. When I run
> > pig example-script.pig -x local, everything is fine and I get correct
> > results.
> >
> > The problem occurs with -x mapreduce.
> >
> > I'm getting these errors:
> >
> > 2011-04-20 01:24:21,791 [main] ERROR org.apache.pig.tools.pigstats.PigStats - ERROR: java.lang.NumberFormatException: null
> > 2011-04-20 01:24:21,792 [main] ERROR org.apache.pig.tools.pigstats.PigStatsUtil - 1 map reduce job(s) failed!
> > 2011-04-20 01:24:21,793 [main] INFO org.apache.pig.tools.pigstats.PigStats - Script Statistics:
> >
> > Input(s):
> > Failed to read data from "cassandra://Keyspace1/Standard1"
> >
> > Output(s):
> > Failed to produce result in "hdfs://ip:54310/tmp/temp-1383865669/tmp-1895601791"
> >
> > Counters:
> > Total records written : 0
> > Total bytes written : 0
> > Spillable Memory Manager spill count : 0
> > Total bags proactively spilled: 0
> > Total records proactively spilled: 0
> >
> > Job DAG:
> > job_201104200056_0005 -> null,
> > null -> null,
> > null
> >
> > 2011-04-20 01:24:21,793 [main] INFO org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher - Failed!
> > 2011-04-20 01:24:21,803 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 1066: Unable to open iterator for alias topnames. Backend error : java.lang.NumberFormatException: null
> >
> > ====
> > That's from the jobtasks web management, the error from the task directly:
> >
> > java.lang.RuntimeException: java.lang.NumberFormatException: null
> >     at org.apache.cassandra.hadoop.ColumnFamilyRecordReader.initialize(ColumnFamilyRecordReader.java:123)
> >     at org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigRecordReader.initialize(PigRecordReader.java:176)
> >     at org.apache.hadoop.mapred.MapTask$NewTrackingRecordReader.initialize(MapTask.java:418)
> >     at org.apache.hadoop.mapred.MapTask.runNewMapper(MapTask.java:620)
> >     at org.apache.hadoop.mapred.MapTask.run(MapTask.java:305)
> >     at org.apache.hadoop.mapred.Child.main(Child.java:170)
> > Caused by: java.lang.NumberFormatException: null
> >     at java.lang.Integer.parseInt(Integer.java:417)
> >     at java.lang.Integer.parseInt(Integer.java:499)
> >     at org.apache.cassandra.hadoop.ConfigHelper.getRpcPort(ConfigHelper.java:233)
> >     at org.apache.cassandra.hadoop.ColumnFamilyRecordReader.initialize(ColumnFamilyRecordReader.java:105)
> >     ... 5 more
> >
> > Any suggestions as to where the problem could be?
> >
> > Thanks,
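
For reference, the trace above bottoms out in Integer.parseInt being handed a null inside ConfigHelper.getRpcPort, which is what one would expect if cassandra.thrift.port never reaches the job configuration seen by the map task. Below is a minimal sketch of that failure mode. It assumes a plain java.util.Properties as a stand-in for the Hadoop configuration object; the class name RpcPortDemo, the method body and the port value 9160 are illustrative assumptions, not the actual Cassandra source.

```java
import java.util.Properties;

public class RpcPortDemo {
    // Key name taken from the README excerpt quoted in this thread.
    static final String THRIFT_PORT_KEY = "cassandra.thrift.port";

    // Hypothetical stand-in for the lookup: parse whatever the
    // configuration returns, even when the key is missing (null).
    static int getRpcPort(Properties conf) {
        return Integer.parseInt(conf.getProperty(THRIFT_PORT_KEY));
    }

    public static void main(String[] args) {
        Properties conf = new Properties();

        // Key not set: getProperty returns null and parseInt throws.
        try {
            getRpcPort(conf);
        } catch (NumberFormatException e) {
            System.out.println("port not set: " + e);
        }

        // Key set: the lookup succeeds. 9160 is only an example value.
        conf.setProperty(THRIFT_PORT_KEY, "9160");
        System.out.println("port = " + getRpcPort(conf));
    }
}
```

If this reading is right, the run with -x local works only because the setting happens to be visible there, and the thing to check is whether PIG_RPC_PORT (or cassandra.thrift.port) is actually set for the mapreduce run.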