Hi, I have an error with pig action in oozie 4.0.0 using cassandraStorage. (cassandra 1.2.10)
I can run pig scripts right with cassandra. but whe I try to use cassandraStorage to load data I have this error: *Run pig script using PigRunner.run() for Pig version 0.8+* *Apache Pig version 0.10.0 (r1328203) * *compiled Apr 20 2012, 00:33:25* *Run pig script using PigRunner.run() for Pig version 0.8+* *2013-12-10 12:24:39,084 [main] INFO org.apache.pig.Main - Apache Pig version 0.10.0 (r1328203) compiled Apr 20 2012, 00:33:25* *2013-12-10 12:24:39,084 [main] INFO org.apache.pig.Main - Apache Pig version 0.10.0 (r1328203) compiled Apr 20 2012, 00:33:25* *2013-12-10 12:24:39,095 [main] INFO org.apache.pig.Main - Logging error messages to: /tmp/hadoop-ec2-user/mapred/local/taskTracker/ec2-user/jobcache/job_201312100858_0007/attempt_201312100858_0007_m_000000_0/work/pig-job_201312100858_0007.log* *2013-12-10 12:24:39,095 [main] INFO org.apache.pig.Main - Logging error messages to: /tmp/hadoop-ec2-user/mapred/local/taskTracker/ec2-user/jobcache/job_201312100858_0007/attempt_201312100858_0007_m_000000_0/work/pig-job_201312100858_0007.log* *2013-12-10 12:24:39,501 [main] INFO org.apache.pig.backend.hadoop.executionengine.HExecutionEngine - Connecting to hadoop file system at: hdfs://10.228.243.18:9000 <http://10.228.243.18:9000>* *2013-12-10 12:24:39,501 [main] INFO org.apache.pig.backend.hadoop.executionengine.HExecutionEngine - Connecting to hadoop file system at: hdfs://10.228.243.18:9000 <http://10.228.243.18:9000>* *2013-12-10 12:24:39,510 [main] INFO org.apache.pig.backend.hadoop.executionengine.HExecutionEngine - Connecting to map-reduce job tracker at: 10.228.243.18:9001 <http://10.228.243.18:9001>* *2013-12-10 12:24:39,510 [main] INFO org.apache.pig.backend.hadoop.executionengine.HExecutionEngine - Connecting to map-reduce job tracker at: 10.228.243.18:9001 <http://10.228.243.18:9001>* *2013-12-10 12:24:40,505 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 2245: * *<file testCassandra.pig, line 7, column 7> Cannot get schema from loadFunc org.apache.cassandra.hadoop.pig.CassandraStorage* *2013-12-10 12:24:40,505 [main] ERROR org.apache.pig.tools.grunt.Grunt - ERROR 2245: * *<file testCassandra.pig, line 7, column 7> Cannot get schema from loadFunc org.apache.cassandra.hadoop.pig.CassandraStorage* *2013-12-10 12:24:40,505 [main] ERROR org.apache.pig.tools.grunt.Grunt - org.apache.pig.impl.logicalLayer.FrontendException: ERROR 2245: * *<file testCassandra.pig, line 7, column 7> Cannot get schema from loadFunc org.apache.cassandra.hadoop.pig.CassandraStorage* * at org.apache.pig.newplan.logical.relational.LOLoad.getSchemaFromMetaData(LOLoad.java:155)* * at org.apache.pig.newplan.logical.relational.LOLoad.getSchema(LOLoad.java:110)* * at org.apache.pig.newplan.logical.relational.LOStore.getSchema(LOStore.java:68)* * at org.apache.pig.newplan.logical.visitor.SchemaAliasVisitor.validate(SchemaAliasVisitor.java:60)* * at org.apache.pig.newplan.logical.visitor.SchemaAliasVisitor.visit(SchemaAliasVisitor.java:84)* * at org.apache.pig.newplan.logical.relational.LOStore.accept(LOStore.java:77)* * at org.apache.pig.newplan.DependencyOrderWalker.walk(DependencyOrderWalker.java:75)* * at org.apache.pig.newplan.PlanVisitor.visit(PlanVisitor.java:50)* * at org.apache.pig.PigServer$Graph.compile(PigServer.java:1617)* * at org.apache.pig.PigServer$Graph.compile(PigServer.java:1611)* * at org.apache.pig.PigServer$Graph.access$200(PigServer.java:1334)* * at org.apache.pig.PigServer.execute(PigServer.java:1239)* * at org.apache.pig.PigServer.executeBatch(PigServer.java:362)* * at org.apache.pig.tools.grunt.GruntParser.executeBatch(GruntParser.java:132)* * at org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:193)* * at org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:165)* * at org.apache.pig.tools.grunt.Grunt.exec(Grunt.java:84)* * at org.apache.pig.Main.run(Main.java:430)* * at org.apache.pig.PigRunner.run(PigRunner.java:49)* * at org.apache.oozie.action.hadoop.PigMain.runPigJob(PigMain.java:283)* * at org.apache.oozie.action.hadoop.PigMain.run(PigMain.java:223)* * at org.apache.oozie.action.hadoop.LauncherMain.run(LauncherMain.java:37)* * at org.apache.oozie.action.hadoop.PigMain.main(PigMain.java:76)* * at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)* * at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)* * at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)* * at java.lang.reflect.Method.invoke(Method.java:601)* * at org.apache.oozie.action.hadoop.LauncherMapper.map(LauncherMapper.java:226)* * at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:50)* * at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:436)* * at org.apache.hadoop.mapred.MapTask.run(MapTask.java:372)* * at org.apache.hadoop.mapred.Child$4.run(Child.java:255)* * at java.security.AccessController.doPrivileged(Native Method)* * at javax.security.auth.Subject.doAs(Subject.java:415)* * at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1136)* * at org.apache.hadoop.mapred.Child.main(Child.java:249)* *Caused by: java.io.IOException: PIG_INPUT_INITIAL_ADDRESS or PIG_INITIAL_ADDRESS environment variable not set* * at org.apache.cassandra.hadoop.pig.CassandraStorage.setLocation(CassandraStorage.java:314)* * at org.apache.cassandra.hadoop.pig.CassandraStorage.getSchema(CassandraStorage.java:358)* * at org.apache.pig.newplan.logical.relational.LOLoad.getSchemaFromMetaData(LOLoad.java:151)* * ... 35 more* *2013-12-10 12:24:40,505 [main] ERROR org.apache.pig.tools.grunt.Grunt - org.apache.pig.impl.logicalLayer.FrontendException: ERROR 2245: * *<file testCassandra.pig, line 7, column 7> Cannot get schema from loadFunc org.apache.cassandra.hadoop.pig.CassandraStorage* * at org.apache.pig.newplan.logical.relational.LOLoad.getSchemaFromMetaData(LOLoad.java:155)* * at org.apache.pig.newplan.logical.relational.LOLoad.getSchema(LOLoad.java:110)* * at org.apache.pig.newplan.logical.relational.LOStore.getSchema(LOStore.java:68)* * at org.apache.pig.newplan.logical.visitor.SchemaAliasVisitor.validate(SchemaAliasVisitor.java:60)* * at org.apache.pig.newplan.logical.visitor.SchemaAliasVisitor.visit(SchemaAliasVisitor.java:84)* * at org.apache.pig.newplan.logical.relational.LOStore.accept(LOStore.java:77)* * at org.apache.pig.newplan.DependencyOrderWalker.walk(DependencyOrderWalker.java:75)* * at org.apache.pig.newplan.PlanVisitor.visit(PlanVisitor.java:50)* * at org.apache.pig.PigServer$Graph.compile(PigServer.java:1617)* * at org.apache.pig.PigServer$Graph.compile(PigServer.java:1611)* * at org.apache.pig.PigServer$Graph.access$200(PigServer.java:1334)* * at org.apache.pig.PigServer.execute(PigServer.java:1239)* * at org.apache.pig.PigServer.executeBatch(PigServer.java:362)* * at org.apache.pig.tools.grunt.GruntParser.executeBatch(GruntParser.java:132)* * at org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:193)* * at org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:165)* * at org.apache.pig.tools.grunt.Grunt.exec(Grunt.java:84)* * at org.apache.pig.Main.run(Main.java:430)* * at org.apache.pig.PigRunner.run(PigRunner.java:49)* * at org.apache.oozie.action.hadoop.PigMain.runPigJob(PigMain.java:283)* * at org.apache.oozie.action.hadoop.PigMain.run(PigMain.java:223)* * at org.apache.oozie.action.hadoop.LauncherMain.run(LauncherMain.java:37)* * at org.apache.oozie.action.hadoop.PigMain.main(PigMain.java:76)* * at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)* * at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:57)* * at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:43)* * at java.lang.reflect.Method.invoke(Method.java:601)* * at org.apache.oozie.action.hadoop.LauncherMapper.map(LauncherMapper.java:226)* * at org.apache.hadoop.mapred.MapRunner.run(MapRunner.java:50)* * at org.apache.hadoop.mapred.MapTask.runOldMapper(MapTask.java:436)* * at org.apache.hadoop.mapred.MapTask.run(MapTask.java:372)* * at org.apache.hadoop.mapred.Child$4.run(Child.java:255)* * at java.security.AccessController.doPrivileged(Native Method)* * at javax.security.auth.Subject.doAs(Subject.java:415)* * at org.apache.hadoop.security.UserGroupInformation.doAs(UserGroupInformation.java:1136)* * at org.apache.hadoop.mapred.Child.main(Child.java:249)* *Caused by: java.io.IOException: PIG_INPUT_INITIAL_ADDRESS or PIG_INITIAL_ADDRESS environment variable not set* * at org.apache.cassandra.hadoop.pig.CassandraStorage.setLocation(CassandraStorage.java:314)* * at org.apache.cassandra.hadoop.pig.CassandraStorage.getSchema(CassandraStorage.java:358)* * at org.apache.pig.newplan.logical.relational.LOLoad.getSchemaFromMetaData(LOLoad.java:151)* * ... 35 more* *<<< Invocation of Main class completed <<<* *Failing Oozie Launcher, Main class [org.apache.oozie.action.hadoop.PigMain], exit code [2]* *Oozie Launcher failed, finishing Hadoop job gracefully* *-----------------------------------------------------------------------* I, m using Ec2 instances and i have a hadoop cluster with cassandra in all nodes and I can run right pig_cassandra scriptand I conigured oozie in other instance with namenode.. I set the var in .bash_profile file like: *..* *export PIG_INITIAL_ADDRESS=${seed}* *...* How can I set this env variable in Oozie or in Pig with Oozie Another cuestion is: 2.- I configured oozie installation to use pig 0.11.1 in the pom.xml like the Oozie instructions installation but I see in the logs that Oozie is using Pig 0.10. *...* *Run pig script using PigRunner.run() for Pig version 0.8+* *Apache Pig version 0.10.0 (r1328203) * *compiled Apr 20 2012, 00:33:25* *...* *I upload to hdfs share lib and check pig library is 0.11.1 version* How can i change or configure this issue? thanks in advance And any helps wil be appreciated. Regards