Wait... keep in mind there is no Hadoop on my system. How can I be getting
Hadoop 1/2 issues? I can't use XMLLoader...
grunt> REGISTER
/Users/rjurney/Software/pig-0.14.0/contrib/piggybank/java/piggybank.jar
grunt>
grunt> data = LOAD '/Users/rjurney/Software/foo/data/gsa_feed.xml' USING
org.apache.pig.piggybank.storage.XMLLoader('record') AS (doc:chararray);
grunt> a = limit data 10;
grunt> dump a
2015-01-21 21:42:52,113 [main] INFO
org.apache.pig.tools.pigstats.ScriptState - Pig features used in the
script: LIMIT
2015-01-21 21:42:52,125 [main] WARN org.apache.pig.data.SchemaTupleBackend
- SchemaTupleBackend has already been initialized
2015-01-21 21:42:52,126 [main] INFO
org.apache.pig.newplan.logical.optimizer.LogicalPlanOptimizer -
{RULES_ENABLED=[AddForEach, ColumnMapKeyPrune, ConstantCalculator,
GroupByConstParallelSetter, LimitOptimizer, LoadTypeCastInserter,
MergeFilter, MergeForEach, PartitionFilterOptimizer,
PredicatePushdownOptimizer, PushDownForEachFlatten, PushUpFilter,
SplitFilter, StreamTypeCastInserter]}
2015-01-21 21:42:52,200 [main] INFO
org.apache.hadoop.mapreduce.lib.input.FileInputFormat - Total input paths
to process : 1
2015-01-21 21:42:52,200 [main] INFO
org.apache.pig.backend.hadoop.executionengine.util.MapRedUtil - Total input
paths to process : 1
2015-01-21 21:42:52,204 [main] ERROR org.apache.pig.tools.grunt.Grunt -
ERROR 2998: Unhandled internal error. Found class
org.apache.hadoop.mapreduce.TaskAttemptContext, but interface was expected
2015-01-21 21:42:52,204 [main] ERROR org.apache.pig.tools.grunt.Grunt -
java.lang.IncompatibleClassChangeError: Found class
org.apache.hadoop.mapreduce.TaskAttemptContext, but interface was expected
at
org.apache.pig.piggybank.storage.XMLLoader$XMLRecordReader.initialize(XMLLoader.java:102)
at
org.apache.pig.impl.io.ReadToEndLoader.initializeReader(ReadToEndLoader.java:210)
at
org.apache.pig.impl.io.ReadToEndLoader.getNextHelper(ReadToEndLoader.java:248)
at org.apache.pig.impl.io.ReadToEndLoader.getNext(ReadToEndLoader.java:229)
at
org.apache.pig.backend.hadoop.executionengine.physicalLayer.relationalOperators.POLoad.getNextTuple(POLoad.java:137)
at
org.apache.pig.backend.hadoop.executionengine.physicalLayer.PhysicalOperator.processInput(PhysicalOperator.java:307)
at
org.apache.pig.backend.hadoop.executionengine.physicalLayer.relationalOperators.POLimit.getNextTuple(POLimit.java:122)
at
org.apache.pig.backend.hadoop.executionengine.physicalLayer.PhysicalOperator.processInput(PhysicalOperator.java:307)
at
org.apache.pig.backend.hadoop.executionengine.physicalLayer.relationalOperators.POForEach.getNextTuple(POForEach.java:246)
at
org.apache.pig.backend.hadoop.executionengine.physicalLayer.PhysicalOperator.processInput(PhysicalOperator.java:307)
at
org.apache.pig.backend.hadoop.executionengine.physicalLayer.relationalOperators.POStore.getNextTuple(POStore.java:159)
at
org.apache.pig.backend.hadoop.executionengine.fetch.FetchLauncher.runPipeline(FetchLauncher.java:161)
at
org.apache.pig.backend.hadoop.executionengine.fetch.FetchLauncher.launchPig(FetchLauncher.java:81)
at
org.apache.pig.backend.hadoop.executionengine.HExecutionEngine.launchPig(HExecutionEngine.java:278)
at org.apache.pig.PigServer.launchPlan(PigServer.java:1390)
at org.apache.pig.PigServer.executeCompiledLogicalPlan(PigServer.java:1375)
at org.apache.pig.PigServer.storeEx(PigServer.java:1034)
at org.apache.pig.PigServer.store(PigServer.java:997)
at org.apache.pig.PigServer.openIterator(PigServer.java:910)
at org.apache.pig.tools.grunt.GruntParser.processDump(GruntParser.java:746)
at
org.apache.pig.tools.pigscript.parser.PigScriptParser.parse(PigScriptParser.java:372)
at
org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:230)
at
org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:205)
at org.apache.pig.tools.grunt.Grunt.run(Grunt.java:66)
at org.apache.pig.Main.run(Main.java:558)
at org.apache.pig.Main.main(Main.java:170)
Details also at logfile: /private/tmp/pig_1421905319118.log
ᐧ
On Wed, Jan 21, 2015 at 9:42 PM, Russell Jurney <[email protected]>
wrote:
> Not sure what was going on, but I got it working.
> ᐧ
>
> On Wed, Jan 21, 2015 at 9:39 PM, Russell Jurney <[email protected]>
> wrote:
>
>> I am working on a macbook without Hadoop installed. I download Pig
>> 0.14.0, and I run it...
>>
>> Russells-MacBook-Pro:pig-0.14.0 rjurney$ bin/pig -l /tmp -v -w -x local
>>
>>
>> I run the following commands and get this exception. What gives? Why
>> doesn't pig work?
>>
>> grunt> foo = LOAD '/etc/passwd' USING TextLoader();
>>
>> 2015-01-21 21:36:24,095 [main] INFO
>> org.apache.hadoop.conf.Configuration.deprecation - fs.default.name is
>> deprecated. Instead, use fs.defaultFS
>>
>> 2015-01-21 21:36:24,096 [main] INFO
>> org.apache.hadoop.conf.Configuration.deprecation - io.bytes.per.checksum is
>> deprecated. Instead, use dfs.bytes-per-checksum
>>
>> grunt> dump foo
>>
>> 2015-01-21 21:36:25,701 [main] INFO
>> org.apache.pig.tools.pigstats.ScriptState - Pig features used in the
>> script: UNKNOWN
>>
>> 2015-01-21 21:36:25,735 [main] INFO
>> org.apache.hadoop.conf.Configuration.deprecation - io.bytes.per.checksum is
>> deprecated. Instead, use dfs.bytes-per-checksum
>>
>> 2015-01-21 21:36:25,737 [main] INFO
>> org.apache.hadoop.conf.Configuration.deprecation - fs.default.name is
>> deprecated. Instead, use fs.defaultFS
>>
>> 2015-01-21 21:36:25,738 [main] INFO
>> org.apache.pig.newplan.logical.optimizer.LogicalPlanOptimizer -
>> {RULES_ENABLED=[AddForEach, ColumnMapKeyPrune, ConstantCalculator,
>> GroupByConstParallelSetter, LimitOptimizer, LoadTypeCastInserter,
>> MergeFilter, MergeForEach, PartitionFilterOptimizer,
>> PredicatePushdownOptimizer, PushDownForEachFlatten, PushUpFilter,
>> SplitFilter, StreamTypeCastInserter]}
>>
>> 2015-01-21 21:36:25,740 [main] INFO
>> org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MRCompiler -
>> File concatenation threshold: 100 optimistic? false
>>
>> 2015-01-21 21:36:25,743 [main] INFO
>> org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MultiQueryOptimizer
>> - MR plan size before optimization: 1
>>
>> 2015-01-21 21:36:25,743 [main] INFO
>> org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MultiQueryOptimizer
>> - MR plan size after optimization: 1
>>
>> 2015-01-21 21:36:25,776 [main] INFO
>> org.apache.hadoop.conf.Configuration.deprecation - io.bytes.per.checksum is
>> deprecated. Instead, use dfs.bytes-per-checksum
>>
>> 2015-01-21 21:36:25,777 [main] INFO
>> org.apache.hadoop.conf.Configuration.deprecation - fs.default.name is
>> deprecated. Instead, use fs.defaultFS
>>
>> 2015-01-21 21:36:25,779 [main] ERROR org.apache.pig.tools.grunt.Grunt -
>> ERROR 2998: Unhandled internal error. Bad type on operand stack
>>
>> Exception Details:
>>
>> Location:
>>
>>
>> org/apache/hadoop/mapred/JobTrackerInstrumentation.create(Lorg/apache/hadoop/mapred/JobTracker;Lorg/apache/hadoop/mapred/JobConf;)Lorg/apache/hadoop/mapred/JobTrackerInstrumentation;
>> @5: invokestatic
>>
>> Reason:
>>
>> Type 'org/apache/hadoop/metrics2/lib/DefaultMetricsSystem' (current
>> frame, stack[2]) is not assignable to
>> 'org/apache/hadoop/metrics2/MetricsSystem'
>>
>> Current Frame:
>>
>> bci: @5
>>
>> flags: { }
>>
>> locals: { 'org/apache/hadoop/mapred/JobTracker',
>> 'org/apache/hadoop/mapred/JobConf' }
>>
>> stack: { 'org/apache/hadoop/mapred/JobTracker',
>> 'org/apache/hadoop/mapred/JobConf',
>> 'org/apache/hadoop/metrics2/lib/DefaultMetricsSystem' }
>>
>> Bytecode:
>>
>> 0000000: 2a2b b200 03b8 0004 b0
>>
>>
>> 2015-01-21 21:36:25,780 [main] ERROR org.apache.pig.tools.grunt.Grunt -
>> java.lang.VerifyError: Bad type on operand stack
>>
>> Exception Details:
>>
>> Location:
>>
>>
>> org/apache/hadoop/mapred/JobTrackerInstrumentation.create(Lorg/apache/hadoop/mapred/JobTracker;Lorg/apache/hadoop/mapred/JobConf;)Lorg/apache/hadoop/mapred/JobTrackerInstrumentation;
>> @5: invokestatic
>>
>> Reason:
>>
>> Type 'org/apache/hadoop/metrics2/lib/DefaultMetricsSystem' (current
>> frame, stack[2]) is not assignable to
>> 'org/apache/hadoop/metrics2/MetricsSystem'
>>
>> Current Frame:
>>
>> bci: @5
>>
>> flags: { }
>>
>> locals: { 'org/apache/hadoop/mapred/JobTracker',
>> 'org/apache/hadoop/mapred/JobConf' }
>>
>> stack: { 'org/apache/hadoop/mapred/JobTracker',
>> 'org/apache/hadoop/mapred/JobConf',
>> 'org/apache/hadoop/metrics2/lib/DefaultMetricsSystem' }
>>
>> Bytecode:
>>
>> 0000000: 2a2b b200 03b8 0004 b0
>>
>>
>> at org.apache.hadoop.mapred.LocalJobRunner.<init>(LocalJobRunner.java:420)
>>
>> at org.apache.hadoop.mapred.JobClient.init(JobClient.java:472)
>>
>> at org.apache.hadoop.mapred.JobClient.<init>(JobClient.java:457)
>>
>> at
>> org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.MapReduceLauncher.launchPig(MapReduceLauncher.java:163)
>>
>> at
>> org.apache.pig.backend.hadoop.executionengine.HExecutionEngine.launchPig(HExecutionEngine.java:280)
>>
>> at org.apache.pig.PigServer.launchPlan(PigServer.java:1390)
>>
>> at
>> org.apache.pig.PigServer.executeCompiledLogicalPlan(PigServer.java:1375)
>>
>> at org.apache.pig.PigServer.storeEx(PigServer.java:1034)
>>
>> at org.apache.pig.PigServer.store(PigServer.java:997)
>>
>> at org.apache.pig.PigServer.openIterator(PigServer.java:910)
>>
>> at
>> org.apache.pig.tools.grunt.GruntParser.processDump(GruntParser.java:746)
>>
>> at
>> org.apache.pig.tools.pigscript.parser.PigScriptParser.parse(PigScriptParser.java:372)
>>
>> at
>> org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:230)
>>
>> at
>> org.apache.pig.tools.grunt.GruntParser.parseStopOnError(GruntParser.java:205)
>>
>> at org.apache.pig.tools.grunt.Grunt.run(Grunt.java:66)
>>
>> at org.apache.pig.Main.run(Main.java:558)
>>
>> at org.apache.pig.Main.main(Main.java:170)
>>
>>
>> Details also at logfile: /private/tmp/pig_1421904926350.log
>>
>> --
>> Russell Jurney twitter.com/rjurney [email protected] datasyndrome.
>> com
>> ᐧ
>>
>
>
>
> --
> Russell Jurney twitter.com/rjurney [email protected] datasyndrome.
> com
>
--
Russell Jurney twitter.com/rjurney [email protected] datasyndrome.com