Hi All,
I am trying to process sequence file using Pig. Sequence file is provided as input to pig script. A UDF is written which basically does the processing of meta data and actual file data. This UDF returns input sequence file in form of tuples (metadata, actual file ) as output. This is working fine on Hadoop : · Hadoop : 1.0.3-gphd-1.2.0.0 and · pig version : Apache Pig version 0.9.2-gphd-1.2.0.0 But when I am trying to run same on following version of Hadoop and pig it is failing with error as below: · Hadoop : Hadoop 2.2.0-gphd-3.1.0.0 · Pig version : Apache Pig version 0.12.0-gphd-3.1.0.0 Error: Pig Stack Trace --------------- ERROR 1066: Unable to open iterator for alias limited. Backend error : Exception while executing [POUserFunc (Name: POUserFunc(com.company.bigdata.serde.Deserializer)[tuple] - scope-3 Operator Key: scope-3) children: null at []]: java.lang.NullPointerException org.apache.pig.impl.logicalLayer.FrontendException: ERROR 1066: Unable to open iterator for alias limited. Backend error : Exception while executing [POUserFunc (Name: POUserFunc(com.company.bigdata.serde.Deserializer)[tuple] - scope-3 Operator Key: scope-3) children: null at []]: java.lang.NullPointerException Any pointers on this will be helpful. I can see several posts on similar issue by many people on google but no posted solution. Thanks, Ankur
