Hi,
I have a very weird issue with my PIG script. Following is the content of
my script
*REGISTER /home/hadoopuser/Workspace/lib/piggybank.jar*
*REGISTER /home/hadoopuser/Workspace/lib/datafu.jar;*
*REGISTER
/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hbase/hbase-0.94.2-cdh4.2.1-security.jar;
*
*REGISTER
/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/zookeeper/zookeeper-3.4.5-cdh4.2.1.jar;
*
*SET default_parallel 15;*
*records = LOAD 'hbase://dm-re' USING
org.apache.pig.backend.hadoop.hbase.HBaseStorage('v:ctm v:src','-caching
5000 -gt 1366098805& -lt 1366102543&') as
(time:chararray,company:chararray);*
*records_iso = FOREACH records GENERATE
org.apache.pig.piggybank.evaluation.datetime.convert.CustomFormatToISO(time,'yyyy-MM-dd
HH:mm:ss Z') as iso_time;*
*records_group = GROUP records_iso ALL;*
*result = FOREACH records_group GENERATE MAX(records_iso.iso_time) as
maxtime;*
*DUMP result*
When i try to run this script in cluster of 5 nodes with 20 map slots, most
of the map tasks fail with the following error after 10 mins of
initializing,
*Task attempt <id> failed to report status for 600 seconds. Killing!*
I tried to decrease the caching size to less than 100 or so, (under the
intuition that maybe fetching and processing more cache is taking more
time) but still the same issue. However if i manage to load the rows (using
lt and gt) such that number of map tasks are <=2, the job will be
successfully finished. When the number of tasks is > 2 , it is always the
case that 2-4 tasks are completed and the rest all fail with the above
mentioned error. I attach the task tracker log hereby for this attempt. I
don't see any error except for some zookeeper connection warnings. I
manually checked from that node and doing a 'hbase zkcli' connects without
any issue. Hence, I assume that zookeeper is configured properly.
I don't really understand where to debug this problem. It would be great if
someone could provide assistance. Some configurations of the cluster, which
i think maybe relevant here,
*dfs.block.size = 1 GB
io.sort.mb = 1 GB
HRegion size = 1 GB
*
and the size of the hbase table is close to 250 GB. I have observed 100%
cpu usage by the mapred user on the node, while the task is under
execution. I am not really sure, what to optimize in this case for the job
to complete. It would be good if someone can throw some light in this
direction.
PS: All my nodes in the cluster are configured on a EBS backed amazon ec2
cluster.
--
Regards,
Praveen Bysani
http://www.praveenbysani.com
Task Logs: 'attempt_201305081039_0028_m_000010_0'
stdout logs
stderr logs
syslog logs
2013-05-13 06:29:17,218 WARN mapreduce.Counters: Group
org.apache.hadoop.mapred.Task$Counter is deprecated. Use
org.apache.hadoop.mapreduce.TaskCounter instead
2013-05-13 06:29:18,921 INFO
org.apache.hadoop.filecache.TrackerDistributedCacheManager: Creating symlink:
/ebs/mapred/local/taskTracker/hadoopuser/jobcache/job_201305081039_0028/jars/job.jar
<-
/ebs/mapred/local/taskTracker/hadoopuser/jobcache/job_201305081039_0028/attempt_201305081039_0028_m_000010_0/work/job.jar
2013-05-13 06:29:18,954 INFO
org.apache.hadoop.filecache.TrackerDistributedCacheManager: Creating symlink:
/ebs/mapred/local/taskTracker/hadoopuser/jobcache/job_201305081039_0028/jars/.job.jar.crc
<-
/ebs/mapred/local/taskTracker/hadoopuser/jobcache/job_201305081039_0028/attempt_201305081039_0028_m_000010_0/work/.job.jar.crc
2013-05-13 06:29:19,076 WARN org.apache.hadoop.conf.Configuration: session.id
is deprecated. Instead, use dfs.metrics.session-id
2013-05-13 06:29:19,077 INFO org.apache.hadoop.metrics.jvm.JvmMetrics:
Initializing JVM Metrics with processName=MAP, sessionId=
2013-05-13 06:29:19,923 INFO org.apache.hadoop.util.ProcessTree: setsid exited
with exit code 0
2013-05-13 06:29:19,967 INFO org.apache.hadoop.mapred.Task: Using
ResourceCalculatorPlugin :
org.apache.hadoop.util.LinuxResourceCalculatorPlugin@5a92668c
2013-05-13 06:29:20,345 INFO org.apache.hadoop.mapred.MapTask: Processing
split: Number of splits :1
Total Length = 0
Input split[0]:
Length = 0
Locations:
ip-10-122-3-220.ap-northeast-1.compute.internal
-----------------------
2013-05-13 06:29:20,777 INFO org.apache.zookeeper.ZooKeeper: Client
environment:zookeeper.version=3.4.5-cdh4.2.1--1, built on 04/22/2013 03:46 GMT
2013-05-13 06:29:20,777 INFO org.apache.zookeeper.ZooKeeper: Client
environment:host.name=ip-10-122-3-220.ap-northeast-1.compute.internal
2013-05-13 06:29:20,777 INFO org.apache.zookeeper.ZooKeeper: Client
environment:java.version=1.6.0_31
2013-05-13 06:29:20,777 INFO org.apache.zookeeper.ZooKeeper: Client
environment:java.vendor=Sun Microsystems Inc.
2013-05-13 06:29:20,778 INFO org.apache.zookeeper.ZooKeeper: Client
environment:java.home=/usr/lib/jvm/j2sdk1.6-oracle/jre
2013-05-13 06:29:20,778 INFO org.apache.zookeeper.ZooKeeper: Client
environment:java.class.path=/run/cloudera-scm-agent/process/215-mapreduce-TASKTRACKER:/usr/lib/jvm/j2sdk1.6-oracle/lib/tools.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/hadoop-core-2.0.0-mr1-cdh4.2.1.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/activation-1.1.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/ant-contrib-1.0b3.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/asm-3.2.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/aspectjrt-1.6.5.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/aspectjtools-1.6.5.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/avro-1.7.3.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/avro-compiler-1.7.3.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/commons-beanutils-1.7.0.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/commons-beanutils-core-1.8.0.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/commons-cli-1.2.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/commons-codec-1.4.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/commons-collections-3.2.1.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/commons-configuration-1.6.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/commons-digester-1.8.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/commons-el-1.0.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/commons-httpclient-3.1.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/commons-io-2.1.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/commons-lang-2.5.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/commons-logging-1.1.1.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/commons-math-2.1.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/commons-net-3.1.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/guava-11.0.2.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/hadoop-fairscheduler-2.0.0-mr1-cdh4.2.1.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/hsqldb-1.8.0.10.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/jackson-core-asl-1.8.8.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/jackson-jaxrs-1.8.8.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/jackson-mapper-asl-1.8.8.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/jackson-xc-1.8.8.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/jasper-compiler-5.5.23.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/jasper-runtime-5.5.23.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/jaxb-api-2.2.2.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/jaxb-impl-2.2.3-1.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/jersey-core-1.8.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/jersey-json-1.8.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/jersey-server-1.8.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/jets3t-0.6.1.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/jettison-1.1.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/jetty-6.1.26.cloudera.2.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/jetty-util-6.1.26.cloudera.2.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/jline-0.9.94.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/jsch-0.1.42.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/jsp-api-2.1.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/jsr305-1.3.9.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/junit-4.8.2.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/kfs-0.2.2.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/kfs-0.3.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/log4j-1.2.17.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/mockito-all-1.8.5.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/paranamer-2.3.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/protobuf-java-2.4.0a.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/servlet-api-2.5.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/slf4j-api-1.6.1.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/snappy-java-1.0.4.1.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/stax-api-1.0.1.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/xmlenc-0.52.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/zookeeper-3.4.5-cdh4.2.1.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/jsp-2.1/jsp-2.1.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/jsp-2.1/jsp-api-2.1.jar:/usr/share/cmf/lib/plugins/navigator-plugin-4.5.2-shaded.jar:/usr/share/cmf/lib/plugins/tt-instrumentation-4.5.2.jar:/usr/share/cmf/lib/plugins/event-publish-4.5.2-shaded.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-hdfs/lib/jackson-mapper-asl-1.8.8.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-hdfs/lib/commons-io-2.1.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-hdfs/lib/commons-logging-1.1.1.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-hdfs/lib/xmlenc-0.52.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-hdfs/lib/jasper-runtime-5.5.23.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-hdfs/lib/servlet-api-2.5.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-hdfs/lib/zookeeper-3.4.5-cdh4.2.1.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-hdfs/lib/jline-0.9.94.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-hdfs/lib/jackson-core-asl-1.8.8.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-hdfs/lib/jsp-api-2.1.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-hdfs/lib/guava-11.0.2.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-hdfs/lib/commons-cli-1.2.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-hdfs/lib/asm-3.2.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-hdfs/lib/commons-daemon-1.0.3.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-hdfs/lib/log4j-1.2.17.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-hdfs/lib/protobuf-java-2.4.0a.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-hdfs/lib/commons-el-1.0.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-hdfs/lib/jetty-util-6.1.26.cloudera.2.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-hdfs/lib/jersey-core-1.8.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-hdfs/lib/jersey-server-1.8.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-hdfs/lib/commons-codec-1.4.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-hdfs/lib/jetty-6.1.26.cloudera.2.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-hdfs/lib/jsr305-1.3.9.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-hdfs/lib/commons-lang-2.5.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-hdfs/hadoop-hdfs-2.0.0-cdh4.2.1.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-hdfs/hadoop-hdfs.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-hdfs/hadoop-hdfs-2.0.0-cdh4.2.1-tests.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/lib/jackson-mapper-asl-1.8.8.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/lib/commons-io-2.1.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/lib/commons-logging-1.1.1.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/lib/xmlenc-0.52.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/lib/jasper-runtime-5.5.23.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/lib/servlet-api-2.5.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/lib/jackson-jaxrs-1.8.8.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/lib/zookeeper-3.4.5-cdh4.2.1.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/lib/jline-0.9.94.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/lib/commons-collections-3.2.1.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/lib/commons-httpclient-3.1.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/lib/jackson-core-asl-1.8.8.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/lib/jsp-api-2.1.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/lib/guava-11.0.2.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/lib/slf4j-log4j12-1.6.1.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/lib/commons-net-3.1.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/lib/commons-cli-1.2.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/lib/commons-configuration-1.6.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/lib/commons-digester-1.8.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/lib/jasper-compiler-5.5.23.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/lib/commons-beanutils-1.7.0.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/lib/snappy-java-1.0.4.1.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/lib/jaxb-impl-2.2.3-1.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/lib/slf4j-api-1.6.1.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/lib/paranamer-2.3.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/lib/asm-3.2.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/lib/junit-4.8.2.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/lib/commons-beanutils-core-1.8.0.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/lib/jersey-json-1.8.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/lib/commons-math-2.1.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/lib/log4j-1.2.17.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/lib/protobuf-java-2.4.0a.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/lib/commons-el-1.0.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/lib/mockito-all-1.8.5.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/lib/jettison-1.1.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/lib/activation-1.1.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/lib/jsch-0.1.42.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/lib/jetty-util-6.1.26.cloudera.2.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/lib/jersey-core-1.8.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/lib/hue-plugins-2.2.0-cdh4.2.1.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/lib/avro-1.7.3.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/lib/jersey-server-1.8.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/lib/jaxb-api-2.2.2.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/lib/jackson-xc-1.8.8.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/lib/stax-api-1.0.1.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/lib/commons-codec-1.4.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/lib/jets3t-0.6.1.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/lib/jetty-6.1.26.cloudera.2.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/lib/jsr305-1.3.9.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/lib/kfs-0.3.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/lib/commons-lang-2.5.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/hadoop-common-2.0.0-cdh4.2.1.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/hadoop-auth.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/hadoop-auth-2.0.0-cdh4.2.1.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/hadoop-common-2.0.0-cdh4.2.1-tests.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/hadoop-annotations-2.0.0-cdh4.2.1.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/hadoop-common.jar:/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop/hadoop-annotations.jar:/ebs/mapred/local/taskTracker/hadoopuser/jobcache/job_201305081039_0028/jars/classes:/ebs/mapred/local/taskTracker/hadoopuser/jobcache/job_201305081039_0028/jars/job.jar:/ebs/mapred/local/taskTracker/distcache/-3456193334429412951_402957723_479460206/ip-10-122-5-26.ap-northeast-1.compute.internal/tmp/temp-651112871/tmp869044128/piggybank.jar:/ebs/mapred/local/taskTracker/distcache/-7337653544631754240_-1468589137_479460237/ip-10-122-5-26.ap-northeast-1.compute.internal/tmp/temp-651112871/tmp1875426668/datafu.jar:/ebs/mapred/local/taskTracker/distcache/-1402638602126841415_-1435345198_479460330/ip-10-122-5-26.ap-northeast-1.compute.internal/tmp/temp-651112871/tmp-1838925944/hbase-0.94.2-cdh4.2.1-security.jar:/ebs/mapred/local/taskTracker/distcache/7452800711435023543_2121443610_479460382/ip-10-122-5-26.ap-northeast-1.compute.internal/tmp/temp-651112871/tmp-1661097048/zookeeper-3.4.5-cdh4.2.1.jar:/ebs/mapred/local/taskTracker/hadoopuser/distcache/-6387295021460789984_342618622_479466414/ip-10-122-5-26.ap-northeast-1.compute.internal/user/hadoopuser/.staging/job_201305081039_0028/libjars/zookeeper-3.4.5-cdh4.2.1.jar:/ebs/mapred/local/taskTracker/hadoopuser/distcache/-4928843957969028386_439824881_479466462/ip-10-122-5-26.ap-northeast-1.compute.internal/user/hadoopuser/.staging/job_201305081039_0028/libjars/guava-11.0.2.jar:/ebs/mapred/local/taskTracker/hadoopuser/distcache/5720329722789448786_1097322137_479466543/ip-10-122-5-26.ap-northeast-1.compute.internal/user/hadoopuser/.staging/job_201305081039_0028/libjars/hbase-0.94.2-cdh4.2.1-security.jar:/ebs/mapred/local/taskTracker/hadoopuser/jobcache/job_201305081039_0028/attempt_201305081039_0028_m_000010_0/work
2013-05-13 06:29:20,778 INFO org.apache.zookeeper.ZooKeeper: Client
environment:java.library.path=/opt/cloudera/parcels/CDH-4.2.1-1.cdh4.2.1.p0.5/lib/hadoop-0.20-mapreduce/lib/native/Linux-amd64-64:/ebs/mapred/local/taskTracker/hadoopuser/jobcache/job_201305081039_0028/attempt_201305081039_0028_m_000010_0/work
2013-05-13 06:29:20,778 INFO org.apache.zookeeper.ZooKeeper: Client
environment:java.io.tmpdir=/ebs/mapred/local/taskTracker/hadoopuser/jobcache/job_201305081039_0028/attempt_201305081039_0028_m_000010_0/work/tmp
2013-05-13 06:29:20,778 INFO org.apache.zookeeper.ZooKeeper: Client
environment:java.compiler=<NA>
2013-05-13 06:29:20,778 INFO org.apache.zookeeper.ZooKeeper: Client
environment:os.name=Linux
2013-05-13 06:29:20,778 INFO org.apache.zookeeper.ZooKeeper: Client
environment:os.arch=amd64
2013-05-13 06:29:20,778 INFO org.apache.zookeeper.ZooKeeper: Client
environment:os.version=3.2.0-36-virtual
2013-05-13 06:29:20,778 INFO org.apache.zookeeper.ZooKeeper: Client
environment:user.name=mapred
2013-05-13 06:29:20,778 INFO org.apache.zookeeper.ZooKeeper: Client
environment:user.home=/var/lib/hadoop-mapreduce
2013-05-13 06:29:20,778 INFO org.apache.zookeeper.ZooKeeper: Client
environment:user.dir=/ebs/mapred/local/taskTracker/hadoopuser/jobcache/job_201305081039_0028/attempt_201305081039_0028_m_000010_0/work
2013-05-13 06:29:20,779 INFO org.apache.zookeeper.ZooKeeper: Initiating client
connection, connectString=ip-10-122-5-26.ap-northeast-1.compute.internal:2181
sessionTimeout=60000 watcher=hconnection
2013-05-13 06:29:20,879 INFO
org.apache.hadoop.hbase.zookeeper.RecoverableZooKeeper: The identifier of this
process is 22865@ip-10-122-3-220
2013-05-13 06:29:20,891 INFO org.apache.zookeeper.ClientCnxn: Opening socket
connection to server
ip-10-122-5-26.ap-northeast-1.compute.internal/10.122.5.26:2181. Will not
attempt to authenticate using SASL (Unable to locate a login configuration)
2013-05-13 06:29:20,896 INFO org.apache.zookeeper.ClientCnxn: Socket connection
established to ip-10-122-5-26.ap-northeast-1.compute.internal/10.122.5.26:2181,
initiating session
2013-05-13 06:29:20,914 INFO org.apache.zookeeper.ClientCnxn: Session
establishment complete on server
ip-10-122-5-26.ap-northeast-1.compute.internal/10.122.5.26:2181, sessionid =
0x13e83a6bd97045c, negotiated timeout = 60000
2013-05-13 06:29:21,328 WARN org.apache.hadoop.conf.Configuration:
hadoop.native.lib is deprecated. Instead, use io.native.lib.available
2013-05-13 06:29:21,705 INFO
org.apache.pig.backend.hadoop.hbase.HBaseTableInputFormat: setScan with ranges:
1608144425504356317089131618754481167815455570216331582650421 -
1608144425504356338779411601026123924953063215065535954039604 (
21690279982271642757137607644849204371389183)
2013-05-13 06:29:21,776 INFO
org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigRecordReader:
Current split being processed
ip-10-122-3-220.ap-northeast-1.compute.internal:1358368301&crown&5222885,1358371115&crown&5601434
2013-05-13 06:29:21,788 INFO org.apache.hadoop.mapred.MapTask: Map output
collector class = org.apache.hadoop.mapred.MapTask$MapOutputBuffer
2013-05-13 06:29:21,795 INFO org.apache.hadoop.mapred.MapTask: io.sort.mb = 1024
2013-05-13 06:29:24,102 INFO org.apache.hadoop.mapred.MapTask: data buffer =
816043776/1020054736
2013-05-13 06:29:24,102 INFO org.apache.hadoop.mapred.MapTask: record buffer =
2684354/3355443
2013-05-13 06:29:24,118 INFO org.apache.pig.impl.util.SpillableMemoryManager:
first memory handler call- Usage threshold init = 163708928(159872K) used =
1020054752(996147K) committed = 1073741824(1048576K) max = 1073741824(1048576K)
2013-05-13 06:29:24,123 INFO org.apache.pig.impl.util.SpillableMemoryManager:
first memory handler call - Collection threshold init = 163708928(159872K) used
= 1022801224(998829K) committed = 1073741824(1048576K) max =
1073741824(1048576K)
2013-05-13 06:29:24,137 WARN org.apache.hadoop.conf.Configuration:
dfs.df.interval is deprecated. Instead, use fs.df.interval
2013-05-13 06:29:24,138 WARN org.apache.hadoop.conf.Configuration:
dfs.max.objects is deprecated. Instead, use dfs.namenode.max.objects
2013-05-13 06:29:24,138 WARN org.apache.hadoop.conf.Configuration: dfs.data.dir
is deprecated. Instead, use dfs.datanode.data.dir
2013-05-13 06:29:24,138 WARN org.apache.hadoop.conf.Configuration: dfs.name.dir
is deprecated. Instead, use dfs.namenode.name.dir
2013-05-13 06:29:24,138 WARN org.apache.hadoop.conf.Configuration:
fs.default.name is deprecated. Instead, use fs.defaultFS
2013-05-13 06:29:24,138 WARN org.apache.hadoop.conf.Configuration:
fs.checkpoint.dir is deprecated. Instead, use dfs.namenode.checkpoint.dir
2013-05-13 06:29:24,138 WARN org.apache.hadoop.conf.Configuration:
dfs.block.size is deprecated. Instead, use dfs.blocksize
2013-05-13 06:29:24,138 WARN org.apache.hadoop.conf.Configuration:
dfs.access.time.precision is deprecated. Instead, use
dfs.namenode.accesstime.precision
2013-05-13 06:29:24,138 WARN org.apache.hadoop.conf.Configuration:
dfs.replication.min is deprecated. Instead, use dfs.namenode.replication.min
2013-05-13 06:29:24,138 WARN org.apache.hadoop.conf.Configuration:
dfs.name.edits.dir is deprecated. Instead, use dfs.namenode.edits.dir
2013-05-13 06:29:24,139 WARN org.apache.hadoop.conf.Configuration:
dfs.replication.considerLoad is deprecated. Instead, use
dfs.namenode.replication.considerLoad
2013-05-13 06:29:24,139 WARN org.apache.hadoop.conf.Configuration:
dfs.balance.bandwidthPerSec is deprecated. Instead, use
dfs.datanode.balance.bandwidthPerSec
2013-05-13 06:29:24,139 WARN org.apache.hadoop.conf.Configuration:
dfs.safemode.threshold.pct is deprecated. Instead, use
dfs.namenode.safemode.threshold-pct
2013-05-13 06:29:24,139 WARN org.apache.hadoop.conf.Configuration:
dfs.http.address is deprecated. Instead, use dfs.namenode.http-address
2013-05-13 06:29:24,139 WARN org.apache.hadoop.conf.Configuration:
dfs.name.dir.restore is deprecated. Instead, use dfs.namenode.name.dir.restore
2013-05-13 06:29:24,139 WARN org.apache.hadoop.conf.Configuration:
dfs.https.client.keystore.resource is deprecated. Instead, use
dfs.client.https.keystore.resource
2013-05-13 06:29:24,139 WARN org.apache.hadoop.conf.Configuration:
dfs.backup.address is deprecated. Instead, use dfs.namenode.backup.address
2013-05-13 06:29:24,139 WARN org.apache.hadoop.conf.Configuration:
dfs.backup.http.address is deprecated. Instead, use
dfs.namenode.backup.http-address
2013-05-13 06:29:24,139 WARN org.apache.hadoop.conf.Configuration:
dfs.permissions is deprecated. Instead, use dfs.permissions.enabled
2013-05-13 06:29:24,140 WARN org.apache.hadoop.conf.Configuration:
dfs.safemode.extension is deprecated. Instead, use
dfs.namenode.safemode.extension
2013-05-13 06:29:24,140 WARN org.apache.hadoop.conf.Configuration:
dfs.datanode.max.xcievers is deprecated. Instead, use
dfs.datanode.max.transfer.threads
2013-05-13 06:29:24,140 WARN org.apache.hadoop.conf.Configuration:
dfs.https.need.client.auth is deprecated. Instead, use
dfs.client.https.need-auth
2013-05-13 06:29:24,140 WARN org.apache.hadoop.conf.Configuration:
dfs.https.address is deprecated. Instead, use dfs.namenode.https-address
2013-05-13 06:29:24,140 WARN org.apache.hadoop.conf.Configuration:
dfs.replication.interval is deprecated. Instead, use
dfs.namenode.replication.interval
2013-05-13 06:29:24,140 WARN org.apache.hadoop.conf.Configuration:
fs.checkpoint.edits.dir is deprecated. Instead, use
dfs.namenode.checkpoint.edits.dir
2013-05-13 06:29:24,140 WARN org.apache.hadoop.conf.Configuration:
dfs.write.packet.size is deprecated. Instead, use dfs.client-write-packet-size
2013-05-13 06:29:24,140 WARN org.apache.hadoop.conf.Configuration:
dfs.permissions.supergroup is deprecated. Instead, use
dfs.permissions.superusergroup
2013-05-13 06:29:24,140 WARN org.apache.hadoop.conf.Configuration:
topology.script.number.args is deprecated. Instead, use
net.topology.script.number.args
2013-05-13 06:29:24,141 WARN org.apache.hadoop.conf.Configuration:
dfs.umaskmode is deprecated. Instead, use fs.permissions.umask-mode
2013-05-13 06:29:24,141 WARN org.apache.hadoop.conf.Configuration:
dfs.secondary.http.address is deprecated. Instead, use
dfs.namenode.secondary.http-address
2013-05-13 06:29:24,141 WARN org.apache.hadoop.conf.Configuration:
fs.checkpoint.period is deprecated. Instead, use dfs.namenode.checkpoint.period
2013-05-13 06:29:24,141 WARN org.apache.hadoop.conf.Configuration:
topology.node.switch.mapping.impl is deprecated. Instead, use
net.topology.node.switch.mapping.impl
2013-05-13 06:29:24,141 WARN org.apache.hadoop.conf.Configuration:
io.bytes.per.checksum is deprecated. Instead, use dfs.bytes-per-checksum
2013-05-13 06:29:24,207 INFO org.apache.pig.data.SchemaTupleBackend: Key
[pig.schematuple] was not set... will not generate code.
2013-05-13 06:29:24,678 INFO
org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigGenericMapReduce$Map:
Aliases being processed per job phase (AliasName[line,offset]): M:
bet_records[7,14],bet_records_iso[-1,-1],bet_result[-1,-1],bet_records_group[11,20]
C: bet_result[-1,-1],bet_records_group[11,20] R: bet_result[-1,-1]
2013-05-13 06:30:02,971 INFO org.apache.hadoop.mapred.MapTask: Starting flush
of map output
2013-05-13 06:30:03,147 INFO org.apache.hadoop.io.compress.CodecPool: Got
brand-new compressor [.snappy]
2013-05-13 06:30:03,394 INFO
org.apache.pig.backend.hadoop.executionengine.mapReduceLayer.PigCombiner$Combine:
Aliases being processed per job phase (AliasName[line,offset]): M:
bet_records[7,14],bet_records_iso[-1,-1],bet_result[-1,-1],bet_records_group[11,20]
C: bet_result[-1,-1],bet_records_group[11,20] R: bet_result[-1,-1]
2013-05-13 06:37:41,107 INFO org.apache.zookeeper.ClientCnxn: Unable to read
additional data from server sessionid 0x13e83a6bd97045c, likely server has
closed socket, closing socket connection and attempting reconnect
2013-05-13 06:38:13,251 INFO org.apache.zookeeper.ClientCnxn: Opening socket
connection to server
ip-10-122-5-26.ap-northeast-1.compute.internal/10.122.5.26:2181. Will not
attempt to authenticate using SASL (Unable to locate a login configuration)
2013-05-13 06:38:13,252 INFO org.apache.zookeeper.ClientCnxn: Socket connection
established to ip-10-122-5-26.ap-northeast-1.compute.internal/10.122.5.26:2181,
initiating session