Hi,

I am trying to kick off a mapreduce job via WebHCat.  The following is the
hadoop jar command.

hadoop jar
/home/hadoop/camus-non-avro-consumer-1.0-SNAPSHOT-jar-with-dependencies.jar
com.linkedin.camus.etl.kafka.CamusJob -P
/home/hadoop/camus_non_avro.properties

As you can see there is an application specific parameter '-P' which
designates the properties file location.  How do I pass this to WebHCat?

Referring to the docs (
https://cwiki.apache.org/confluence/display/Hive/WebHCat+Reference+MapReduceJar)
I came up with the following.

curl -s -d user.name=hadoop \
       -d
jar=/tmp/camus-non-avro-consumer-1.0-SNAPSHOT-jar-with-dependencies.jar \
       -d class=com.linkedin.camus.etl.kafka.CamusJob \
       -d arg=/tmp/camus_non_avro.properties \
       '
http://internal-daalt-hcatalog-1507773817.us-east-1.elb.amazonaws.com/templeton/v1/mapreduce/jar
'

This command gets the following response from WebHCat
{"id":"job_201312212124_0161"}

However I only see TempletonControllerJob in the jobtracker UI.  I don't
see the Camus jobs that will show up if executed at the command-line.

The following are the only things showing in webhcat.log


The jar and properties files are in the /tmp directory on HDFS.

hadoop fs -ls /tmp
-rw-r--r--   2 hadoop supergroup   41456481 2013-12-27 17:45
/tmp/camus-non-avro-consumer-1.0-SNAPSHOT-jar-with-dependencies.jar
-rw-r--r--   2 hadoop supergroup       2605 2013-12-27 17:45
/tmp/camus_non_avro.properties

Reply via email to