Hello,

I just started hive today. Following instructions I did set it up, and
made it work to play with my web server log files.

I created two tables:

CREATE TABLE item_view(view_time BIGINT, ip_number STRING, session_id
STRING, session_cookie STRING, referrer_url STRING, eser_sid INT,
sale_status INT, maker_name STRING, title STRING) CLUSTERED
BY(eser_sid) SORTED BY(view_time) INTO 32 BUCKETS row format delimited
fields terminated by '\t' lines terminated  BY '\n' STORED AS
SEQUENCEFILE;

and

CREATE TABLE item_view_raw(view_time BIGINT,log_level STRING, log_tag
STRING, ip_number STRING, session_id STRING, session_cookie STRING,
referrer_url STRING, agent_string STRING, eser_sid INT, sale_status
INT, maker_name STRING, title STRING) row format delimited fields
terminated by '\t' lines terminated  BY '\n' STORED AS TEXTFILE;

then I load data into item_view_raw table:

LOAD DATA LOCAL INPATH '/tmp/prod.log' INTO TABLE item_view_raw;

whıch puts the data item_view_raw table.

this is the staging table, and i want to insert rows from it into my
item_view table so I:

FROM item_view_raw ivr INSERT OVERWRITE TABLE item_view SELECT
ivr.view_time, ivr.ip_number, ivr.session_id, ivr.session_cookie,
ivr.referrer_url, ivr.eser_sid, ivr.sale_status, ivr.maker_name,
ivr.title;

but I get:

Total MapReduce jobs = 2
Launching Job 1 out of 2
Number of reduce tasks is set to 0 since there's no reduce operator
java.io.IOException: Call to /192.168.1.10:50030 failed on local
exception: java.io.EOFException
        at org.apache.hadoop.ipc.Client.wrapException(Client.java:775)
        at org.apache.hadoop.ipc.Client.call(Client.java:743)
        at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:220)
        at org.apache.hadoop.mapred.$Proxy5.getProtocolVersion(Unknown Source)
        at org.apache.hadoop.ipc.RPC.getProxy(RPC.java:359)
        at org.apache.hadoop.mapred.JobClient.createRPCProxy(JobClient.java:429)
        at org.apache.hadoop.mapred.JobClient.init(JobClient.java:423)
        at org.apache.hadoop.mapred.JobClient.<init>(JobClient.java:410)
        at 
org.apache.hadoop.hive.ql.exec.ExecDriver.execute(ExecDriver.java:683)
        at org.apache.hadoop.hive.ql.exec.Task.executeTask(Task.java:107)
        at 
org.apache.hadoop.hive.ql.exec.TaskRunner.runSequential(TaskRunner.java:55)
        at org.apache.hadoop.hive.ql.Driver.launchTask(Driver.java:633)
        at org.apache.hadoop.hive.ql.Driver.execute(Driver.java:506)
        at org.apache.hadoop.hive.ql.Driver.run(Driver.java:384)
        at org.apache.hadoop.hive.cli.CliDriver.processCmd(CliDriver.java:138)
        at org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:197)
        at org.apache.hadoop.hive.cli.CliDriver.main(CliDriver.java:302)
        at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
        at 
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
        at 
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
        at java.lang.reflect.Method.invoke(Method.java:597)
        at org.apache.hadoop.util.RunJar.main(RunJar.java:156)
Caused by: java.io.EOFException
        at java.io.DataInputStream.readInt(DataInputStream.java:375)
        at 
org.apache.hadoop.ipc.Client$Connection.receiveResponse(Client.java:508)
        at org.apache.hadoop.ipc.Client$Connection.run(Client.java:446)
Job Submission failed with exception 'java.io.IOException(Call to
/192.168.1.10:50030 failed on local exception: java.io.EOFException)'
FAILED: Execution Error, return code 1 from
org.apache.hadoop.hive.ql.exec.ExecDriver

I did fiddle with it for couple of hours, then I decided to write to
this list. no matter what I tried i could not solve the problem.

Any ideas, recomendations and help greatly appreciated,
-C.B.

Reply via email to