Hello, I just started hive today. Following instructions I did set it up, and made it work to play with my web server log files.
I created two tables: CREATE TABLE item_view(view_time BIGINT, ip_number STRING, session_id STRING, session_cookie STRING, referrer_url STRING, eser_sid INT, sale_status INT, maker_name STRING, title STRING) CLUSTERED BY(eser_sid) SORTED BY(view_time) INTO 32 BUCKETS row format delimited fields terminated by '\t' lines terminated BY '\n' STORED AS SEQUENCEFILE; and CREATE TABLE item_view_raw(view_time BIGINT,log_level STRING, log_tag STRING, ip_number STRING, session_id STRING, session_cookie STRING, referrer_url STRING, agent_string STRING, eser_sid INT, sale_status INT, maker_name STRING, title STRING) row format delimited fields terminated by '\t' lines terminated BY '\n' STORED AS TEXTFILE; then I load data into item_view_raw table: LOAD DATA LOCAL INPATH '/tmp/prod.log' INTO TABLE item_view_raw; whıch puts the data item_view_raw table. this is the staging table, and i want to insert rows from it into my item_view table so I: FROM item_view_raw ivr INSERT OVERWRITE TABLE item_view SELECT ivr.view_time, ivr.ip_number, ivr.session_id, ivr.session_cookie, ivr.referrer_url, ivr.eser_sid, ivr.sale_status, ivr.maker_name, ivr.title; but I get: Total MapReduce jobs = 2 Launching Job 1 out of 2 Number of reduce tasks is set to 0 since there's no reduce operator java.io.IOException: Call to /192.168.1.10:50030 failed on local exception: java.io.EOFException at org.apache.hadoop.ipc.Client.wrapException(Client.java:775) at org.apache.hadoop.ipc.Client.call(Client.java:743) at org.apache.hadoop.ipc.RPC$Invoker.invoke(RPC.java:220) at org.apache.hadoop.mapred.$Proxy5.getProtocolVersion(Unknown Source) at org.apache.hadoop.ipc.RPC.getProxy(RPC.java:359) at org.apache.hadoop.mapred.JobClient.createRPCProxy(JobClient.java:429) at org.apache.hadoop.mapred.JobClient.init(JobClient.java:423) at org.apache.hadoop.mapred.JobClient.<init>(JobClient.java:410) at org.apache.hadoop.hive.ql.exec.ExecDriver.execute(ExecDriver.java:683) at org.apache.hadoop.hive.ql.exec.Task.executeTask(Task.java:107) at org.apache.hadoop.hive.ql.exec.TaskRunner.runSequential(TaskRunner.java:55) at org.apache.hadoop.hive.ql.Driver.launchTask(Driver.java:633) at org.apache.hadoop.hive.ql.Driver.execute(Driver.java:506) at org.apache.hadoop.hive.ql.Driver.run(Driver.java:384) at org.apache.hadoop.hive.cli.CliDriver.processCmd(CliDriver.java:138) at org.apache.hadoop.hive.cli.CliDriver.processLine(CliDriver.java:197) at org.apache.hadoop.hive.cli.CliDriver.main(CliDriver.java:302) at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method) at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39) at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25) at java.lang.reflect.Method.invoke(Method.java:597) at org.apache.hadoop.util.RunJar.main(RunJar.java:156) Caused by: java.io.EOFException at java.io.DataInputStream.readInt(DataInputStream.java:375) at org.apache.hadoop.ipc.Client$Connection.receiveResponse(Client.java:508) at org.apache.hadoop.ipc.Client$Connection.run(Client.java:446) Job Submission failed with exception 'java.io.IOException(Call to /192.168.1.10:50030 failed on local exception: java.io.EOFException)' FAILED: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.ExecDriver I did fiddle with it for couple of hours, then I decided to write to this list. no matter what I tried i could not solve the problem. Any ideas, recomendations and help greatly appreciated, -C.B.