HSQL integration

2011-11-15 Thread Ghousia
Hi, We are planning to use HSQL as a metastore. Has anyone tried using HSQL? Are there any issues with using it? Our design is such that each query will have its own DDL/DMLs. Thanks, -Ghousia.

Re: Hive HFileOutput error

2011-11-15 Thread Ben West
Never mind, this was resolved by adding a few more jars. Now I am getting a "file not found" error... - Original Message - From: Ben West To: "user@hive.apache.org" Cc: Sent: Tuesday, November 15, 2011 3:45 PM Subject: Hive HFileOutput error Hey all, I'm just getting started with Hi

Hive HFileOutput error

2011-11-15 Thread Ben West
Hey all, I'm just getting started with Hive, and am trying to follow the instructions on https://cwiki.apache.org/confluence/display/Hive/HBaseBulkLoad. I can create my table fine, but when I run insert overwrite table hb3 select key, val from hb2 cluster by key; the map phase runs fine, but

Re: Asynchronous query exection

2011-11-15 Thread Wouter de Bie
Another way would be using Hive server. This will execute multiple queries in parallel. --Wouter de Bie Team Lead Analytics Infrastructure, Spotify wou...@spotify.com +46 72 018 0777 On Tuesday, November 15, 2011 at 2:48 PM, Sam Wilson wrote: > If you go this route,

Re: Asynchronous query exection

2011-11-15 Thread Sam Wilson
If you go this route, you may want to use nohup. This way your processes will continue running even if you lose connection to your terminal session. Other options: 1) You can write your queries to a DB/Queue and have a process running on the Hive server that reads from the DB/queue and runs the
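
The first option above (a process that reads queries from a queue and runs them) could look roughly like the following. This is only a minimal sketch: an in-process queue.Queue stands in for the DB/queue, and the names run_with_hive_cli and drain_queue are hypothetical, not part of Hive. It assumes the hive CLI is on the PATH and uses its -e option to run a query string.

```python
import queue
import subprocess
from typing import Callable, List, Optional, Tuple


def run_with_hive_cli(query: str) -> int:
    """Run one query through the Hive CLI (`hive -e`) and return its exit code.

    Assumes a `hive` executable is available on the PATH.
    """
    return subprocess.call(["hive", "-e", query])


def drain_queue(
    pending: "queue.Queue[Optional[str]]",
    run_query: Callable[[str], int] = run_with_hive_cli,
) -> List[Tuple[str, int]]:
    """Run queued queries one at a time until a None sentinel arrives.

    Returns a list of (query, exit_code) pairs in the order they were run.
    """
    finished: List[Tuple[str, int]] = []
    while True:
        query = pending.get()  # blocks until a producer adds an item
        if query is None:      # sentinel: no more work
            break
        finished.append((query, run_query(query)))
    return finished
```

Because the caller only puts a query on the queue and moves on, submitting is non-blocking from its point of view. A worker like this could itself be started under nohup so that it keeps running after the terminal session is lost.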

Re: Asynchronous query exection

2011-11-15 Thread Mark Grover
Hi Ghousia, I believe that Hadoop uses a FIFO scheduler for MapReduce jobs by default. As a consequence, the second MapReduce job wouldn't get started until the first one is done. If you are using the FIFO scheduler in Hadoop, even if you issue 2 Hive queries, the second one's MapReduce j

Re: Asynchronous query exection

2011-11-15 Thread Mapred Learn
You could write your query to a file and do something like: hive -f & hive -f & etc. to invoke many instances in parallel. On Tue, Nov 15, 2011 at 3:24 AM, Chinna Rao Lalam wrote: > Hi, > > > > Hive calls are blocking calls because once the query is executed it will > return the ResultSet
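
The same idea (several hive -f invocations started in the background) can be sketched from a script. This is an illustration only: launch_parallel and hive_file_command are hypothetical helper names, the script paths are placeholders, and a hive executable is assumed to be on the PATH.

```python
import subprocess
from typing import List, Sequence


def hive_file_command(script_path: str, hive_bin: str = "hive") -> List[str]:
    """Build the argument list for `hive -f <script_path>`."""
    return [hive_bin, "-f", script_path]


def launch_parallel(commands: Sequence[Sequence[str]]) -> List[int]:
    """Start every command without waiting, then collect the exit codes.

    All processes are started first, so they run concurrently; the exit
    codes are returned in the same order as the commands.
    """
    procs = [subprocess.Popen(list(cmd)) for cmd in commands]
    return [proc.wait() for proc in procs]


# Example usage (placeholder file names):
#   codes = launch_parallel([hive_file_command("q1.hql"),
#                            hive_file_command("q2.hql")])
```

Note that starting the clients in parallel does not by itself guarantee that the resulting MapReduce jobs run concurrently; that depends on the scheduler configured on the cluster.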

RE: Asynchronous query exection

2011-11-15 Thread Chinna Rao Lalam
Hi, Hive calls are blocking calls: once the query is executed it returns a ResultSet, and you get the results from that result set. The "hive.exec.parallel" property helps speed up query execution if the query generates more than one independent task. If it generates
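
As a rough illustration, the property can also be passed for a single invocation instead of being set in hive-site.xml. The sketch below only builds the command line; hive_parallel_command is a hypothetical helper name, and it assumes the Hive CLI's --hiveconf and -e options.

```python
from typing import List


def hive_parallel_command(query: str, parallel: bool = True,
                          hive_bin: str = "hive") -> List[str]:
    """Build a Hive CLI command that sets hive.exec.parallel for one query."""
    value = "true" if parallel else "false"
    return [hive_bin, "--hiveconf", f"hive.exec.parallel={value}", "-e", query]
```

The setting only affects independent tasks inside one query; it does not make the client call itself non-blocking.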

Asynchronous query exection

2011-11-15 Thread Ghousia
Hi, Hive queries take a long time to execute, and by default execution is a blocking call. Is there any way provided by the Hive client to support non-blocking execution? Also, to execute jobs in parallel, I tried setting *"hive.exec.parallel"* to true in hive-site.xml. But this did not work. Looking at t