Hi,
We are planning to use HSQL as a metastore. Has anyone tried using hsql. Or
are there any issue with using HSQL, our design is such that each query
will have its own DDL/DMLs.
Thanks,
-Ghousia.
Nevermind, this was resolved by adding a few more jars. Now I am getting a
"file not found" error...
- Original Message -
From: Ben West
To: "user@hive.apache.org"
Cc:
Sent: Tuesday, November 15, 2011 3:45 PM
Subject: Hive HFileOutput error
Hey all,
I'm just getting started with Hi
Hey all,
I'm just getting started with Hive, and am trying to follow the instructions on
https://cwiki.apache.org/confluence/display/Hive/HBaseBulkLoad. I can create my
table fine, but when I runĀ
insert overwrite table hb3 select key, val from hb2 cluster by key;
the map phase runs fine, but
Another way would be using Hive server. This will execute multiple queries in
parallel.
--Wouter de Bie
Team Lead Analytics Infrastructure, Spotify
wou...@spotify.com (mailto:wou...@spotify.com)
+46 72 018 0777
On Tuesday, November 15, 2011 at 2:48 PM, Sam Wilson wrote:
> If you go this route,
If you go this route, you may want to use nohup. This way your processes will
continue running even if you lose connection to your terminal session.
Other options:
1) You can write your queries to a DB/Queue and have a process running on the
Hive server that reads from the DB/queue and runs the
Hi Ghousia,
I believe that Hadoop has a default FIFO scheduler for MapReduce jobs by
default. Also, as a corollary, the second MapReduce job wouldn't get started
until the first one is done. If you are using FIFO scheduler in Hadoop, even if
you issue 2 Hive queries, the second one's MapReduce j
You could write your query to a file and do something like:
hive -f &
hive -f &
etc. to invoke many instances in parallel.
On Tue, Nov 15, 2011 at 3:24 AM, Chinna Rao Lalam wrote:
> Hi,
>
>
>
> Hive calls are blocking calls because once the query is executed it will
> return the ResultSet
Hi,
Hive calls are blocking calls because once the query is executed it will
return the ResultSet from that result set u will get the results.
"hive.exec.parallel" property will helps to speed up the query execution if
the query generates more than one independent tasks. If it generates
Hi,
Hive queries take longer time to execute, and by default it is a blocking
call. Is there any way provided by Hive client to supports non blocking
execution.
Also, to execute jobs parallely, I tried setting the
*"hive.exec.parallel"*to true in hive-site.xml. But this did not work,
Looking at t