Another way would be using Hive server. This will execute multiple queries in
parallel.
--Wouter de Bie
Team Lead Analytics Infrastructure, Spotify
wou...@spotify.com (mailto:wou...@spotify.com)
+46 72 018 0777
On Tuesday, November 15, 2011 at 2:48 PM, Sam Wilson wrote:
> If you go this route,
If you go this route, you may want to use nohup. This way your processes will
continue running even if you lose connection to your terminal session.
Other options:
1) You can write your queries to a DB/Queue and have a process running on the
Hive server that reads from the DB/queue and runs the
ce/docs/r0.21.0/capacity_scheduler.html
http://hadoop.apache.org/mapreduce/docs/r0.21.0/fair_scheduler.html
Mark
- Original Message -
From: "Mapred Learn"
To: user@hive.apache.org
Sent: Tuesday, November 15, 2011 2:15:59 PM
Subject: Re: Asynchronous query exection
You could write yo
You could write your query to a file and do something like:
hive -f &
hive -f &
etc. to invoke many instances in parallel.
On Tue, Nov 15, 2011 at 3:24 AM, Chinna Rao Lalam wrote:
> Hi,
>
>
>
> Hive calls are blocking calls because once the query is executed it will
> return the ResultSet
Hi,
Hive calls are blocking calls because once the query is executed it will
return the ResultSet from that result set u will get the results.
"hive.exec.parallel" property will helps to speed up the query execution if
the query generates more than one independent tasks. If it generates