Hi Gopal - actually no., the table is not partitioned/bucketed.
Everyday the whole table gets cleaned up and populated with last 120 days'
data...
What are the other properties I can try to improve the performance of
reduce steps...?
Suresh V
http://www.justbirds.in
On Sat, Jan 9, 2016 at 8:52
Hi Mich
We have to use TEZ as the engine since the data volume is high and with MR
it takes several hours.
With TEZ it used to take about an hour max.
Thanks
Suresh.
On Sat, Jan 9, 2016 at 7:34 AM, Mich Talebzadeh wrote:
> Hi Suresh,
>
>
>
> I have the same issue when I use Hive on Spark.
>
>
Hi,
> The job completes fine if we reduce the # of rows processed by reducing
>the # of days data being processed.
>
> It just gets stuck after all maps are completed. We checked the logs and
>it says the containers are released.
Looks like you're inserting into a bucketed & partitioned table an
Hi Suresh,
I have the same issue when I use Hive on Spark.
What normally works is Hive on MR. Have you tried:
set hive.execution.engine=mr;
Sounds like it times out for one reason or other!
From: Suresh V [mailto:verdi...@gmail.com]
Sent: 09 January 2016 11:35
To: user@hive.apa