Hi,
You’re correct - that is not a valid rewrite.
Both tables have to be shuffled across due to the OR clause with no
reductions.
Cheers,
Gopal
On 5/11/15, 10:43 AM, "Eugene Koifman" wrote:
>This isn’t a valid rewrite.
>if a(x,y) has 1 row (1,2) and b(x,z) has 1 row (1,1) then the 1st query
>
I see only 1 reduce run forerver. Skew join?
r7raul1...@163.com
From: Eugene Koifman
Date: 2015-05-12 01:43
To: user
CC: r7raul1...@163.com
Subject: Re: hive sql on tez run forever
This isn’t a valid rewrite.
if a(x,y) has 1 row (1,2) and b(x,z) has 1 row (1,1) then the 1st query
will
This isn’t a valid rewrite.
if a(x,y) has 1 row (1,2) and b(x,z) has 1 row (1,1) then the 1st query
will produce 1 row
but the 2nd query with subselects will not.
On 5/11/15, 10:13 AM, "Gopal Vijayaraghavan" wrote:
>Hi,
>
>> I change the sql where condition to (where t.update_time >=
>>'2015-05-
15 18:14
To: user
Cc: r7raul1...@163.com
Subject: Re: hive sql on tez run forever
Hi,
> I change the sql where condition to (where t.update_time >=
>'2015-05-04') , the sql can return result for a while. Because
>t.update_time
> >= '2015-05-04'
Hi,
> I change the sql where condition to (where t.update_time >=
>'2015-05-04') , the sql can return result for a while. Because
>t.update_time
> >= '2015-05-04' can filter many row when table scan. But why change
>where condition to
> (where t.update_time >= '2015-05-04' or length(t8.end_user_i