Here is my query:
select table1.column10, table1.column11, count(distinct table2.column3) as
total from table1 JOIN table2 ON (table1.column1='value1' and
to_date(table1.column2) >= '2011-01-06' and to_date(table1.column2) <
'2011-01-07' and table2.column3!='' and table2.column3 is NOT NULL and
What are your OR conditions? Do they involve both tables? Can you post your
query here?
On Mar 23, 2011, at 12:04 AM, MIS wrote:
> Ning, Thanks for the reply.
> Yes, you are right. Using NOT and AND didn't work as expected.
> I'll try implementing a nested-loop map-side join.
>
> In t
Ning, Thanks for the reply.
Yes, you are right. Using NOT and AND didn't work as expected.
I'll try implementing a nested-loop map-side join.
In the meantime, I moved the OR expression out of the JOIN
expression and into the filtering expression {in the WHERE clause
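
A minimal sketch of that rewrite, for anyone following along. It uses SQLite through Python's sqlite3 module as a stand-in, not Hive, and the tables t1/t2 and columns k, a, b, c are hypothetical (they are not the columns from the query in this thread). It only shows that, for an inner join, keeping the equality key in ON and moving the OR filter to WHERE returns the same rows. That equivalence does not carry over to outer joins.

```python
import sqlite3

# In-memory SQLite database used purely as a stand-in for Hive.
conn = sqlite3.connect(":memory:")
conn.executescript("""
CREATE TABLE t1 (k TEXT, a TEXT, b TEXT);
CREATE TABLE t2 (k TEXT, c TEXT);
INSERT INTO t1 VALUES ('1', 'x', 'p'), ('2', 'y', 'q'), ('3', 'z', 'r');
INSERT INTO t2 VALUES ('1', 'u'), ('2', 'v'), ('3', 'w');
""")

# Variant 1: the OR filter sits inside the JOIN ... ON expression.
or_in_on = conn.execute("""
SELECT t1.k, t2.c
FROM t1 JOIN t2
  ON (t1.k = t2.k AND (t1.a = 'x' OR t1.b = 'q'))
ORDER BY t1.k
""").fetchall()

# Variant 2: only the equality key stays in ON; the OR filter moves to WHERE.
or_in_where = conn.execute("""
SELECT t1.k, t2.c
FROM t1 JOIN t2
  ON (t1.k = t2.k)
WHERE (t1.a = 'x' OR t1.b = 'q')
ORDER BY t1.k
""").fetchall()

print(or_in_on)
print(or_in_where)
```

The second form leaves a plain equality in the join condition, so the join itself can still be partitioned on the key, and the OR is evaluated afterwards as an ordinary row filter.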
Hive does not currently support joins with OR conditions. I think that even
if you rewrite the condition to use only NOT and AND, the results may be
wrong.
It is quite hard to implement joins of arbitrary tables with OR conditions in a
MapReduce framework. It is straightforward to implement it i
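
To illustrate the nested-loop idea mentioned in this thread, here is a rough sketch in plain Python, not Hive code. It assumes one table is small enough to hold in memory, and the row layout (keys a, b, x, y) and the function names are made up for the example. With an OR predicate there is no single key to partition rows on, so every row of the large input is compared against every row of the small one.

```python
from typing import Callable, Dict, Iterable, Iterator, List, Tuple

Row = Dict[str, object]


def nested_loop_join(
    big: Iterable[Row],
    small: List[Row],
    predicate: Callable[[Row, Row], bool],
) -> Iterator[Tuple[Row, Row]]:
    """Yield every (big_row, small_row) pair that satisfies the predicate.

    The small side is kept fully in memory and rescanned for each big row,
    so the cost is len(big) * len(small) predicate evaluations.
    """
    for left in big:
        for right in small:
            if predicate(left, right):
                yield left, right


def or_predicate(left: Row, right: Row) -> bool:
    # Hypothetical OR join condition: match on either of two column pairs.
    return left["a"] == right["x"] or left["b"] == right["y"]


big_rows = [
    {"id": 1, "a": "k1", "b": "m9"},
    {"id": 2, "a": "k7", "b": "m2"},
    {"id": 3, "a": "k0", "b": "m0"},
]
small_rows = [
    {"x": "k1", "y": "m2"},
    {"x": "k7", "y": "m5"},
]

matches = [
    (left["id"], right["x"])
    for left, right in nested_loop_join(big_rows, small_rows, or_predicate)
]
print(matches)
```

Note that row 2 matches both small rows, one through each branch of the OR, which is the kind of fan-out that grows quickly on real data.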
Found it at *org.apache.hadoop.hive.ql.parse.SemanticAnalyzer.java* line
no. 1122
There is a concern mentioned that supporting OR would lead to data
explosion. Is this discussed or documented in more detail somewhere? If
so, some pointers would be helpful.
Thanks,
MIS.
On