hive on spark ,three tables(one is small, others are big),cannot go mapjoin

2017-01-03 Thread Maria
Hi,all I have a doubt: my test hql is : "select tmp.src_ip,c.to_ip from (select a.src_ip,b.appid from small_tbl a join im b on a.src_ip=b.src_ip) tmp join email c on tmp.appid=c.appid" , im and email are bigtables, set hive.execution.engine=mr, the execution plan generated two mapjoin stage,

Re: Zero Bytes Files importance

2017-01-03 Thread Gopal Vijayaraghavan
> Thanks Gopal. Yeah I'm using CloudBerry.  Storage is Azure. Makes sense, only an object store would have this. > Are you saying this _0,1,2,3 are directories ?. No, only the zero size "files". This is really for compat with regular filesystems. If you have /tmp/1/foo in an object store

Re: please give me the permission to update the wiki of hive on spark

2017-01-03 Thread Lefty Leverenz
Done. Welcome to the Hive wiki team, Kelly, and happy new year! -- Lefty On Mon, Jan 2, 2017 at 5:40 PM, Zhang, Liyun wrote: > Hi > > I want to update wiki confluence/display/Hive/Hive+on+Spark%3A+Getting+Started> of hive on > spark because HIVE-8373, my Conflue

Re: Zero Bytes Files importance

2017-01-03 Thread Mahender Sarangam
Thanks Gopal. Yeah I'm using CloudBerry. Storage is Azure. Are you saying this _0,1,2,3 are directories ?. [cid:part1.C19351C7.1FC7FD04@outlook.com] On 12/29/2016 11:18 AM, Gopal Vijayaraghavan wrote: For any insert operation, there will be one Zero bytes file. I would like to know im