Re: Skewed Tables

2014-04-28 Thread Prasanth Jayachandran
Lefty, I have updated the Hive wiki in a few places to say we should use "stored as directories" for the list bucketing feature. There are two different optimizations that use the "SKEWED BY" keyword. One is the skewed join optimization and the other is the list bucketing optimization. I think we need to mention th

Re: Skewed Tables

2014-04-27 Thread Lefty Leverenz
Prasanth, Hive's user docs are wiki-only at this point so there's no version control. We just add notes about which release introduced or changed something. For an example see the beginning of the Skewed Tables

Re: Skewed Tables

2014-04-27 Thread Prasanth Jayachandran
@Mayur.. I don’t think the initial design considered CTAS for skewed tables. So it might not be supported at all. @Lefty.. I am not sure where/how the docs are maintained. Are they version controlled? Or are they only maintained in the Confluence wiki? If it is the latter, can you please provide me access

Re: Skewed Tables

2014-04-26 Thread Mayur Gupta
Hey Prasanth, CTAS for a skewed table doesn't work. Is it a bug? create table t1(r1 string, r2 string) skewed by (r2) on ('a') stored as directories select r1, r2 from t2; On Thu, Apr 24, 2014 at 3:03 PM, Mayur Gupta wrote: > Thanks a lot Prasanth for the reply. I would have never figured tha

Re: Skewed Tables

2014-04-26 Thread Lefty Leverenz
I can point to possible locations but I'm not sure where this belongs. For starters, STORED AS DIRECTORIES needs to be added to the storage format section in the DDL doc and sever

Re: Skewed Tables

2014-04-25 Thread Prasanth Jayachandran
Lefty, I can add this information. Can you please point me to the location to add this? Perhaps you can help review it. Thanks Prasanth Jayachandran On Apr 24, 2014, at 1:13 PM, Lefty Leverenz wrote: > I'm looking at the docs and thinking of ways to include this information. > But Prasa

Re: Skewed Tables

2014-04-24 Thread Lefty Leverenz
I'm looking at the docs and thinking of ways to include this information. But Prasanth, if you want to do it yourself that would be great. -- Lefty On Thu, Apr 24, 2014 at 5:33 AM, Mayur Gupta wrote: > Thanks a lot Prasanth for the reply. I would have never figured that out > as the documentat

Re: Skewed Tables

2014-04-24 Thread Mayur Gupta
Thanks a lot Prasanth for the reply. I would have never figured that out as the documentation at Hive Wiki DDL page and design page doe

Re: Skewed Tables

2014-04-23 Thread Prasanth Jayachandran
Hi Mayur, The reason you see a single file is that you have not enabled storing skewed columns/values as directories. You can do the following to enable storing the skewed columns and values as directories: set hive.mapred.supports.subdirectories=true; set mapred.input.dir.recursive=true; create t
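
For reference, here is a minimal sketch of the setup this message describes. It assumes the table definition quoted elsewhere in this thread (t1 with columns r1 and r2, skewed on r2 for the value 'a') and a source table t2 with the same columns. Because CTAS may not be supported for skewed tables, the sketch creates the table first and loads it with a separate INSERT; that load step is an assumption, not something confirmed in the thread.

```sql
-- Settings named in the reply: allow subdirectories under the table
-- location and have input paths read recursively.
set hive.mapred.supports.subdirectories=true;
set mapred.input.dir.recursive=true;

-- Table definition taken from the statement quoted in this thread.
-- STORED AS DIRECTORIES is the clause that enables list bucketing.
create table t1 (r1 string, r2 string)
skewed by (r2) on ('a')
stored as directories;

-- Assumption: t2 already exists with matching columns. The data is
-- loaded in a separate step rather than through CTAS.
insert overwrite table t1
select r1, r2 from t2;
```

One way to check the result is to run `describe extended t1;` after the load and look at the skewedInfo structure, in particular whether skewedColValueLocationMaps is still empty.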

Re: Skewed Tables

2014-04-23 Thread Mayur Gupta
Below is my skewedInfo: skewedInfo:SkewedInfo(skewedColNames:[r2], skewedColValues:[[a]], skewedColValueLocationMaps:{}) Any idea why skewedColValueLocationMaps is empty? On Mon, Apr 21, 2014 at 11:19 AM, Mayur Gupta wrote: > Hey There, > > I was trying to use Skewed tables but I am facing