Re: Vectorizied execution on RCFile

2014-01-10 Thread Jitendra Pandey
Hi Rajesh, Some parts of the patch on HIVE-4483 are already committed to the trunk. So the patch may not cleanly apply. Once you have this patch, you would need to use CommonRCFileInputFormat (that this patch provides). We haven't included this input-format in the trunk because we intend to enhan

Re: Vectorizied execution on RCFile

2014-01-10 Thread Rajesh Balamohan
Thanks a lot Eric. I will try with HIVE-4483 patch. As you mentioned, It would be awesome to update the standard input formats to leverage vectorization. ~Rajesh.B On Fri, Jan 10, 2014 at 1:23 AM, Eric Hanson (BIG DATA) < eric.n.han...@microsoft.com> wrote: > There’s actually a different inp

RE: Vectorizied execution on RCFile

2014-01-09 Thread Eric Hanson (BIG DATA)
There’s actually a different inputformat for vectorized processing on RCFile. See https://issues.apache.org/jira/browse/HIVE-4483. Vectorized execution won’t run as fast on RCFile as ORC, but there should still be a noticeable improvement on RCFile. In the future, I think it’s best to update th