Although the two first links are interesting reads, I would recommend you the last one provided by Russel.
The key thing is not to think "I will choose the most efficient wrt to performance" but "I will choose the most in tune with my use cases". Secondly, HCatalog is really something worth mentioning. Basically, the difference in format handling as explained in the other articles is less hard-lined now. Regards Bertrand On Wed, Sep 19, 2012 at 2:42 AM, Russell Jurney <[email protected]>wrote: > A presentation on Hive vs Pig by a committer on both projects is here: > http://hortonworks.com/blog/hadoop-features-large-at-stanford-xldb/ > > > Russell Jurney > twitter.com/rjurney > [email protected] > datasyndrome.com > > On Sep 18, 2012, at 3:01 PM, Aniket Mokashi <[email protected]> wrote: > > (Probably not what you are looking for) Check - > http://www.larsgeorge.com/2009/10/hive-vs-pig.html > > ~Aniket > > On Fri, Sep 14, 2012 at 2:28 PM, Russell Jurney > <[email protected]>wrote: > >> A detailed post comparing Pig/Hive performance from last week: >> http://hortonworks.com/blog/pig-performance-and-optimization-analysis/ >> >> Russell Jurney >> twitter.com/rjurney >> [email protected] >> datasyndrome.com >> >> On Sep 14, 2012, at 10:55 AM, Anurag Tangri <[email protected]> >> wrote: >> >> > Knowing performance statistics would be good too. >> > >> > Sent from my iPhone >> > >> > On Sep 14, 2012, at 10:34 AM, Bharath Mundlapudi <[email protected]> >> wrote: >> > >> >> Hello Community, >> >> >> >> Is there any document/blog comparing different features offered by Pig >> 0.8 (0.9, 0.10) or greater and Hive 0.8 (0.9)? >> >> >> >> -Bharath >> > > > > -- > "...:::Aniket:::... Quetzalco@tl" > > -- Bertrand Dechoux
