subject:"Performance tuning a hive query"

Re: Performance tuning a hive query

2012-07-19 Thread kulkarni.swar...@gmail.com

Couple to add to the list: Indexing[1] Columnar Storage/RCFile[2] [1] https://cwiki.apache.org/confluence/display/Hive/IndexDev [2] http://www.cse.ohio-state.edu/hpcs/WWW/HTML/publications/papers/TR-11-4.pdf On Thu, Jul 19, 2012 at 8:39 AM, Jan Dolinár wrote: > There are many ways, but beware

Re: Performance tuning a hive query

2012-07-19 Thread Jan Dolinár

There are many ways, but beware that some of them may result in worse performance when used inappropriately. Some of the settings we use to achieve faster queries: hive.map.aggr=true hive.exec.parallel=true hive.exec.compress.intermediate=true mapred.job.reuse.jvm.num.tasks=-1 Structuring the que

Re: Performance tuning a hive query

2012-07-19 Thread Nitin Pawar

depends on what kind of query if yoy are doing joins then there are different kind of join queries depending on how did you layout the data and how much of data is held in what table. On Thu, Jul 19, 2012 at 6:54 PM, Abhishek wrote: > > Apart from partitions and buckets how to improve of hive q

Performance tuning a hive query

2012-07-19 Thread Abhishek

Apart from partitions and buckets how to improve of hive queries Regards Abhi Sent from my iPhone