Couple to add to the list:
Indexing[1]
Columnar Storage/RCFile[2]
[1] https://cwiki.apache.org/confluence/display/Hive/IndexDev
[2]
http://www.cse.ohio-state.edu/hpcs/WWW/HTML/publications/papers/TR-11-4.pdf
On Thu, Jul 19, 2012 at 8:39 AM, Jan DolinĂ¡r wrote:
> There are many ways, but beware
There are many ways, but beware that some of them may result in worse
performance when used inappropriately.
Some of the settings we use to achieve faster queries:
hive.map.aggr=true
hive.exec.parallel=true
hive.exec.compress.intermediate=true
mapred.job.reuse.jvm.num.tasks=-1
Structuring the que
depends on what kind of query
if yoy are doing joins then there are different kind of join queries
depending on how did you layout the data and how much of data is held in
what table.
On Thu, Jul 19, 2012 at 6:54 PM, Abhishek wrote:
>
> Apart from partitions and buckets how to improve of hive q
Apart from partitions and buckets how to improve of hive queries
Regards
Abhi
Sent from my iPhone