Hi Mohit,


Can you be more precise about how the fixed and variable row sizes are
evaluated wrongly? I don't quite understand what you mean. Did I miss any
context?



I guess you are running a previous version and are trying to figure out whether
you need to port this patch? In that case, I think OOM is the worst possible
case. We also care about whether one task uses more resources than it really
needs and competes for resources with other tasks. I don't think there can be
any other impact. If you want to try to repro an OOM, you should produce a
condition where the sum of the distinct string key sizes > maximum heap size,
and the fixed size + aggregate parameter size is much smaller than the average
key size. You can try very long distinct string keys as input and group by
them. My feeling is that this is not a common case, since we have never hit an
OOM from this.
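
A minimal sketch of generating such input, assuming a plain text file with one
key per line is an acceptable data source. The function names, the key length
and the file path are hypothetical and are not taken from the patch; pick the
number of keys and the key length so that their product exceeds the heap size
of the child JVM.

```python
def make_key(i: int, length: int) -> str:
    """Return a distinct string of exactly `length` characters for index i."""
    prefix = str(i).zfill(12)  # zero-padded index keeps every key unique
    if length < len(prefix):
        raise ValueError("length is too short to hold the unique prefix")
    return prefix + "x" * (length - len(prefix))


def write_keys(path: str, num_keys: int, key_length: int) -> int:
    """Write one distinct key per line; return the total key characters written."""
    total = 0
    with open(path, "w", encoding="ascii") as out:
        for i in range(num_keys):
            key = make_key(i, key_length)
            out.write(key + "\n")
            total += len(key)
    return total


if __name__ == "__main__":
    # Hypothetical sizes: 1,000,000 keys of 4,096 characters is roughly 4 GB of keys.
    print(write_keys("long_keys.txt", 1_000_000, 4096))
```

The file can then be loaded into a single-column table and grouped by that
column with map-side aggregation enabled, for example
SELECT k, count(1) FROM long_keys GROUP BY k; (table and column names are
placeholders).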



For current trunk or version 0.7, the code is no longer the same as when we did
HIVE-1737, since we now have HIVE-1830, which adds a memory usage check and
forces a flush to disk when memory usage exceeds a threshold, so that even
without HIVE-1737 there won't be an OOM anyway.
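
The idea of a threshold-triggered flush can be sketched roughly as follows.
This is only an illustration and not the HIVE-1830 code: the class name, the
per-entry overhead, the 0.9 threshold and the byte estimate (the real check
would look at actual heap usage) are all assumptions.

```python
class FlushingCounter:
    """Group-by count that flushes partial results once a memory budget is hit."""

    ENTRY_OVERHEAD = 64  # hypothetical fixed cost per hash map entry, in bytes

    def __init__(self, max_bytes: int, threshold: float = 0.9):
        self.limit = max_bytes * threshold  # flush once the estimate exceeds this
        self.counts = {}                    # in-memory hash aggregation map
        self.used = 0                       # rough estimate of bytes held by the map
        self.flushed = []                   # stands in for rows written out on flush

    def add(self, key: str) -> None:
        if key not in self.counts:
            self.counts[key] = 0
            self.used += len(key) + self.ENTRY_OVERHEAD
        self.counts[key] += 1
        if self.used > self.limit:
            self.flush()

    def flush(self) -> None:
        """Emit all partial (key, count) pairs and release the map's memory."""
        self.flushed.extend(self.counts.items())
        self.counts.clear()
        self.used = 0
```

Because a key may be flushed more than once, the emitted rows are partial
counts that a later stage has to merge. The point of the sketch is that the map
is bounded by the memory check alone, whatever the per-entry size estimate was.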



Thanks,



Siying



________________________________
From: Mohit [mohitsi...@huawei.com]
Sent: Tuesday, March 01, 2011 7:08 AM
To: Siying Dong
Cc: Namit Jain; chinna...@huawei.com; hive-...@hadoop.apache.org
Subject: FW: Regarding HIVE-1737

Hi Namit/Siying,

Ok, I agree with your analysis. Both the fixed and variable row sizes are
evaluated wrongly here.

But what I am more interested in is how critical the change is. What happens if
the hash aggregation map is not flushed even though the number of existing
entries has overshot the incorrectly estimated entry limit calculated from the
configured property hive.map.aggr.map.percentmemory (whereas with your code
changes the limit is calculated correctly and the flush is triggered)? Are
there any issues apart from out of memory in the child JVM, or can something
else bad happen?

If you can give me pointers on how to reproduce its side effects, that would be
great.

-Mohit

***************************************************************************************
This e-mail and attachments contain confidential information from HUAWEI, which 
is intended only for the person or entity whose address is listed above. Any 
use of the information contained herein in any way (including, but not limited 
to, total or partial disclosure, reproduction, or dissemination) by persons 
other than the intended recipient's) is prohibited. If you receive this e-mail 
in error, please notify the sender by phone or email immediately and delete it!
________________________________
From: Mohit [mailto:mohitsi...@huawei.com]
Sent: Tuesday, March 01, 2011 12:39 PM
To: 'siyin...@fb.com'
Subject: Regarding HIVE-1737

Hi Siying,

Hope you are doing great.
I have one request regarding this defect: I'm not able to understand it, and
hence cannot reproduce the issue.
Maybe you can help with that; I need to know what queries you ran.

-Mohit

