Avoid GC Pauses on Scan MapReduces

Pere Kyle Thu, 13 Nov 2014 17:58:13 -0800

Hi there,

Recently I have been experiencing instability when scanning our HBase cluster. 
The table we are trying to scan has 1.5B records (1 TB); we have a 12 GB heap 
and 17 servers. Our GC options are as follows:
-XX:OnOutOfMemoryError=kill -9 %p -Xmx12000m -XX:+UseConcMarkSweepGC -Xmx12g 
-Xmx12g
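
For a sense of scale, here is a rough back-of-envelope sketch of the figures 
above. It assumes that 1 TB means 10^12 bytes and that the table is spread 
evenly across the 17 servers; neither is stated, and the class and method names 
are made up for illustration.

```java
public class ScanSizing {
    // Average bytes per record, given total table bytes and record count.
    public static long bytesPerRecord(long tableBytes, long records) {
        return tableBytes / records;
    }

    // Table bytes hosted per server, assuming a perfectly even spread (an assumption).
    public static long bytesPerServer(long tableBytes, int servers) {
        return tableBytes / servers;
    }

    public static void main(String[] args) {
        long tableBytes = 1_000_000_000_000L; // 1 TB, assumed to be decimal
        long records = 1_500_000_000L;        // 1.5B records
        int servers = 17;

        System.out.println("avg bytes per record: " + bytesPerRecord(tableBytes, records));
        System.out.println("table bytes per server: " + bytesPerServer(tableBytes, servers));
    }
}
```

Under those assumptions the average record is a little under 700 bytes, and each 
server holds roughly 59 GB of this table against its 12 GB heap.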


The errors I am seeing in the logs:
2014-11-14 00:00:32,656 WARN org.apache.hadoop.hbase.util.Sleeper 
(regionserver60020): We slept 17355ms instead of 3000ms, this is likely due to 
a long garbage collecting pause and it's usually bad, see 
http://hbase.apache.org/book.html#trouble.rs.runtime.zkexpired
2014-11-14 00:00:32,663 WARN org.apache.hadoop.hbase.util.Sleeper 
(regionserver60020.periodicFlusher): We slept 24734ms instead of 10000ms, this 
is likely due to a long garbage collecting pause and it's usually bad, see 
http://hbase.apache.org/book.html#trouble.rs.runtime.zkexpired

We are using hbase.client.scanner.caching=1000. I suspect this may be a block 
cache issue. My question is whether, and if so how, the block cache can be 
disabled for the scan queries. The scans are taking out writes and causing 
instability on the cluster.
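
For reference, the per-scan switch in the HBase client API is 
Scan.setCacheBlocks(false). Below is a minimal sketch of a map-only scan job 
with it set. The table name "my_table", the RowMapper class, and the job name 
are placeholders, it needs the HBase and Hadoop client jars on the classpath, 
and whether it actually relieves the GC pauses here is untested.

```java
import java.io.IOException;

import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.hbase.HBaseConfiguration;
import org.apache.hadoop.hbase.client.Result;
import org.apache.hadoop.hbase.client.Scan;
import org.apache.hadoop.hbase.io.ImmutableBytesWritable;
import org.apache.hadoop.hbase.mapreduce.TableMapReduceUtil;
import org.apache.hadoop.hbase.mapreduce.TableMapper;
import org.apache.hadoop.io.NullWritable;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.lib.output.NullOutputFormat;

public class ScanWithoutBlockCache {

    // Placeholder mapper: receives one row per call and emits nothing.
    public static class RowMapper extends TableMapper<NullWritable, NullWritable> {
        @Override
        protected void map(ImmutableBytesWritable rowKey, Result row, Context context)
                throws IOException, InterruptedException {
            // Per-row work would go here.
        }
    }

    public static void main(String[] args) throws Exception {
        Configuration conf = HBaseConfiguration.create();
        Job job = Job.getInstance(conf, "scan-without-block-cache");
        job.setJarByClass(ScanWithoutBlockCache.class);

        Scan scan = new Scan();
        scan.setCaching(1000);      // rows fetched per scanner RPC, as in the setting above
        scan.setCacheBlocks(false); // do not populate the block cache with blocks read by this scan

        // "my_table" is a placeholder table name.
        TableMapReduceUtil.initTableMapperJob(
                "my_table", scan, RowMapper.class,
                NullWritable.class, NullWritable.class, job);

        job.setOutputFormatClass(NullOutputFormat.class); // map-only job with no output
        job.setNumReduceTasks(0);

        System.exit(job.waitForCompletion(true) ? 0 : 1);
    }
}
```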

Thanks,
Pere
