On Tue, Nov 25, 2014 at 8:07 PM, Stan Lemon <sle...@salesforce.com> wrote:
> We are using v2.0.11 and have seen several instances in our 24 node
> cluster where the node becomes unresponsive, when we look into it we find
> that there is a cassandra process chewing up a lot of CPU. There are no
> other indications in logs or anything as to what might be happening,
> however if we strace the process that is chewing up CPU we see a
> segmentation fault:
>
> Has anyone seen this? Does anyone have any ideas what might be happening,
> or how we could debug it further?
>

Does it go away when you restart the node?

First, you should do the standard checks for whether this is GC pre-fail,
which looks like a flattop on heap consumption graphs combined with a spike
in GC duration.

If you don't find that or OOM log messages, your version is new enough that
I would file a JIRA at http://issues.apache.org

=Rob
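
If you don't have heap graphs handy, here is a rough sketch of the same GC pre-fail check done against log lines instead. It assumes your system.log contains GCInspector messages shaped like "GC for ConcurrentMarkSweep: 12034 ms for 2 collections, 7900000000 used; max is 8000000000"; verify that against your own logs before trusting it. The function names and thresholds (90% heap, 1 second pause, 3 events in a row) are illustrative assumptions, not anything Cassandra defines.

```python
import re

# Assumed shape of a GCInspector message; check it against your own system.log.
GC_LINE = re.compile(
    r"GC for (?P<collector>\w+): (?P<ms>\d+) ms for (?P<count>\d+) collections, "
    r"(?P<used>\d+) used; max is (?P<max>\d+)"
)


def parse_gc_events(lines):
    """Return (collector, pause_ms, heap_fraction_used) for each matching line."""
    events = []
    for line in lines:
        m = GC_LINE.search(line)
        if m:
            heap_fraction = int(m["used"]) / int(m["max"])
            events.append((m["collector"], int(m["ms"]), heap_fraction))
    return events


def looks_like_gc_prefail(events, min_heap_fraction=0.9, min_pause_ms=1000, min_run=3):
    """True if min_run consecutive events show a near-full heap AND long pauses.

    This approximates "flattop on heap consumption plus a spike in GC duration".
    All thresholds are hypothetical defaults; tune them for your heap size.
    """
    run = 0
    for _collector, pause_ms, heap_fraction in events:
        if heap_fraction >= min_heap_fraction and pause_ms >= min_pause_ms:
            run += 1
            if run >= min_run:
                return True
        else:
            run = 0
    return False
```

You would feed it the lines of system.log, e.g. looks_like_gc_prefail(parse_gc_events(open(path))) where path is wherever your logs live. A True result only suggests heap pressure is worth a closer look; a False result does not rule it out, since GCInspector only logs collections above its own reporting threshold.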