I can't seem to recover a failed node on a database where I made many
updates to the schema.
I have a small cluster with 2 nodes, around 1000 CFs (I know that's a
lot, but it can't be changed right now), and ReplicationFactor=2.
I shut down a node and cleaned its data entirely, then tried to bring it
back up. The node starts fetching schema updates from the live node, but
the operation fails halfway with an OOME.
After some investigation, here is what I found:
- I have a lot of schema updates (there are 2067 rows in the
system.Schema CF).
- The live node loads migrations 1-1000 and sends them to the
recovering node (via Migration.getLocalMigrations()).
- Soon afterwards, the live node checks the schema version on the
recovering node and finds it has advanced only slightly - say it has
applied the first 3 migrations. It then loads migrations 3-1003 and
sends them to the node.
- This process repeats very quickly (it sends migrations 6-1006, then
9-1009, and so on), as sketched below.
From the memory dump and the logs, it looks like each of these
1000-migration blocks is built into a single message and put on the
OutboundTcpConnection queue. However, since the schema is big, these
messages occupy a lot of space and are built faster than the connection
can send them, so they accumulate in OutboundTcpConnection.queue until
memory is completely exhausted.
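As a self-contained toy (again, not Cassandra code - the message size
and send rate are invented), the failure mode looks like this: a
producer filling an unbounded queue faster than a single sender thread
drains it, so retained heap grows until an OOME:

import java.util.concurrent.ConcurrentLinkedQueue;

public class UnboundedQueueSketch {
    public static void main(String[] args) throws InterruptedException {
        ConcurrentLinkedQueue<byte[]> queue = new ConcurrentLinkedQueue<>();

        // "Connection" thread: pretends to write one message every 10 ms.
        Thread sender = new Thread(() -> {
            while (true) {
                queue.poll();
                try {
                    Thread.sleep(10);
                } catch (InterruptedException e) {
                    return;
                }
            }
        });
        sender.setDaemon(true);
        sender.start();

        // Producer: enqueues 1 MB "migration" messages much faster than they go out.
        long produced = 0;
        while (true) {
            queue.add(new byte[1024 * 1024]);
            produced++;
            if (produced % 100 == 0) {
                System.out.printf("produced=%d, still queued=%d (~%d MB retained)%n",
                        produced, queue.size(), queue.size());
            }
            Thread.sleep(1); // still ~10x the rate at which the sender drains
        }
    }
}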
Any suggestions? Can I change something to make this work, apart from
reducing the number of CFs?
Flavio