Hello,
during the Cloudera Developer Training at Berlin I came up with an idea,
regarding a lost name-node.

As in this case all data blocks are lost, the solution could be, to have a
table which relates *filenames *and *block_ids* on that data node a block is
stored.
This table can be scaned after a name-node is lost. Or, even on every block
could be a kind of a backlink to the filename and the total nr of blocks
and/or a total hashsum attached.
This would it make easy to recover a broken HDFS with minimal overhead.

Now I would like to ask the developer community, if there is any good reason
not to do this, before I start to figure out, where to start an
implementation of such a feature.

So the name node would not be any longe such a high risk, I think.

Thanks a lot,
Mirko

Reply via email to