Hello, during the Cloudera Developer Training at Berlin I came up with an idea, regarding a lost name-node.
As in this case all data blocks are lost, the solution could be, to have a table which relates *filenames *and *block_ids* on that data node a block is stored. This table can be scaned after a name-node is lost. Or, even on every block could be a kind of a backlink to the filename and the total nr of blocks and/or a total hashsum attached. This would it make easy to recover a broken HDFS with minimal overhead. Now I would like to ask the developer community, if there is any good reason not to do this, before I start to figure out, where to start an implementation of such a feature. So the name node would not be any longe such a high risk, I think. Thanks a lot, Mirko