Tanks Andrew and Billy. I think the subject of this mail thread is not appropriate, it may not be a balance issue. The problem seems the block deleting scheduler in HDFS.
Last night(timezone:+8), I slow down my application, and this morning, I found almost all garbage blocks are deleted. Here is the current blocks number of each datanode: node1: 10651 node2: 10477 node3: 12185 node4: 11607 node5: 14000 It seems fine. But I want to study the code of HDFS and make clear the policy of deleting blocks on datanodes. If anyone in the hadoop community can give me some advices? Schubert On Thu, Mar 26, 2009 at 7:55 AM, Andrew Purtell <apurt...@apache.org> wrote: > > > From: schubert zhang <zson...@gmail.com> > > From another point of view, I think HBase cannot control to > > delete blocks on which node, it would just delete files, and > > HDFS delete blocks where the blocks locating. > > Yes, that is exactly correct. > > Best regards, > > - Andy > > > > >