David Mollitor created HDFS-14790: ------------------------------------- Summary: Support Client Write Fan-Out Key: HDFS-14790 URL: https://issues.apache.org/jira/browse/HDFS-14790 Project: Hadoop HDFS Issue Type: Improvement Components: block placement, hdfs-client Affects Versions: 3.3.0 Reporter: David Mollitor
The default behavior of an HDFS write is to setup a pipeline. A file is broken into packets and sent through the pipeline. Pipelining provides good throughput, but latency suffers. Allowing a client to specify a fan-out strategy allows the client to send the packets to the DataNodes concurrently instead of passing the packet through a pipeline serially. {code:none} # Pipeline C |-------> DN -------> DN -------> DN # Fan Out |-------> DN C |-------> DN |-------> DN {code} Also, if there's a 'min replication' of, for example, 2. The client only needs to wait for the first 2 ACKs before writing the next packet as long as the 2 ACKs are from different racks. The block placement rules may need to support this. HBase requires this improved latency. -- This message was sent by Atlassian Jira (v8.3.2#803003) --------------------------------------------------------------------- To unsubscribe, e-mail: hdfs-dev-unsubscr...@hadoop.apache.org For additional commands, e-mail: hdfs-dev-h...@hadoop.apache.org