Kihwal Lee created HDFS-7174:
--------------------------------

             Summary: Support for more efficient large directories
                 Key: HDFS-7174
                 URL: https://issues.apache.org/jira/browse/HDFS-7174
             Project: Hadoop HDFS
          Issue Type: Improvement
            Reporter: Kihwal Lee
            Assignee: Kihwal Lee
            Priority: Critical


When the number of children under a directory grows very large, insertion 
becomes very costly.  E.g. creating 1M entries takes 10s of minutes.  This is 
because the complexity of an insertion is O(n). As the size of a list grows, 
the overhead grows n^2. (integral of linear function).  It also causes 
allocations and copies of big arrays.





--
This message was sent by Atlassian JIRA
(v6.3.4#6332)

Reply via email to