Sorry to be coming in to this discussion late. Rather than pull the code out of Hadoop, may I suggest instead making it a separate subproject within Hadoop itself? I'd suggest letting it release independently of Hadoop, since it will need a much faster cadence that Hadoop proper does. It should also keep the number of dependencies to a very small set (empty?).
Thoughts? Owen