Bernd, We would divide the derived code into two categories: that which we modified only slightly (for example to allow us to extend it) and that which we modified heavily. Now that we are able to interact openly, we hope to supply much of that back to the original projects. There is a detailed overview below. We identified these by searching for "copyright" in our code. The total count came to just over 14,000 lines. We use "heavily" as a qualitative assessment of how much we modified, but we could certainly come up with quantitative assessments.
5400 lines: slightly modified versions of Hadoop BCFile and related classes (our current file format extends BCFile) 4300 lines: heavily modified versions of MapFile and SequenceFile (no longer our default file format, but still included for backward compatibility) 2000 lines: heavily modified versions of HBase BlockCache and related files (Adam didn't count the tests when he said 1500 lines) 1300 lines: heavily modified versions of Hadoop BloomFilters 419 lines: modified Hadoop TeraSortIngest to sort data using Accumulo 325 lines: our Value is an immutable version of Hadoop BytesWritable 142 lines: modified ClassLoader based on commons-jci ReloadingClassLoader Billie ----- Original Message ----- From: "Bernd Fondermann" <bernd.fonderm...@googlemail.com> To: general@incubator.apache.org Sent: Sunday, September 4, 2011 3:41:09 AM Subject: Re: [PROPOSAL] Accumulo for the Apache Incubator On Saturday, September 3, 2011, Adam P Fuchs <adam.p.fu...@ugov.gov> wrote: > Hi Bernd, > > The latest stable release of Accumulo contains roughly 200,000 lines of code, of which about 85,000 are machine generated thrift code. Of the remaining code, about 15,000 lines are derived from other Apache projects, and about 1,500 of those are derived from HBase code. The code derived from HBase comprises a query caching layer (block cache, index cache, multi-level LRU logic, etc.). So, you are saying more than 10% of the non-generated code base (and you are not counting lib-style uses/JARs here, right?) is derived from other Apache code? That seems to be unusual. Just curious, could you elaborate a bit about why you did that amd what kind of code that is? Thank you. Bernd --------------------------------------------------------------------- To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org For additional commands, e-mail: general-h...@incubator.apache.org