On Jan 31, 2016 3:52 PM, "Karanjeet Singh" <karan...@usc.edu> wrote:
> Hello Everyone, > > With great pleasure, I would like to introduce DRAT (Distributed Release > Audit Tool) which is a distributed, parallelized wrapper around Apache RAT > to inspect for appropriate open source licensing in software projects. > DRAT was started by my advisor, Chris Mattmann, in an effort to get RAT > working on a ver large code base. RAT uses Apache OODT, Apache Tika, and > Apache Solr. > > We are now auditing the complete Apache SVN code base to check for proper > licenses. Until now, we have scanned 171 / 191 repositories and > illustrated the statistics for 133 of them through D3 visualization > located at http://drat.dyndns.org:8080/dratviz > > Projects should check out the MIME analysis of the code base and click > around. Please also note due to the sheer size of the Apache code bases > and the fact that we scanned and included all revisions in the Apache SVN > repo, DRAT is not running in real time. We are running DRAT on the NSF > Super Computer Wrangler, which has a petabyte of flash storage and the > ability to stand up Hadoop and Spark clusters. We are also working on a > paper describing our results. > > Please send feedback to myself (Karanjeet Singh <karan...@usc.edu>), > Professor Mattmann <mattm...@usc.edu> and/or ird...@mymaillists.usc.edu. > > Thanks & Regards, > Karanjeet Singh > C.S. Graduate Student > University of Southern California > karan...@usc.edu | +1-213-675-9583 >