Does a proposal need a full set of mentors before it can be voted on? Just wondering.

Regards,
Alan

On Jul 22, 2011, at 1:30 PM, Avery Ching wrote:

> Hi and good Friday to you all,
>
> It's been a week since we submitted our proposal for Giraph's inclusion into
> the Apache incubator and the discussion around the proposal seems to have
> settled. Thank you for all the comments/questions/general interest and for
> those who volunteered to be committers. At this time, I'd like to ask for a
> vote.
>
> The latest proposal can be found at the end of this email and in the
> following wiki:
>
> http://wiki.apache.org/incubator/GiraphProposal
>
> The discussion regarding the proposal can be found below:
>
> http://www.mail-archive.com/general@incubator.apache.org/msg29957.html
>
> Please cast your votes:
>
> [ ] +1 Accept Giraph for incubation
> [ ] +0 Indifferent to Giraph incubation
> [ ] -1 Reject Giraph for incubation
>
> This vote will close 72 hours from now.
>
> Thanks!
>
> Avery
>
>
> = Giraph : Large-scale graph processing on Hadoop =
>
> == Abstract ==
>
> Giraph is a large-scale, fault-tolerant, Bulk Synchronous Parallel
> (BSP)-based graph processing framework.
>
> == Proposal ==
>
> Graph processing platforms to run large-scale algorithms (such as PageRank,
> shared connections, personalization-based popularity, etc.) have become quite
> popular. Some recent examples include Pregel and HaLoop. For
> general-purpose big data computation, the MapReduce computation model is
> widely adopted and the most deployed MapReduce infrastructure is Apache
> Hadoop. We have implemented a graph-processing framework that is launched as
> a typical Hadoop MapReduce job to leverage existing Hadoop infrastructure,
> such as Amazon’s EC2. Giraph builds upon the graph-oriented nature of Pregel
> but additionally adds fault-tolerance to the coordinator process with the use
> of ZooKeeper as its centralized coordination service.
> Additionally, Giraph will include a library of generic graph algorithms.
>
> == Background ==
>
> Giraph began development as a side project at Yahoo! at the end of 2010.
> It was made functional in a month, and we then started adding various
> features. Development has been focused on internal customers' needs until
> this point.
>
> == Rationale ==
>
> Web and online social graphs have been rapidly growing in size and scale
> during the past decade. In 2008, Google estimated that the number of web
> pages reached over a trillion. Online social networking and email sites,
> including Yahoo!, Google, Microsoft, Facebook, LinkedIn, and Twitter, have
> hundreds of millions of users and are expected to grow much more in the
> future. Processing these graphs plays a big role in providing relevant and
> personalized information for users, such as results from a search engine or
> news in an online social networking site.
>
> == Initial Goals ==
>
> At this point, most of the functionality has been implemented and we are
> looking to get more adoption and contributions from users outside Yahoo!.
> We want to ensure that performance scales and that the code is robust and
> fault tolerant.
>
> == Current Status ==
>
> === Meritocracy ===
>
> Giraph was initially developed by Avery Ching and Christian Kunz beginning in
> December 2010 at Yahoo!. There are other developers using Giraph at Yahoo!
> who are making suggestions and adding code. We are reaching out to other
> folks at social networking companies for additional usage and development.
>
> === Community ===
>
> Several groups who are interested in either joining our project or using our
> code have contacted us. We certainly believe that there is a lot of interest
> and are actively looking to improve and expand the community.
>
> === Core Developers ===
>
> * Avery Ching: Wrote a majority of the code
> * Christian Kunz: Wrote most of the communication code and security
>   integration with Hadoop
>
> === Alignment ===
>
> Giraph uses several Apache projects as its underlying infrastructure (Hadoop
> and ZooKeeper). It also builds on Apache Maven.
>
> == Known Risks ==
>
> === Orphaned products ===
>
> There are many social networking companies that would be interested in using
> this graph-processing framework and we have already received interest from
> some of them. Yahoo! is already using this code in production and will
> certainly continue to use it in the future as well.
>
> === Inexperience with Open Source ===
>
> While the initial developers have limited experience in contributing to
> open-source projects, Yahoo! as a company has a strong commitment to
> open source and we have several advisors that we can ask for help.
>
> === Homogenous Developers ===
>
> At this time, the project is relatively young and the developers work at only
> two companies (Yahoo! and Jybe). However, given the interest we have seen in
> the project, we expect the diversity to improve in the near future.
>
> === Reliance on Salaried Developers ===
>
> Currently Giraph is being developed by a combination of salaried and
> volunteer time. We expect that other corporations will take an interest in
> this project and likely contribute with salaried developers. Some
> individuals will likely spend volunteer time on it as well. It is still
> early in the project and we are hoping for a lot of growth.
>
> === Relationships with Other Apache Products ===
>
> Giraph depends on many Apache projects: Hadoop, ZooKeeper, Log4j, Commons,
> etc. It is built using Apache Maven.
>
> Giraph has some overlapping functionality with Apache Hama. However, there
> are some significant differences.
> Giraph focuses on graph-based bulk synchronous parallel (BSP) computing,
> while Apache Hama is more for general-purpose BSP computing. Giraph runs on
> the Hadoop infrastructure, while Apache Hama uses its own computing
> framework.
>
> === An Excessive Fascination with the Apache Brand ===
>
> The Apache brand is likely to help us find contributors; however, our
> interest in Apache is primarily because the other projects that we depend
> on are also Apache projects and it makes sense that all this software be
> available from the same place.
>
> === Documentation ===
>
> Currently we have little documentation, but several examples. We are working
> on improving this situation.
>
> === Initial Source ===
>
> The initial source of the code is from Yahoo! and began development in
> December 2010. It is already available on GitHub at
> https://github.com/aching/Giraph.
>
> === Source and Intellectual Property Submission Plan ===
>
> We intend the entire code base to be licensed under the Apache License,
> Version 2.0.
>
> === External Dependencies ===
>
> The required dependencies all have Apache-compatible licenses. The following
> components with non-Apache licenses are enumerated:
> * JSON – Public Domain
>
> === Cryptography ===
>
> Giraph depends on secure Hadoop that can optionally use Kerberos.
>
> == Required Resources ==
>
> === Mailing lists ===
>
> * giraph-private (with moderated subscriptions)
> * giraph-dev
> * giraph-commits
> * giraph-users
>
> === Subversion Directory ===
>
> https://svn.apache.org/repos/asf/incubator/giraph
>
> === Issue Tracking ===
>
> JIRA Giraph (GIRAPH)
>
> === Other Resources ===
>
> Giraph has integration tests that can be run with the LocalJobRunner. These
> same tests are also designed to be run on a small (even single node) Hadoop
> cluster. While not required at this time, it would be nice if such a
> resource were available.
>
> === Initial Committers ===
>
> * Avery Ching, aching at yahoo-inc dot com
> * Christian Kunz, christian at jybe-inc dot com
> * Owen O’Malley, owen at hortonworks dot com
> * Phillip Rhodes, prhodes at apache dot org
> * Hyunsik Choi, hyunsik at apache dot org
> * Jakob Homan, jghoman at apache dot org
> * Arun Suresh, asuresh at yahoo-inc dot com
>
> === Affiliations ===
>
> * Avery Ching, Yahoo!
> * Christian Kunz, Jybe
> * Owen O'Malley, Hortonworks
> * Phillip Rhodes, Fogbeam Labs
> * Hyunsik Choi, Database Lab, Korea University
> * Jakob Homan, LinkedIn
> * Arun Suresh, Yahoo!
>
> == Sponsors ==
>
> === Champion ===
>
> Owen O’Malley
>
> === Nominated Mentors ===
>
> Owen O’Malley
>
> === Sponsoring Entity ===
>
> Apache Incubator PMC