+1 (non-binding) -Anoop-
On Wed, May 20, 2015 at 10:32 AM, Chris Nauroth <cnaur...@hortonworks.com> wrote: > +1 (binding) > > --Chris Nauroth > > > > > On 5/19/15, 2:27 PM, "Stack" <st...@duboce.net> wrote: > > >Following the discussion earlier in the thread [1], I would like to call a > >VOTE to accept Trafodion as a new Apache Incubator project. > > > >The proposal is available on the wiki at [2] and is also attached to this > >mail. > > > >The VOTE is open for at least the next 72 hours: > > > > [ ] +1 accept Trafodion into the Apache Incubator > > [ ] ±0 Abstain > > [ ] -1 because... > > > >I am +1 (binding) > > > >Thank you, > >St.Ack > > > >1. > > > http://mail-archives.apache.org/mod_mbox/incubator-general/201505.mbox/%3C > >CADcMMgG4NHtmFZ519iqgZLA8Lj-E7VmaQ%3Dr8C011LuS5pR0Vkw%40mail.gmail.com%3E > >2. https://wiki.apache.org/incubator/TrafodionProposal > ><https://wiki.apache.org/incubator/TrafodionProposal#preview> > > > > > > > >Trafodion Apache Incubator Proposal > > > >Abstract > > > >Trafodion is a webscale SQL-on-Hadoop solution enabling transactional or > >operational workloads on Hadoop. > > > >Proposal > > > >Apache Trafodion builds on the scalability, elasticity, and flexibility of > >Hadoop. Trafodion extends Hadoop to provide guaranteed transactional > >integrity, enabling new kinds of big data applications to run on Hadoop. > >Key > >features of Apache Trafodion include: > > > >* Full-functioned ANSI SQL language support > >* JDBC/ODBC connectivity for Linux/Windows clients > >* Distributed ACID transaction protection across multiple statements, > >tables and rows > >* Performance improvements for OLTP workloads with compile-time and > >run-time optimizations > >* Support for large data sets using a parallel-aware query optimizer > >* ANSI SQL security and data integrity constraints including referential > >integrity > > > >Hewlett-Packard Company submits this proposal to donate its Apache > >License, > >Version 2.0 open source project known as Trafodion, its source code, > >documentation, and web site content to the Apache Software Foundation in > >order to build an open source community > > > >Background > > > >Trafodion is an open source project sponsored by HP, incubated at HP Labs > >and HP-IT, to develop an enterprise-class SQL-on-Hadoop solution targeting > >big data transactional or operational workloads. HP publically announced > >the open source project and uploaded the source code to GitHub in June > >2014. > > > >The SQL compiler, optimizer and executor components of Trafodion have a > >rich heritage. Under development since 1993, they were released as > >commercial closed source software in various flavors such as HP NonStop > >SQL/MX and HP Neoview. NonStop SQL/MX was designed for online transaction > >processing on HP¹s NonStop (formerly Tandem) fault-tolerant servers and is > >known for its high availability, scalability, and performance. Hundreds of > >companies and thousands of servers are running mission-critical > >applications today on NonStop SQL/MX. In addition, much of these > >components > >today are running internal to HP as the core of its Enterprise Data > >Warehouse (EDW), managing over a PB of data. > > > >Starting in 2013, the software was modified to run on HBase and a new > >distributed transaction manager was written to run as an HBase > >co-processor. > > > >Unlike most NOSQL and other SQL-on-Hadoop open source projects, Trafodion > >provides comprehensive ANSI SQL language support including full-functioned > >data definition (DDL), data manipulation (DML), transaction control (TCL) > >and database utility support. > > > >Trafodion provides comprehensive and standard SQL data manipulation > >support > >including SELECT, INSERT, UPDATE, DELETE, and UPSERT/MERGE syntax with > >language options including join variants, unions, where predicates, > >aggregations (group by and having), sort ordering, sampling, correlated > >and > >nested sub-queries, cursors, and many SQL functions. > > > >Utilities are provided for updating table statistics used by the optimizer > >for costing (i.e. selectivity/cardinality estimates) plan alternatives, > >for > >displaying the chosen SQL execution plan, plan shaping, backup and > >restoring the database, data loading and unloading, and a command line > >utility for interfacing with the database engine. > > > >Explicit control statements are provided to allow applications to define > >transaction boundaries and to abort transactions when warranted, including > >BEGIN WORK, COMMIT WORK, ROLLBACK WORK and SET TRANSACTION. > > > >Trafodion supports ANSI¹s grant/revoke semantics to define user and role > >privileges in terms of managing and accessing the database objects. > > > >Rationale > > > >The name ³Trafodion² (the Welsh word for transactions, pronounced > >³Tra-vod-eee-on²) was chosen specifically to emphasize the differentiation > >that Trafodion provides in closing a critical gap in the Hadoop ecosystem. > >Trafodion builds on the scalability, elasticity, and flexibility of > >Hadoop. > >Trafodion extends Hadoop to provide guaranteed transactional integrity, > >enabling new kinds of big data applications to run on Hadoop. > > > >Current Status > > > >HP released the Trafodion code under the Apache License, Version 2, in > >June > >of 2014. Since that time, we have had one major release in January 2015 > >and > >one minor release in April 2015. The focus of these releases has been in > >getting our base functionality, including security, working on top of > >Apache HBase, as well as improving performance, availability and > >scalability, and integrating better with HBase. > > > >Meritocracy > > > >We want to build a diverse developer community, based on the Apache Way, > >around Trafodion. To help developers become contributors, we have > >documentation on the wiki about the architecture, the source tree > >structure, and an example enhancement. We plan to publish our project > >backlog to the community, specifically highlighting areas where developers > >new to Trafodion may best start contributing, such as extending the > >database functionality with User Defined Routines (UDRs) and integrating > >with other Apache projects in the Hadoop ecosystem. > > > >Community > > > >We have already begun building a community but at this time the community > >consists only of Trafodion developers all HP employees and prospective > >users. We have participated in and hosted HBase Meetups and intend to ramp > >up our community building efforts. > > > >The Trafodion project has seen interest in China, where HP has conducted > >proof-of-concepts with multiple companies and expects to see some of its > >first commercial deployments. To help recruit contributors and users in > >China, members of the team are translating Trafodion wiki content into > >Mandarin. > > > >Core Developers > > > >The core developers are very experienced in database and transaction > >monitor technology, with many having spent more than 20 years working in > >this space. > > > >Alignment > > > >Apache Trafodion relies on Apache HBase as its storage engine. The > >development team has collaborated with and gained valuable advice from > >working with the Apache HBase core developers. Apache Trafodion has > >federation capabilities as well, and can query Trafodion tables stored in > >HBase, native HBase tables, and Apache Hive tables. > > > >Known Risks > > > >Orphaned Products > > > >HP Labs and HP-IT have been incubating Trafodion development for almost > >two > >years. This is part of HP¹s strategy to leverage its investment in > >database > >software and bring software to market as open source and is similar to > >HP¹s > >efforts with OpenStack. Trafodion builds on HP¹s equity investment in the > >Hadoop ecosystem and its efforts to monetize Hadoop through hardware, > >software, and services. HP wants Trafodion to be successful, as HP will > >offer a commercially supported distribution of Trafodion. > > > >Inexperience with Open Source > > > >We have been working with open source software in building closed source > >software for well over two decades. To help transition to doing open > >source > >development, the development team received guidance and best practices > >from > >HP developers working on OpenStack open source projects, many of whom have > >experience working on Apache and other open source projects as well. Since > >releasing Trafodion as an open source project in June of 2014, the > >committers and contributors have moved forward using open source > >development processes and tools for bug tracking and design blueprints and > >Jenkins for continuous integration. As part of the incubation process, we > >recognize we may need to change some of our development processes/tools > >and > >conduct our discussions using Apache email dlists. > > > >Homogenous Developers > > > >Since the initial development of Trafodion has been supported by HP, all > >of > >the current developers are HP employees. Through the support of the Apache > >incubation project, we aim to expand the list of developers and gain > >contributors from related SQL-on-Hadoop projects and the Apache HBase > >project. Trafodion developers are experienced with distributed development > >processes, being primarily based in Palo Alto, CA; Austin, TX; and > >Shanghai, China. Trafodion is written in C++ and Java. > > > >Reliance on Salaried Developers > > > >Currently all of the developers working on the project are paid by their > >employer to work on the project. These developers will work on the open > >source project as well as work on the commercially supported distribution > >of Trafodion that HP will offer. > > > >Relationship with Other Apache Products > > > >Trafodion is built upon Apache HBase and extends it to support ACID > >transactions with HBase co-processors for distributed transaction > >management and recovery. Trafodion envisions future collaborations with > >the > >Apache HBase project on performance optimizations, such as in the areas of > >mixed workload support, High Availability, etc. It also provides > >transactional support and querying from native HBase tables as well. > > > >Trafodion uses Apache Zookeeper to coordinate and manage the distribution > >of connection services across the cluster for load-balancing and high > >availability reconnection purposes in the event a Trafodion process should > >fail. > > > >Trafodion also envisions working with the Apache Ambari project on > >enabling > >better Trafodion manageability. While Ambari focuses on system and > >component level performance metrics, Trafodion manageability will focus in > >a complimentary way on database workload monitoring and performance > >analytics with capabilities more geared towards database administrators. > > > >There are alternative open source projects that are providing > >SQL-on-Hadoop > >capabilities, such as Apache Hive, Apache Drill, and Apache Phoenix. These > >are more focused on reporting and analytics across data structures > >supported on HDFS. In comparison to all of these technologies Trafodion > >provides a very complete implementation of ANSI SQL, one of the most > >sophisticated optimizers for such workloads, a completely parallel data > >flow architecture that does not materialize intermediate results unless > >necessary, full ACID transactional support, ANSI GRANT/REVOKE security, > >and > >other capabilities that would take decades to build in these products. On > >the other hand currently Trafodion is just focused on HBase and querying > >Hive, whereas Hive and Drill provide access to other data formats in HDFS. > > > >An Excessive Fascination with the Apache Brand > > > >We understand the reputation and value of the Apache brand, and no doubt > >believe that it will help us attract contributors and users. Our primary > >goal is to follow a proven, open source development and community building > >model that will make Trafodion successful and enable better collaboration > >with other Apache projects in the Hadoop ecosystem. We also understand the > >rules and guidelines about the use of the Apache brand and intend to > >follow > >them. > > > >Documentation > > > >Documentation and technical details on Trafodion can be found at: > >http://www.trafodion.org/ > > > >Initial Source > > > >The source is available today in a public github repository: > >https://github.com/trafodion/trafodion. > > > >Source and Intellectual Property Submission Plan > > > >The source code has already been released under the Apache License, > >Version > >2. The manuals have been released in Adobe PDF format. As part of the > >submission process, the source for the manuals will be converted from a > >proprietary DocBook XML format to AsciiDoc. > > > >External Dependencies > > > >Two dependencies do not have Apache compatible licenses and will be > >addressed as we enter incubation. One dependency is log4cpp, which is > >licensed under the LGPL. A compatible alternative might be Apache > >incubator > >project log4cxx. The other dependency is unixodbc, which is used as the > >ODBC driver manager. We will look into how Apache Hive manages being able > >to use this incompatible software and do similar. All other dependencies > >have Apache compatible licenses, including Apache 2.0, MIT/X11, MIT, and > >BSD. > > > >Cryptography > > > >Trafodion does not contain any cryptographic code. It does call > >cryptographic libraries: OpenSSL for C++ code and Java Cryptography > >Extension (JCE) for Java code. > > > >Required Resources > > > >Mailing Lists > > > >priv...@trafodion.incubator.apache.org > >d...@trafodion.incubator.apache.org comm...@trafodion.incubator.apache.org > > > >Git Repository > > > >https://git-wip-us.apache.org/repos/afs/incubator-trafodion.git > > > >Issue Tracking > > > >JIRA: JIRA Trafodion (Trafodion) > > > > > >Initial Committers and Affiliation > > > >Dave Birdsall, Hewlett-Packard Company, Dave.Birdsall<AT>hp<DOT>com > >Matt Brown, Hewlett-Packard Company, mattbrown<AT>hp<DOT>com > >Tharak Capirala, Hewlett-Packard Company, Tharak.Capirala<AT>hp<DOT>com > >Alice Chen, Hewlett-Packard Company, Alice.Chen<AT>hp<DOT>com > >John DeRoo, Hewlett-Packard Company, John.Deroo<AT>hp<DOT>com > >Roberta Marton, Hewlett-Packard Company, Roberta.Marton<AT>hp<DOT>com > >Amanda Moran, Hewlett-Packard Company, Amanda.Kay.Moran<AT>hp<DOT>com > >Suresh Subbiah, Hewlett-Packard Company, Suresh.Subbiah<AT>hp<DOT>com > >Sandyha Sundaresan, Hewlett-Packard Company, > >Sandhya.Sundaresan<AT>hp<DOT>com > > > >Sponsors > > > >Champion > > > >Michael Stack, Stack<AT>apache<DOT>org > > > >Nominated Mentors > > > >Andrew Purtell apurtell<AT>apache<DOT>org > >Devaraj Das, ddas<AT>apache<DOT>or > >Enis Söztutar, Enis<AT>apache<DOT>org > >Lars Hofhansl, larsh<AT>apache<DOT>org > >Michael Stack, Stack<AT>apache<DOT>org > >Roman Shaposhnik, rshaposhnik<AT>pivotal<DOT>io > > > >Sponsoring Entity > > > >Apache Incubator PMC > > > --------------------------------------------------------------------- > To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org > For additional commands, e-mail: general-h...@incubator.apache.org > >