Hi Raphael, what kind of numbers are you expecting to see?

Max

On Tue, Apr 25, 2017 at 12:59 PM, Raphael Bircher <rbircherapa...@gmail.com>
wrote:

> Hi all
>
> There is no information about affiliation of the initial committers. The
> only information is, that they are from Airbnb inc and Hortonworks. But
> there are no numbers.
>
> Regards Raphael
>
>
> Am .04.2017, 09:17 Uhr, schrieb Jeff Feng <jeff.f...@gmail.com>:
>
> Thanks John and Max - I have updated the proposal wiki to reflect this
>> update.  It now reads:
>>
>> Source and Intellectual Property Submission Plan
>>
>> Airbnb will submit a Software Grant Agreement (SGA) as Superset joins the
>> incubator. We do not expect any complications for the submission of the
>> Superset code base. Our code is already in Github and there is only a
>> single code base.
>>
>>
>> On Mon, Apr 24, 2017 at 11:32 PM, Maxime Beauchemin <
>> maximebeauche...@gmail.com> wrote:
>>
>> "Airbnb will submit a Software Grant Agreement (SGA) as Superset joins the
>>> incubator."
>>>
>>> Should I add this sentence in the proposal?
>>>
>>> Max
>>>
>>> On Mon, Apr 24, 2017 at 5:48 AM, John D. Ament <johndam...@apache.org>
>>> wrote:
>>>
>>> > I missed this discussion.  In your IP section, you list out:
>>> >
>>> > == Source and Intellectual Property Submission Plan ==
>>> > We do not expect any complications for the submission of the Superset
>>> code
>>> > base.  Our code is already in Github and there is only a single code
>>> base.
>>> >
>>> > This IMHO not clear.  Does Airbnb plan to submit a SGA for Superset, or
>>> > expect that no SGA is required because its Apache licensed?
>>> >
>>> > John
>>> >
>>> > On Sun, Apr 2, 2017 at 4:09 PM Jeff Feng <jeff.f...@airbnb.com.invalid
>>> >
>>> > wrote:
>>> >
>>> > > Dear Apache Incubator Community,
>>> > >
>>> > > We are excited to share our proposal for discussion and feedback for
>>> > > entering Apache Incubation.  Superset is an enterprise-ready web
>>> > > application for data exploration, data visualization and
>>> dashboarding.
>>> > >
>>> > > Our Incubation proposal is at the following Wiki as well as copied in
>>> the
>>> > > email below:
>>> > >
>>> > > https://wiki.apache.org/incubator/SupersetProposal
>>> > >
>>> > > We have an active Superset community including 400+ members and
>>> nearly
>>> > 200
>>> > > topics.  The Google Group can be found below.  We plan to move the
>>> > > discussion to the ASF:
>>> > >
>>> > > https://groups.google.com/forum/#!forum/airbnb_superset
>>> > >
>>> > > Thank you and look forward to the discussion!
>>> > >
>>> > > Jeff, Max & Alanna
>>> > >
>>> > >
>>> > >
>>> > >
>>> > > = Superset =
>>> > >
>>> > > == Abstract ==
>>> > >
>>> > > Superset is an enterprise-ready web application for data exploration,
>>> > data
>>> > > visualization and dashboarding.
>>> > >
>>> > > == Proposal ==
>>> > >
>>> > > Superset is business intelligence (BI) software that helps modern
>>> > > organizations visualize and interact with their data. Superset
>>> enables
>>> > > users explore data from a variety of databases, assemble beautiful
>>> > > dashboards and share their findings.  Superset works neatly with all
>>> > modern
>>> > > SQL-speaking databases, and integrates with Druid.io to provide
>>> > real-time,
>>> > > interactive, blazing fast data access to large datasets.
>>> > >
>>> > > == Background ==
>>> > >
>>> > > Data is mission critical. To succeed in this era, organizations need
>>> to
>>> > > provide low-friction, intuitive and interactive access to data. It is
>>> > > paramount for knowledge workers to be capable of answering their own
>>> > > questions by querying, exploring and visualizing data.
>>> > >
>>> > > The entire business intelligence industry has pivoted from a model of
>>> > > centralized top-down platforms driven by IT organizations to
>>> self-service
>>> > > analytics and agile workflows by any user.  This shift unblocks
>>> > centralized
>>> > > service bottlenecks for creating data visualizations while also
>>> creating
>>> > an
>>> > > environment that is iterative and fast-moving.  This means that
>>> business
>>> > > intelligence software must also be easy and delightful to use.
>>> > > Self-service analytics doesn’t mean that admin and governance
>>> features
>>> > are
>>> > > not needed.
>>> > >
>>> > > Modern BI tools provide fine-grain access controls and auditing
>>> > > capabilities to understand how data is being used.  Superset is a
>>> > solution
>>> > > that delivers on all of these vectors.
>>> > >
>>> > > The technology stack is also constantly morphing - vendors are
>>> struggling
>>> > > to provide cheap, quick and easy solutions to access data.  Business
>>> > > intelligence users are finding existing solutions lacking as these
>>> > software
>>> > > products either disregard or react slowly to recent game-changing
>>> > > technologies like Druid.io, PrestoDB, Apache Drill, Apache Kylin,
>>> d3.js,
>>> > > React.js and iPython’s Jupyter for instance.
>>> > >
>>> > > == Rationale ==
>>> > >
>>> > > Business intelligence is more relevant today than at any other point
>>> in
>>> > > history.  Organizations are currently very limited in options for
>>> open
>>> > > source data visualization solutions, especially solutions that are
>>> both
>>> > > self-service and enterprise-ready.  Every company informing their
>>> > decisions
>>> > > with data needs a BI tool.
>>> > >
>>> > > We believe that Superset will be a strong compliment to existing
>>> Apache
>>> > > Software Foundation technologies by offering scalable user
>>> interactions
>>> > to
>>> > > distributed storage and computation solutions.  Users will often find
>>> > that
>>> > > Superset can act as a catalyst for tooling that can visualize the
>>> > byproduct
>>> > > of data and computation infrastructure.
>>> > >
>>> > > Superset has many key design elements that help fill a gap in current
>>> > > solutions for organizations:
>>> > >
>>> > > * Easy, low friction access to data through a simple, web-based data
>>> > > exploration interface.  Composing charts and dashboards are
>>> intuitive.
>>> > > Eliminating the need to write code or SQL empowers anyone to use it.
>>> > >
>>> > > * Access to a wide array of rich, interactive data visualization
>>> types.
>>> > >
>>> > > * Enterprise-ready: Integration with different authentication
>>> mechanisms
>>> > > and granular permissions centered around actions and data access.
>>> > >
>>> > > * Realtime & fast: Superset provides realtime analytics at the speed
>>> of
>>> > > thought on very large datasets when integrated with Druid.io.
>>> > >
>>> > > * Broad data access: Consume data out of any SQL-speaking relational
>>> > > database.
>>> > >
>>> > > * Extensible: Can be extended to talk to many noSQL databases like
>>> Apache
>>> > > Drill, Elastic Search, and other popular database engines.
>>> > >
>>> > > * Fast loading dashboards with configurable web-scale caching.
>>> > >
>>> > > * Plug-in framework that enables organizations to build custom
>>> analytical
>>> > > applications with new UI/UX interfaces.
>>> > >
>>> > > * SQL Lab, a state-of-the-art SQL IDE that empowers SQL-speaking
>>> users
>>> > with
>>> > > more flexibility.  SQL Lab integrates with the visualization engine
>>> > > seamlessly.
>>> > >
>>> > > == Initial Goals ==
>>> > >
>>> > > The initial goals of the Superset project are several-fold:
>>> > >
>>> > > Move the existing codebase to Apache and integrate with the Apache
>>> > > development process.
>>> > >
>>> > > Redesign the user interface and interaction model for creating
>>> > > visualizations/dashboards and connecting to data sources
>>> > >
>>> > > Build robust support for security and governance of the tool
>>> including
>>> > > popular authorization modules (including Apache Ranger and Apache
>>> Sentry)
>>> > > and a more sophisticated permissions system
>>> > >
>>> > > Grow the extensibility of the project both in terms of enhanced
>>> > > connectivity to NoSQL-based data sources and creating a plug-in
>>> framework
>>> > > that enables organizations to build custom analytical applications
>>> which
>>> > > require a new UI/UX
>>> > >
>>> > > == Current Status ==
>>> > >
>>> > > By many standards, Superset is already a successful open source
>>> project.
>>> > As
>>> > > of March 2017, Superset is officially used in production at about a
>>> dozen
>>> > > companies, has received contributions from over one hundred
>>> contributors
>>> > on
>>> > > Github, 1500+ forks, and 12k+ stars.
>>> > >
>>> > > Sizeable companies like Airbnb, Yahoo! and Hortonworks have made
>>> > > significant contributions, and expressed their commitment to the
>>> project.
>>> > > The product is feature complete and has been viable for months. It
>>> > already
>>> > > serves as the main interface for consuming data at many companies of
>>> > > different sizes.
>>> > >
>>> > > While the product is usable, there’s room for improvement across the
>>> > board,
>>> > > starting with providing a smoother user experience around content
>>> > creation,
>>> > > making sure all features work out-of-the-box on more platforms and
>>> > > databases, providing better user training guides and videos, having a
>>> > > predictable release process, and increasing the overall quality of
>>> the
>>> > > Superset releases.
>>> > >
>>> > > === Meritocracy ===
>>> > >
>>> > > We plan to invest in supporting a meritocracy. We will discuss the
>>> > > requirements in an open forum. Several companies have expressed
>>> interest
>>> > in
>>> > > this project, and we intend to invite additional developers to
>>> > participate.
>>> > > We will encourage and monitor community participation so that
>>> privileges
>>> > > can be extended to those that contribute.
>>> > >
>>> > > === Community ===
>>> > >
>>> > > The need for an enterprise-ready data visualization and exploration
>>> > > platform in the open source community is tremendous.  While Superset
>>> is
>>> > > fairly well known, recognized and used within the Druid.io community,
>>> > > adoption is currently limited outside of that niche. There is a huge
>>> > > opportunity to grow the community to hundreds if not thousands of
>>> > > organizations, and we are hoping that embracing “the Apache way” will
>>> > > accelerate the growth of our community.
>>> > >
>>> > > We have already been active at seeking and inviting contributions,
>>> and
>>> > are
>>> > > planning to scale the project by investing time and growing the
>>> support
>>> > > structure to grow the community.
>>> > >
>>> > > === Core Developers ===
>>> > >
>>> > > The initial committers for Superset include experienced full stack,
>>> > > front-end and data engineers:
>>> > >
>>> > > * Maxime Beauchemin (Airbnb)
>>> > >
>>> > > * Alanna Scott (Airbnb)
>>> > >
>>> > > * Bogdan Kyryliuk (Airbnb)
>>> > >
>>> > > * Vera Liu  (Airbnb)
>>> > >
>>> > > * Jeff Feng (Airbnb)
>>> > >
>>> > > * Ashutosh Chauhan (Hortonworks)
>>> > >
>>> > > * Nishant Bangarwa (Hortonworks)
>>> > >
>>> > > * Slim Bouguerra (Hortonworks)
>>> > >
>>> > > * Priyank Shah (Hortonworks)
>>> > >
>>> > > * Sriharsha Chintalapani (Hortonworks)
>>> > >
>>> > > * Daniel Dai (Hortonworks)
>>> > >
>>> > > We realize that additional employer diversity is needed, and we will
>>> work
>>> > > aggressively to recruit developers from additional companies.
>>> > >
>>> > > === Alignment ===
>>> > >
>>> > > The initial committers strongly believe that a system for interactive
>>> > > visualization of data will gain broader adoption as an open source,
>>> > > community driven project, where the community can contribute not only
>>> to
>>> > > the core components, but also to a growing collection of connectors,
>>> > > visualizations and improving integration a all potential data
>>> sources.
>>> > > Superset already integrates closely with Apache Hive, the Hive
>>> metastore,
>>> > > as well as most SQL-speaking databases found in modern data
>>> ecosystems.
>>> > >
>>> > > == Known Risks ==
>>> > >
>>> > > === Orphaned Products ===
>>> > >
>>> > > Superset is a vital component for both visualizing, accessing and
>>> > > democratizing data at Airbnb.  Also at Hortonworks, Superset is a
>>> core
>>> > > component of the DataFlow product offering.  Thus, the risk of the
>>> > project
>>> > > being orphaned is relatively low.  The project could be at risk if
>>> Airbnb
>>> > > changes their approach for democratizing data or if Hortonworks
>>> changes
>>> > > their strategy in the market.  In such an event, the committers plan
>>> to
>>> > > continue working on the project on their own time, thought the
>>> progress
>>> > > will likely be slower.  We plan to mitigate this risk by recruiting
>>> > > additional committers.
>>> > >
>>> > > === Inexperience with Open Source ===
>>> > >
>>> > > The initial committers include veteran Apache members (committers and
>>> PMC
>>> > > members) and other developers who have varying degrees of experience
>>> with
>>> > > open source projects. All have been involved with source code that
>>> has
>>> > been
>>> > > released under an open source license, and several also have
>>> experience
>>> > > developing code with an open source development process.
>>> > >
>>> > > === Homogenous Developers ===
>>> > >
>>> > > The initial committers are employed by Airbnb Inc., and Hortonworks.
>>> We
>>> > are
>>> > > committed to recruiting additional committers from other companies.
>>> > >
>>> > > === Reliance on Salaried Developers ===
>>> > >
>>> > > It is expected that Superset development will occur on both salaried
>>> time
>>> > > and on volunteer time, after hours. The majority of initial
>>> committers
>>> > are
>>> > > paid by their employer to contribute to this project. However, they
>>> are
>>> > all
>>> > > passionate about the project, and we are confident that the project
>>> will
>>> > > continue even if no salaried developers contribute to the project. We
>>> are
>>> > > committed to recruiting additional committers including non-salaried
>>> > > developers.
>>> > >
>>> > > === Relationships with Other Apache Products ===
>>> > >
>>> > > To the knowledge of the Initial Committers, there are no direct
>>> > competitors
>>> > > to Superset within the Apache Software Foundation.  That said, Apache
>>> > > Zeppelin is an indirect competitor, but it solves a different use
>>> case.
>>> > >
>>> > > Apache Zeppelin is a web-based notebook that enables interactive data
>>> > > analytics. It enables the creation of beautiful data-driven,
>>> interactive
>>> > > and collaborative documents with SQL, Scala and more.  Although a
>>> user
>>> > can
>>> > > create data visualizations using this project, it leverages a
>>> notebook
>>> > > style user interfaces and it is geared towards the Spark community
>>> where
>>> > > Scala and SQL co-exist
>>> > >
>>> > > We look forward to collaborating with those communities, as well as
>>> other
>>> > > Apache communities.
>>> > >
>>> > > === An Excessive Fascination with the Apache Brand ===
>>> > >
>>> > > Superset is solving two huge challenges:
>>> > >
>>> > > The challenge of enabling every knowledge worker to make data
>>> informed
>>> > > decisions, particularly those who are not deeply skilled at writing
>>> SQL.
>>> > >
>>> > > The challenge of visualizing huge amounts of data interactively and
>>> in
>>> > > real-time
>>> > >
>>> > > Superset was first developed as a data visualization solution for
>>> > Druid.io
>>> > > as a way to visualize billions of rows of data.  Since then, usage of
>>> > > Superset has expanded to address data visualization use cases across
>>> SQL
>>> > > speaking data sources as well.
>>> > >
>>> > > Our rationale for developing Superset as an Apache project is
>>> detailed
>>> in
>>> > > the Rationale Section.  We believe that the Apache brand and
>>> community
>>> > > process will help us attract more contributors to this project, and
>>> help
>>> > > grow the footprint of the project through usage at other
>>> organizations
>>> > and
>>> > > within other applications.  Establishing consensus among users and
>>> > > developers will result in a more valuable tool for everyone.
>>> > >
>>> > > == Documentation ==
>>> > >
>>> > > References to further reading material:
>>> > >
>>> > > * [[http://airbnb.io/superset/|Superset Documentation]]
>>> > >
>>> > > * [[https://medium.com/airbnb-engineering/caravel-airbnb-s-dat
>>> > > a-exploration-platform-15a72aa610e5#.npqmmbu25|Blog Post:  Superset:
>>> > > Airbnb’s Data Exploration Platform]]
>>> > >
>>> > > * [[https://medium.com/airbnb-engineering/superset-scaling-dat
>>> > > a-access-and-visual-insights-at-airbnb-3ce3e9b88a7f#.a505zvb1t|Blog
>>> > Post:
>>> > >  Superset: Scaling Data Access & Visual Insights at Airbnb]]
>>> > >
>>> > > == Initial Source ==
>>> > >
>>> > > The origin of the proposed code base can be found at
>>> > > https://github.com/airbnb/superset.  The code base is primarily in
>>> > Python.
>>> > >
>>> > > == Source and Intellectual Property Submission Plan ==
>>> > >
>>> > > We do not expect any complications for the submission of the Superset
>>> > code
>>> > > base.  Our code is already in Github and there is only a single code
>>> > base.
>>> > >
>>> > > == External Dependencies ==
>>> > >
>>> > > List of Python packages, from the Python Package Index (Pypi):
>>> > >
>>> > > * boto3
>>> > >
>>> > > * celery
>>> > >
>>> > > * cryptography
>>> > >
>>> > > * flask-appbuilder
>>> > >
>>> > > * flask-cache
>>> > >
>>> > > * flask-migrate
>>> > >
>>> > > * flask-script
>>> > >
>>> > > * flask-sqlalchemy
>>> > >
>>> > > * flask-testing
>>> > >
>>> > > * humanize
>>> > >
>>> > > * gunicorn
>>> > >
>>> > > * markdown
>>> > >
>>> > > * pandas
>>> > >
>>> > > * parsedatetime
>>> > >
>>> > > * pydruid
>>> > >
>>> > > * PyHive
>>> > >
>>> > > * python-dateutil
>>> > >
>>> > > * requests
>>> > >
>>> > > * simplejson
>>> > >
>>> > > * six
>>> > >
>>> > > * sqlalchemy
>>> > >
>>> > > * sqlalchemy-utils
>>> > >
>>> > > * sqlparse
>>> > >
>>> > > * thrift
>>> > >
>>> > > * thrift-sasl
>>> > >
>>> > > * werkzeug
>>> > >
>>> > > List of Javascript packages, from NPM:
>>> > >
>>> > > * autobind-decorator
>>> > >
>>> > > * bootstrap
>>> > >
>>> > > * bootstrap-datepicker
>>> > >
>>> > > * brace
>>> > >
>>> > > * brfs
>>> > >
>>> > > * cal-heatmap
>>> > >
>>> > > * classnames
>>> > >
>>> > > * d3
>>> > >
>>> > > * d3-cloud
>>> > >
>>> > > * d3-sankey
>>> > >
>>> > > * d3-scale
>>> > >
>>> > > * d3-tip
>>> > >
>>> > > * datamaps
>>> > >
>>> > > * datatables-bootstrap3-plugin
>>> > >
>>> > > * datatables.net-bs
>>> > >
>>> > > * font-awesome
>>> > >
>>> > > * gridster
>>> > >
>>> > > * immutability-helper
>>> > >
>>> > > * immutable
>>> > >
>>> > > * jquery
>>> > >
>>> > > * lodash.throttle
>>> > >
>>> > > * mapbox-gl
>>> > >
>>> > > * moment
>>> > >
>>> > > * moments
>>> > >
>>> > > * mustache
>>> > >
>>> > > * nvd3
>>> > >
>>> > > * react
>>> > >
>>> > > * react-ace
>>> > >
>>> > > * react-bootstrap
>>> > >
>>> > > * react-bootstrap-table
>>> > >
>>> > > * react-dom
>>> > >
>>> > > * react-draggable
>>> > >
>>> > > * react-gravatar
>>> > >
>>> > > * react-grid-layout
>>> > >
>>> > > * react-map-gl
>>> > >
>>> > > * react-redux
>>> > >
>>> > > * react-resizable
>>> > >
>>> > > * react-select
>>> > >
>>> > > * react-syntax-highlighter
>>> > >
>>> > > * reactable
>>> > >
>>> > > * redux
>>> > >
>>> > > * redux-localstorage
>>> > >
>>> > > * redux-thunk
>>> > >
>>> > > * shortid
>>> > >
>>> > > * style-loader
>>> > >
>>> > > * supercluster
>>> > >
>>> > > * topojson
>>> > >
>>> > > * victory
>>> > >
>>> > > * viewport-mercator-project
>>> > >
>>> > > == Cryptography ==
>>> > >
>>> > > The proposal does not include cryptographic code.
>>> > >
>>> > > == Required Resources ==
>>> > >
>>> > > === Mailing List ===
>>> > >
>>> > > There is a current mailing list as a Google Group “airbnb_superset”
>>> that
>>> > we
>>> > > are planning on deprecating as the Apache.org become ready to serve
>>> our
>>> > > community.
>>> > >
>>> > > * superset-private
>>> > >
>>> > > * superset-dev
>>> > >
>>> > > * superset-user
>>> > >
>>> > > === Subversion Directory ===
>>> > >
>>> > > Git is the preferred source control system.
>>> > http://svn.apache.org/repos/as
>>> > > f/incubator/superset <http://svn.apache.org/repos/
>>> asf/incubator/superset
>>> > >
>>> > >
>>> > > == Git Repository ==
>>> > >
>>> > > Git is the preferred source control system, we’re assuming
>>> > > https://github.com/apache/incubator-superset based on the naming
>>> scheme
>>> > >
>>> > > == Issue Tracking ==
>>> > >
>>> > > JIRA Superset (SUPERSET). If possible, we’d like to use Github
>>> issues &
>>> > PRs
>>> > > to manage our project as much as possible. It’s been said that there
>>> are
>>> > > ways to keep Github’s issues in sync with Jira, allowing us to get
>>> best
>>> > of
>>> > > both worlds. If that is not possible, we will comply to using Jira.
>>> > >
>>> > > == Other Resources ==
>>> > >
>>> > > We currently use a set of Github integrated services that are free to
>>> the
>>> > > open source community, like Travis-ci, Code Climate, Coveralls,
>>> > > Landscape.io, Requires.io, david-dm and Gitter. We would like to keep
>>> > using
>>> > > these services as they allow us to scale contributions and optimize
>>> our
>>> > > development flows. These services require some elevated rights on the
>>> > > Github repository in order to set up or tune and we would like for
>>> the
>>> > > committers to have the required rights.
>>> > >
>>> > >
>>> > > == Initial Committers ==
>>> > >
>>> > > * Maxime Beauchemin <maxime.beauche...@airbnb.com> - PMC & Committer
>>> > >
>>> > > * Alanna Scott <alanna.sc...@airbnb.com> - PMC & Committer
>>> > >
>>> > > * Bogdan Kyryliuk <b.kyryl...@gmail.com> - PMC & Committer
>>> > >
>>> > > * Vera Liu <vera....@airbnb.com> - Committer
>>> > >
>>> > > * Jeff Feng <jeff.f...@airbnb.com> - PMC & Committer
>>> > >
>>> > > * Ashutosh Chauhan <hashut...@apache.org> - Mentor & Committer
>>> > >
>>> > > * Nishant Bangarwa <nbanga...@hortonworks.com> - PMC & Committer
>>> > >
>>> > > * Slim Bouguerra <sbougue...@hortonworks.com> - Committer
>>> > >
>>> > > * Priyank Shah <ps...@hortonworks.com> - Committer
>>> > >
>>> > > * Harsha Chintalapani <schintalap...@hortonworks.com> - Committer
>>> > >
>>> > > * Daniel Dai <da...@apache.org> - Champion & Committer
>>> > >
>>> > > == Affiliations ==
>>> > >
>>> > > The initial committers are employees of Airbnb Inc. and Hortonworks.
>>> > >
>>> > > == Sponsors ==
>>> > >
>>> > > === Champion ===
>>> > >
>>> > > Daniel Dai <da...@apache.org>
>>> > >
>>> > > === Nominated Mentors ===
>>> > >
>>> > > Ashutosh Chauhan <hashut...@apache.org>
>>> > >
>>> > > === Sponsoring Entity ===
>>> > >
>>> > > Incubator PMC
>>> > >
>>> > >
>>> > > --
>>> > >
>>> > > *Jeff Feng*
>>> > > Product Manager
>>> > > m: (949)-610-5108 <(949)%20610-5108> <(949)%20610-5108>
>>> > > twitter: @jtfeng
>>> > >
>>> >
>>>
>>>
>
> --
> My introduction https://youtu.be/Ln4vly5sxYU
>
>
> ---------------------------------------------------------------------
> To unsubscribe, e-mail: general-unsubscr...@incubator.apache.org
> For additional commands, e-mail: general-h...@incubator.apache.org
>
>

Reply via email to