Re: [VOTE] Accept Druid into the Apache Incubator

2018-02-23 Thread Jyotirmoy Sundi
+1

On 2018/02/22 19:03:55, Julian Hyde wrote: 
> Hi all,> 
> 
> After some discussion on the Druid proposal[1], I'd like to> 
> start a vote on accepting Druid into the Apache Incubator,> 
> per the ASF policy[2] and voting rules[3].> 
> 
> A vote for accepting a new Apache Incubator podling is a> 
> majority vote for which only Incubator PMC member votes are> 
> binding. Votes from other people are also welcome as an> 
> indication of people's enthusiasm (or lack thereof).> 
> 
> Please do not use this VOTE thread for discussions. If> 
> needed, start a new thread instead.> 
> 
> This vote will run for at least 72 hours. Please VOTE as> 
> follows:> 
> [ ] +1 Accept Druid into the Apache Incubator> 
> [ ] +0 Abstain> 
> [ ] -1 Do not accept Druid into the Apache Incubator> 
> because ...> 
> 
> The proposal is listed below, but you can also access it on> 
> the wiki[4].> 
> 
> Julian> 
> 
> [1] 
> https://lists.apache.org/thread.html/b95f90a30b6e8587e9b108f368b07c1b3e23e25ca592448d9c9f81e2@%3Cgeneral.incubator.apache.org%3E>
>  
> 
> [2] 
> https://incubator.apache.org/policy/incubation.html#approval_of_proposal_by_sponsor>
>  
> 
> [3] http://www.apache.org/foundation/voting.html> 
> 
> [4] https://wiki.apache.org/incubator/DruidProposal> 
> 
> 
> 
> 
> 
> = Druid Proposal => 
> 
> == Abstract ==> 
> 
> Druid is a high-performance, column-oriented, distributed> 
> data store.> 
> 
> == Proposal ==> 
> 
> Druid is an open source data store designed for real-time> 
> exploratory analytics on large data sets. Druid's key> 
> features are a column-oriented storage layout, a distributed> 
> shared-nothing architecture, and ability to generate and> 
> leverage indexing and caching structures. Druid is typically> 
> deployed in clusters of tens to hundreds of nodes, and has> 
> the ability to load data from Apache Kafka and Apache> 
> Hadoop, among other data sources. Druid offers two query> 
> languages: a SQL dialect (powered by Apache Calcite) and a> 
> JSON-over-HTTP API.> 
> 
> Druid was originally developed to power a slice-and-dice> 
> analytical UI built on top of large event streams. The> 
> original use case for Druid targeted ingest rates of> 
> millions of records/sec, retention of over a year of data,> 
> and query latencies of sub-second to a few seconds. Many> 
> people can benefit from such capability, and many already> 
> have (see http://druid.io/druid-powered.html). In addition,> 
> new use cases have emerged since Druid's original> 
> development, such as OLAP acceleration of data warehouse> 
> tables and more highly concurrent applications operating> 
> with relatively narrower queries.> 
> 
> == Background ==> 
> 
> Druid is a data store designed for fast analytics. It would> 
> typically be used in lieu of more general purpose query> 
> systems like Hadoop MapReduce or Spark when query latency is> 
> of the utmost importance. Druid is often used as a data> 
> store for powering GUI analytical applications.> 
> 
> The buzzwordy description of Druid is a high-performance,> 
> column-oriented, distributed data store. What we mean by> 
> this is:> 
> 
> * "high performance": Druid aims to provide low query> 
> latency and high ingest rates possible.> 
> * "column-oriented": Druid stores data in a column-oriented> 
> format, like most other systems designed for analytics. It> 
> can also store indexes along with the columns.> 
> * "distributed": Druid is deployed in clusters, typically of> 
> tens to hundreds of nodes.> 
> * "data store": Druid loads your data and stores a copy of> 
> it on the cluster's local disks (and may cache it in> 
> memory). It doesn't query your data from some other> 
> storage system.> 
> 
> == Rationale ==> 
> 
> Druid is a mature, active project with a large number of> 
> production installations, dozens of contributors to each> 
> release, and multiple vendors offering professional> 
> support. Given Druid's strong community, its close> 
> integration with many other Apache projects (such as Kafka,> 
> Hadoop, and Calcite), and its pre-existing Apache-inspired> 
> governance structure, we feel that Apache is the best home> 
> for the project on a long-term basis.> 
> 
> == Current Status ==> 
> 
> === Meritocracy ===> 
> 
> Since Druid was first open sourced the original developers> 
> have solicited contributions from others, including through> 
> our blog, the project mailing lists, and through accepting> 
> GitHub pull requests. We have an Apache-inspired governance> 
> structure with a PMC and committers, and our committer ranks> 
> include a good number of people from outside the original> 
> development team.> 
> 
> === Community ===> 
> 
> The Druid core developers have sought to nurture a community> 
> throughout the life of the project. We use GitHub as the> 
> focal point for bug reports and code contributions, and the> 
> mailing lists for most other discussion. To try to make> 
> people feel welcome, we've also spelled this out on a> 
> "CONTRIBUTE" l

Re: [VOTE] Accept Druid into the Apache Incubator

2018-02-23 Thread Xin Wang
+1 (non-binding)

- Xin Wang

2018-02-23 4:34 GMT+08:00 Jyotirmoy Sundi :

> +1
>
> On 2018/02/22 19:03:55, Julian Hyde wrote:
> > Hi all,>
> >
> > After some discussion on the Druid proposal[1], I'd like to>
> > start a vote on accepting Druid into the Apache Incubator,>
> > per the ASF policy[2] and voting rules[3].>
> >
> > A vote for accepting a new Apache Incubator podling is a>
> > majority vote for which only Incubator PMC member votes are>
> > binding. Votes from other people are also welcome as an>
> > indication of people's enthusiasm (or lack thereof).>
> >
> > Please do not use this VOTE thread for discussions. If>
> > needed, start a new thread instead.>
> >
> > This vote will run for at least 72 hours. Please VOTE as>
> > follows:>
> > [ ] +1 Accept Druid into the Apache Incubator>
> > [ ] +0 Abstain>
> > [ ] -1 Do not accept Druid into the Apache Incubator>
> > because ...>
> >
> > The proposal is listed below, but you can also access it on>
> > the wiki[4].>
> >
> > Julian>
> >
> > [1] https://lists.apache.org/thread.html/b95f90a30b6e8587e9b108f368b07c
> 1b3e23e25ca592448d9c9f81e2@%3Cgeneral.incubator.apache.org%3E>
> >
> > [2] https://incubator.apache.org/policy/incubation.html#
> approval_of_proposal_by_sponsor>
> >
> > [3] http://www.apache.org/foundation/voting.html>
> >
> > [4] https://wiki.apache.org/incubator/DruidProposal>
> >
> >
> >
> >
> >
> > = Druid Proposal =>
> >
> > == Abstract ==>
> >
> > Druid is a high-performance, column-oriented, distributed>
> > data store.>
> >
> > == Proposal ==>
> >
> > Druid is an open source data store designed for real-time>
> > exploratory analytics on large data sets. Druid's key>
> > features are a column-oriented storage layout, a distributed>
> > shared-nothing architecture, and ability to generate and>
> > leverage indexing and caching structures. Druid is typically>
> > deployed in clusters of tens to hundreds of nodes, and has>
> > the ability to load data from Apache Kafka and Apache>
> > Hadoop, among other data sources. Druid offers two query>
> > languages: a SQL dialect (powered by Apache Calcite) and a>
> > JSON-over-HTTP API.>
> >
> > Druid was originally developed to power a slice-and-dice>
> > analytical UI built on top of large event streams. The>
> > original use case for Druid targeted ingest rates of>
> > millions of records/sec, retention of over a year of data,>
> > and query latencies of sub-second to a few seconds. Many>
> > people can benefit from such capability, and many already>
> > have (see http://druid.io/druid-powered.html). In addition,>
> > new use cases have emerged since Druid's original>
> > development, such as OLAP acceleration of data warehouse>
> > tables and more highly concurrent applications operating>
> > with relatively narrower queries.>
> >
> > == Background ==>
> >
> > Druid is a data store designed for fast analytics. It would>
> > typically be used in lieu of more general purpose query>
> > systems like Hadoop MapReduce or Spark when query latency is>
> > of the utmost importance. Druid is often used as a data>
> > store for powering GUI analytical applications.>
> >
> > The buzzwordy description of Druid is a high-performance,>
> > column-oriented, distributed data store. What we mean by>
> > this is:>
> >
> > * "high performance": Druid aims to provide low query>
> > latency and high ingest rates possible.>
> > * "column-oriented": Druid stores data in a column-oriented>
> > format, like most other systems designed for analytics. It>
> > can also store indexes along with the columns.>
> > * "distributed": Druid is deployed in clusters, typically of>
> > tens to hundreds of nodes.>
> > * "data store": Druid loads your data and stores a copy of>
> > it on the cluster's local disks (and may cache it in>
> > memory). It doesn't query your data from some other>
> > storage system.>
> >
> > == Rationale ==>
> >
> > Druid is a mature, active project with a large number of>
> > production installations, dozens of contributors to each>
> > release, and multiple vendors offering professional>
> > support. Given Druid's strong community, its close>
> > integration with many other Apache projects (such as Kafka,>
> > Hadoop, and Calcite), and its pre-existing Apache-inspired>
> > governance structure, we feel that Apache is the best home>
> > for the project on a long-term basis.>
> >
> > == Current Status ==>
> >
> > === Meritocracy ===>
> >
> > Since Druid was first open sourced the original developers>
> > have solicited contributions from others, including through>
> > our blog, the project mailing lists, and through accepting>
> > GitHub pull requests. We have an Apache-inspired governance>
> > structure with a PMC and committers, and our committer ranks>
> > include a good number of people from outside the original>
> > development team.>
> >
> > === Community ===>
> >
> > The Druid core developers have sought to nurture a community>
> > throughout the life of the project. We use GitHub as the>
> > 

Re: [VOTE] Accept Druid into the Apache Incubator

2018-02-23 Thread Jyotirmoy Sundi
+1 Vote

On 2018/02/22 19:03:55, Julian Hyde  wrote: 
> Hi all,> 
> 
> After some discussion on the Druid proposal[1], I'd like to> 
> start a vote on accepting Druid into the Apache Incubator,> 
> per the ASF policy[2] and voting rules[3].> 
> 
> A vote for accepting a new Apache Incubator podling is a> 
> majority vote for which only Incubator PMC member votes are> 
> binding. Votes from other people are also welcome as an> 
> indication of people's enthusiasm (or lack thereof).> 
> 
> Please do not use this VOTE thread for discussions.  If> 
> needed, start a new thread instead.> 
> 
> This vote will run for at least 72 hours. Please VOTE as> 
> follows:> 
>  [ ] +1 Accept Druid into the Apache Incubator> 
>  [ ] +0 Abstain> 
>  [ ] -1 Do not accept Druid into the Apache Incubator> 
> because ...> 
> 
> The proposal is listed below, but you can also access it on> 
> the wiki[4].> 
> 
> Julian> 
> 
> [1] 
> https://lists.apache.org/thread.html/b95f90a30b6e8587e9b108f368b07c1b3e23e25ca592448d9c9f81e2@%3Cgeneral.incubator.apache.org%3E>
>  
> 
> [2] 
> https://incubator.apache.org/policy/incubation.html#approval_of_proposal_by_sponsor>
>  
> 
> [3] http://www.apache.org/foundation/voting.html> 
> 
> [4] https://wiki.apache.org/incubator/DruidProposal> 
> 
> 
> 
> 
> 
> = Druid Proposal => 
> 
> == Abstract ==> 
> 
> Druid is a high-performance, column-oriented, distributed> 
> data store.> 
> 
> == Proposal ==> 
> 
> Druid is an open source data store designed for real-time> 
> exploratory analytics on large data sets. Druid's key> 
> features are a column-oriented storage layout, a distributed> 
> shared-nothing architecture, and ability to generate and> 
> leverage indexing and caching structures. Druid is typically> 
> deployed in clusters of tens to hundreds of nodes, and has> 
> the ability to load data from Apache Kafka and Apache> 
> Hadoop, among other data sources. Druid offers two query> 
> languages: a SQL dialect (powered by Apache Calcite) and a> 
> JSON-over-HTTP API.> 
> 
> Druid was originally developed to power a slice-and-dice> 
> analytical UI built on top of large event streams. The> 
> original use case for Druid targeted ingest rates of> 
> millions of records/sec, retention of over a year of data,> 
> and query latencies of sub-second to a few seconds. Many> 
> people can benefit from such capability, and many already> 
> have (see http://druid.io/druid-powered.html). In addition,> 
> new use cases have emerged since Druid's original> 
> development, such as OLAP acceleration of data warehouse> 
> tables and more highly concurrent applications operating> 
> with relatively narrower queries.> 
> 
> == Background ==> 
> 
> Druid is a data store designed for fast analytics. It would> 
> typically be used in lieu of more general purpose query> 
> systems like Hadoop MapReduce or Spark when query latency is> 
> of the utmost importance. Druid is often used as a data> 
> store for powering GUI analytical applications.> 
> 
> The buzzwordy description of Druid is a high-performance,> 
> column-oriented, distributed data store. What we mean by> 
> this is:> 
> 
> * "high performance": Druid aims to provide low query> 
>   latency and high ingest rates possible.> 
> * "column-oriented": Druid stores data in a column-oriented> 
>   format, like most other systems designed for analytics. It> 
>   can also store indexes along with the columns.> 
> * "distributed": Druid is deployed in clusters, typically of> 
>   tens to hundreds of nodes.> 
> * "data store": Druid loads your data and stores a copy of> 
>   it on the cluster's local disks (and may cache it in> 
>   memory). It doesn't query your data from some other> 
>   storage system.> 
> 
> == Rationale ==> 
> 
> Druid is a mature, active project with a large number of> 
> production installations, dozens of contributors to each> 
> release, and multiple vendors offering professional> 
> support. Given Druid's strong community, its close> 
> integration with many other Apache projects (such as Kafka,> 
> Hadoop, and Calcite), and its pre-existing Apache-inspired> 
> governance structure, we feel that Apache is the best home> 
> for the project on a long-term basis.> 
> 
> == Current Status ==> 
> 
> === Meritocracy ===> 
> 
> Since Druid was first open sourced the original developers> 
> have solicited contributions from others, including through> 
> our blog, the project mailing lists, and through accepting> 
> GitHub pull requests. We have an Apache-inspired governance> 
> structure with a PMC and committers, and our committer ranks> 
> include a good number of people from outside the original> 
> development team.> 
> 
> === Community ===> 
> 
> The Druid core developers have sought to nurture a community> 
> throughout the life of the project. We use GitHub as the> 
> focal point for bug reports and code contributions, and the> 
> mailing lists for most other discussion. To try to make> 
> people feel welcome, we've also spelled 

RE: [VOTE] Accept Druid into the Apache Incubator

2018-02-23 Thread Atul K. Gupta
+1 (non-binding)

-Original Message-
From: Jyotirmoy Sundi [mailto:sundi...@gmail.com] 
Sent: 23 February 2018 02:17
To: general@incubator.apache.org
Subject: Re: [VOTE] Accept Druid into the Apache Incubator

+1 Vote

On 2018/02/22 19:03:55, Julian Hyde  wrote: 
> Hi all,>
> 
> After some discussion on the Druid proposal[1], I'd like to> start a 
> vote on accepting Druid into the Apache Incubator,> per the ASF 
> policy[2] and voting rules[3].>
> 
> A vote for accepting a new Apache Incubator podling is a> majority 
> vote for which only Incubator PMC member votes are> binding. Votes 
> from other people are also welcome as an> indication of people's 
> enthusiasm (or lack thereof).>
> 
> Please do not use this VOTE thread for discussions.  If> needed, start 
> a new thread instead.>
> 
> This vote will run for at least 72 hours. Please VOTE as> follows:>  [ 
> ] +1 Accept Druid into the Apache Incubator>  [ ] +0 Abstain>  [ ] -1 
> Do not accept Druid into the Apache Incubator>
> because ...>
> 
> The proposal is listed below, but you can also access it on> the 
> wiki[4].>
> 
> Julian> 
> 
> [1] 
> https://lists.apache.org/thread.html/b95f90a30b6e8587e9b108f368b07c1b3
> e23e25ca592448d9c9f81e2@%3Cgeneral.incubator.apache.org%3E>
> 
> [2] 
> https://incubator.apache.org/policy/incubation.html#approval_of_propos
> al_by_sponsor>
> 
> [3] http://www.apache.org/foundation/voting.html>
> 
> [4] https://wiki.apache.org/incubator/DruidProposal>
> 
> 
> 
> 
> 
> = Druid Proposal =>
> 
> == Abstract ==>
> 
> Druid is a high-performance, column-oriented, distributed> data 
> store.>
> 
> == Proposal ==>
> 
> Druid is an open source data store designed for real-time> exploratory 
> analytics on large data sets. Druid's key> features are a 
> column-oriented storage layout, a distributed> shared-nothing 
> architecture, and ability to generate and> leverage indexing and 
> caching structures. Druid is typically> deployed in clusters of tens 
> to hundreds of nodes, and has> the ability to load data from Apache 
> Kafka and Apache> Hadoop, among other data sources. Druid offers two 
> query>
> languages: a SQL dialect (powered by Apache Calcite) and a> 
> JSON-over-HTTP API.>
> 
> Druid was originally developed to power a slice-and-dice> analytical 
> UI built on top of large event streams. The> original use case for 
> Druid targeted ingest rates of> millions of records/sec, retention of 
> over a year of data,> and query latencies of sub-second to a few 
> seconds. Many> people can benefit from such capability, and many 
> already> have (see http://druid.io/druid-powered.html). In addition,> 
> new use cases have emerged since Druid's original> development, such 
> as OLAP acceleration of data warehouse> tables and more highly 
> concurrent applications operating> with relatively narrower queries.>
> 
> == Background ==>
> 
> Druid is a data store designed for fast analytics. It would> typically 
> be used in lieu of more general purpose query> systems like Hadoop 
> MapReduce or Spark when query latency is> of the utmost importance. 
> Druid is often used as a data> store for powering GUI analytical 
> applications.>
> 
> The buzzwordy description of Druid is a high-performance,> 
> column-oriented, distributed data store. What we mean by> this is:>
> 
> * "high performance": Druid aims to provide low query> 
>   latency and high ingest rates possible.>
> * "column-oriented": Druid stores data in a column-oriented> 
>   format, like most other systems designed for analytics. It> 
>   can also store indexes along with the columns.>
> * "distributed": Druid is deployed in clusters, typically of> 
>   tens to hundreds of nodes.>
> * "data store": Druid loads your data and stores a copy of> 
>   it on the cluster's local disks (and may cache it in> 
>   memory). It doesn't query your data from some other> 
>   storage system.>
> 
> == Rationale ==>
> 
> Druid is a mature, active project with a large number of> production 
> installations, dozens of contributors to each> release, and multiple 
> vendors offering professional> support. Given Druid's strong 
> community, its close> integration with many other Apache projects 
> (such as Kafka,> Hadoop, and Calcite), and its pre-existing 
> Apache-inspired> governance structure, we feel that Apache is the best 
> home> for the project on a long-term basis.>
> 
> == Current Status ==>
> 
> === Meritocracy ===>
> 
> Since Druid was first open sourced the original developers> have 
> solicited contributions from others, including through> our blog, the 
> project mailing lists, and through accepting> GitHub pull requests. We 
> have an Apache-inspired governance> structure with a PMC and 
> committers, and our committer ranks> include a good number of people 
> from outside the original> development team.>
> 
> === Community ===>
> 
> The Druid core developers have sought to nurture a community> 
> throughout the life of the project. We use GitHub as the> f