Il mar 20 feb 2018, 12:05 Enrico Olivelli <eolive...@gmail.com> ha scritto:

> Il mar 20 feb 2018, 10:57 Sijie Guo <guosi...@gmail.com> ha scritto:
>
>> Hi all,
>>
>> Hope you guys have gone through the BP.
>>
>> I've also made the repo of this table service that we have been working on
>> public. https://github.com/streamlio/stream-storage
>>
>> I would like to call a vote for making this work as a contrib module in
>> bookkeeper for 4.7 as a preview, so we can continue the development of
>> this
>> idea for bookkeeper metadata storage in subsequent releases. Like how
>> bookkeeper was developed from a contrib module in zookeeper before.
>>
>> Here is the PR for adding this BP to bookkeeper_proposals page:
>> https://github.com/apache/bookkeeper/pull/1185
>>
>> Please take a look and vote your opinions.
>>
>
> So I have to put my +1 on github or will you start an email thread?
>

Sent on github
Enrico

>
>> - Sijie
>>
>>
>>
>>
>>
>>
>>
>>
>>
>>
>> On Sun, Feb 11, 2018 at 5:07 PM, Sijie Guo <guosi...@gmail.com> wrote:
>>
>> > Thanks JV and Encrico.
>> >
>> > I would like to include this as a contrib in bookkeeper for 4.7 like
>> > bookkeeper was grown from a contrib in zookeeper before.
>> >
>> > So if the idea sounds good to you guys, and if you guys think this is
>> > aligned with bookkeeper roadmap, let’s try to move this forward with a
>> > contrib module in bookkeeper and continue the development in bookkeeper.
>> >
>> > If there is no major concerns, I would like to call a vote for this
>> week.
>> >
>> > Sijie
>> >
>> >
>> > On Thu, Feb 8, 2018 at 12:01 AM Venkateswara Rao Jujjuri <
>> > jujj...@gmail.com> wrote:
>> >
>> >> A great step to move forward. BP-29 and BP-30 along with reorganizing
>> ZK
>> >> will help the BK to shape perfect MDS abstraction.
>> >> While BP-30 is ambitious, it is a perfect way to start ambitious
>> projects.
>> >> :)
>> >>
>> >> JV
>> >>
>> >> On Wed, Feb 7, 2018 at 6:49 AM, Enrico Olivelli <eolive...@gmail.com>
>> >> wrote:
>> >>
>> >> > It is very interesting! Thank you.
>> >> > I will look into it soon
>> >> >
>> >> > Enrico
>> >> >
>> >> > Il mer 7 feb 2018, 15:24 Sijie Guo <guosi...@gmail.com> ha scritto:
>> >> >
>> >> > > Hi all,
>> >> > >
>> >> > > I started a proposal of contributing a table (aka key/value)
>> service
>> >> > > component as a contrib module to the bookkeeper community. This BP
>> >> > together
>> >> > > with other BPs I sent last week forms the idea of how we can do on
>> >> > > improving metadata management in bookkeeper (I will talk a bit
>> more at
>> >> > the
>> >> > > end of this email).
>> >> > >
>> >> > > **why it was developed**
>> >> > >
>> >> > > Two main categories of use cases were driving the need of a
>> key/value
>> >> > like
>> >> > > service.
>> >> > >
>> >> > > One is metadata storage, bookkeeper needs a key/value like storage
>> >> > > (currently it is zookeeper) to store the ledger's metadata, systems
>> >> built
>> >> > > on top of bookkeeper like distributedlog/pulsar also follow the
>> >> pattern
>> >> > > that bookkeeper is using. They all need a key/value like storage to
>> >> store
>> >> > > their metadata. We all know zookeeper is the bottleneck of the
>> >> > scalability.
>> >> > > And it is also an issue marker to production systems (based on my
>> >> biased
>> >> > > production experiences).
>> >> > >
>> >> > > The other one is state storage in real-time/streaming
>> >> > > analytics/computation. In streaming analytics, the computation jobs
>> >> > usually
>> >> > > process streaming data. they usually need to store some sort of
>> state
>> >> of
>> >> > > the computation operators into a storage and serve the computation
>> >> state
>> >> > as
>> >> > > final results for queries. Those state are usually represented in
>> >> > key/value
>> >> > > forms, and usually backed by wal. BookKeeper has been used in this
>> >> area
>> >> > via
>> >> > > distributedlog/pulsar for storing and serving log / streaming data.
>> >> It is
>> >> > > ideal for bookkeeper also able to store and serve state data for
>> the
>> >> sake
>> >> > > of unification, simplification and also reducing the complexity of
>> >> > > deployment and operations.
>> >> > >
>> >> > > Hence we prototyped/developed a table service component as an
>> add-on
>> >> to
>> >> > > bookkeeper. We'd like to contribute this as a contrib module to
>> >> > bookkeeper
>> >> > > and continue the development, integration and evaluation in the
>> >> > bookkeeper
>> >> > > community.
>> >> > >
>> >> > > We hope this can be like bookkeeper in zookeeper. bookkeeper was a
>> >> > contrib
>> >> > > module in zookeeper, and it is developed in the community and grown
>> >> into
>> >> > > what it is now.
>> >> > >
>> >> > > **how it is aligned with metadata storage**
>> >> > >
>> >> > > BP-28, BP-29 and BP-30. They are related at some extend.
>> >> > >
>> >> > > BP-28 is more a cleanup proposal to carry-on Jia's work (on service
>> >> > > discovery interfaces). This is to produce a clean metadata api
>> module,
>> >> > > define a clean dependency between
>> >> > > bookkeeper implementation and metadata service, and allow we really
>> >> > plugin
>> >> > > different
>> >> > > metadata services without touching/changing bookkeeper
>> implementation.
>> >> > >
>> >> > > BP-29 and BP-30 can be thought as two different metadata service
>> >> > > implementation based
>> >> > > on the metadata api contract defined in BP-28.
>> >> > >
>> >> > > BP-29 is to use Etcd as the metadata service, while BP-30 is to
>> have a
>> >> > > built-in key/value service as the metadata service. Both BP-29 and
>> >> BP-30
>> >> > > have pros and cons. However they
>> >> > > are not against to each other. Allowing two concurrent approaches
>> will
>> >> > help
>> >> > > us understand
>> >> > > more on metadata management in bookkeeper and its ecosystem (e.g.
>> >> dlog,
>> >> > > pulsar), which
>> >> > > will lead the project head in a healthy direction.
>> >> > >
>> >> > > **Proposed Changes**
>> >> > >
>> >> > > This proposal is to propose this table service as a contrib module
>> >> under
>> >> > > `stream` directory just as how we handle `dlog`. We can mark it as
>> >> > > "preview"/"alpha" in 4.7 and continue the development of this
>> module
>> >> in
>> >> > > bookkeeper community.
>> >> > >
>> >> > > The details of the proposal can be found in the google doc attached
>> >> > below:
>> >> > >
>> >> > >
>> >> > > https://docs.google.com/document/d/155xAwWv5IdOitHh1NVMEwCMGgB28M
>> >> > 3FyMiQSxEpjE-Y/edit#heading=h.56rbh52koe3f
>> >> > >
>> >> > > Please take a look. Comments are welcome.
>> >> > >
>> >> > > - Sijie
>> >> > >
>> >> >
>> >> >
>> >> > --
>> >> >
>> >> >
>> >> > -- Enrico Olivelli
>> >> >
>> >>
>> >>
>> >>
>> >> --
>> >> Jvrao
>> >> ---
>> >> First they ignore you, then they laugh at you, then they fight you,
>> then
>> >> you win. - Mahatma Gandhi
>> >>
>> >
>>
> --
>
>
> -- Enrico Olivelli
>
-- 


-- Enrico Olivelli

Reply via email to