Hello all,

I've addressed the areas with comments with updated explanations and
responses where necessary.

Please do have a quick read if you have time.
I shall follow-up with these datas as markdown changes to beam site in a
couple of days for feedback.

Thanks!

Herman Mak |  Customer Engineer, Hong Kong, Google Cloud |
herman...@google.com |  +852-3923-5417





On Sat, Dec 17, 2022 at 2:13 AM Andrew Pilloud <apill...@google.com> wrote:

> By "Relational" I mean things like: Column Pruning, Filter Pushdown, Table
> Statistics, Partition Metadata, Metastore. We have a bunch of one-off
> implementations in various IOs (mostly BigQueryIO) and have been waiting
> for IO standards to push them out to all IOs. This was section "F5 -
> Relational" from https://s.apache.org/beam-io-api-standard-documentation
>
> On Thu, Dec 15, 2022 at 6:50 PM Herman Mak <herman...@google.com> wrote:
>
>> Hey all,
>>
>> Firstly apologies for the confusion.
>>
>> The scope of this effort is to *finalize and have this added to the Beam
>> public documentation* to be used as a PR reference once we have resolved
>> the comments.
>> YES this document is a continuation of the below docs with some
>> additional components such as testing!
>>
>> The idea is to convert this to a MD file and add a page under "Developing
>> new I/O connectors" with some small cleanup work around this area in other
>> pages.
>> [image: image.png]
>>
>>
>>
>>
>> Docs that this is a continuation of:
>> https://s.apache.org/beam-io-api-standard-documentation
>> https://s.apache.org/beam-io-api-standard
>>
>>
>> @Andrew Pilloud <apill...@google.com> Totally not intending to start
>> from the beginning here, by relational do you mean having this hosting in
>> the Beam confluence?
>>
>> Thanks all, and keep the feedback to the docs coming
>>
>> Herman Mak |  Customer Engineer, Hong Kong, Google Cloud |
>> herman...@google.com |  +852-3923-5417 <+852%203923%205417>
>>
>>
>>
>>
>>
>> On Fri, Dec 16, 2022 at 1:36 AM Chamikara Jayalath <chamik...@google.com>
>> wrote:
>>
>>>
>>>
>>> On Thu, Dec 15, 2022, 8:33 AM Alexey Romanenko <aromanenko....@gmail.com>
>>> wrote:
>>>
>>>> Cham, do you remember what was a reason to not finalise that doc?
>>>>
>>>
>>> I think this is a continuation of those docs (so we are trying to
>>> finalize) but probably  Herman can explain better.
>>>
>>>
>>>> Personally, I find having such standards very useful (if they are
>>>> flexible during a time, of course), especially for new developers and PR
>>>> reviewers, and it’d be great to finally have such doc as a part of
>>>> contribution guide.
>>>>
>>>
>>> +1
>>>
>>> Thanks,
>>> Cham
>>>
>>>>
>>>> —
>>>> Alexey
>>>>
>>>> On 13 Dec 2022, at 04:32, Chamikara Jayalath via dev <
>>>> dev@beam.apache.org> wrote:
>>>>
>>>> Yeah, I don't think either finalized or documented (in the Website) the
>>>> previous iteration. This doc seems to contain details from the documents
>>>> shared in the previous iteration.
>>>>
>>>> Thanks,
>>>> Cham
>>>>
>>>>
>>>>
>>>> On Mon, Dec 12, 2022 at 6:49 PM Robert Burke <rob...@frantil.com>
>>>> wrote:
>>>>
>>>>> I think ultimately: until the docs a clearly available on the Beam
>>>>> site itself, it's not documentation. See also, design docs, previous
>>>>> emails, and similar.
>>>>>
>>>>> On Mon, Dec 12, 2022, 6:07 PM Andrew Pilloud via dev <
>>>>> dev@beam.apache.org> wrote:
>>>>>
>>>>>> I believe the previous iteration was here:
>>>>>> https://lists.apache.org/thread/3o8glwkn70kqjrf6wm4dyf8bt27s52hk
>>>>>>
>>>>>> The associated docs are:
>>>>>> https://s.apache.org/beam-io-api-standard-documentation
>>>>>> https://s.apache.org/beam-io-api-standard
>>>>>>
>>>>>> This is missing all the relational stuff that was in those docs, this
>>>>>> appears to be another attempt starting from the beginning?
>>>>>>
>>>>>> Andrew
>>>>>>
>>>>>>
>>>>>> On Mon, Dec 12, 2022 at 9:57 AM Alexey Romanenko <
>>>>>> aromanenko....@gmail.com> wrote:
>>>>>>
>>>>>>> Thanks for writing this!
>>>>>>>
>>>>>>> IIRC, the similar design doc was sent for review here a while ago.
>>>>>>> Is this just an updated version and a new one?
>>>>>>>
>>>>>>> —
>>>>>>> Alexey
>>>>>>>
>>>>>>> On 11 Dec 2022, at 15:16, Herman Mak via dev <dev@beam.apache.org>
>>>>>>> wrote:
>>>>>>>
>>>>>>> Hello Everyone,
>>>>>>>
>>>>>>> *TLDR*
>>>>>>>
>>>>>>> Should we adopt a set of standards that Connector I/Os should adhere
>>>>>>> to?
>>>>>>> Attached is a first version of a Beam I/O Standards guideline that
>>>>>>> includes opinionated best practices across important components of a
>>>>>>> Connector I/O, namely Documentation, Development and Testing.
>>>>>>>
>>>>>>> *The Long Version*
>>>>>>>
>>>>>>> Apache Beam is a unified open-source programming model for both
>>>>>>> batch and streaming. It runs on multiple platform runners and integrates
>>>>>>> with over 50 services using individually developed I/O Connectors
>>>>>>> <https://beam.apache.org/documentation/io/connectors/>.
>>>>>>>
>>>>>>> Given that Apache Beam connectors are written by many different
>>>>>>> developers and at varying points in time, they vary in syntax style,
>>>>>>> documentation completeness and testing done. For a new adopter of Apache
>>>>>>> Beam, that can definitely cause some uncertainty.
>>>>>>>
>>>>>>> So should we adopt a set of standards that Connector I/Os should
>>>>>>> adhere to?
>>>>>>> Attached is a first version, in Doc format, of a Beam I/O Standards
>>>>>>> guideline that includes opinionated best practices across important
>>>>>>> components of a Connector I/O, namely Documentation, Development and
>>>>>>> Testing. And the aim is to incorporate this into the documentation and 
>>>>>>> to
>>>>>>> have it referenced as standards for new Connector I/Os (and ideally have
>>>>>>> existing Connectors upgraded over time). If it looks helpful, the 
>>>>>>> immediate
>>>>>>> next step is that we can convert it into a .md as a PR into the Beam 
>>>>>>> repo!
>>>>>>>
>>>>>>> Thanks and looking forward to feedbacks and discussion,
>>>>>>>
>>>>>>>  [PUBLIC] Beam I/O Standards
>>>>>>> <https://docs.google.com/document/d/1BCTpSZDUjK90hYZjcn8aAnPd9vuRfj8YU1j3mpSgRwI/edit?usp=drive_web>
>>>>>>>
>>>>>>> Herman Mak |  Customer Engineer, Hong Kong, Google Cloud |
>>>>>>> herman...@google.com |  +852-3923-5417 <+852%203923%205417>
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>
>>>>

Reply via email to