Re: [VOTE] SPIP: Storage Partitioned Join for Data Source V2

Russell Spitzer Fri, 29 Oct 2021 10:49:17 -0700

+1 This is a great idea, (I have no Apache Spark voting points)

On Fri, Oct 29, 2021 at 12:41 PM L. C. Hsieh <[email protected]> wrote:


>
> I'll start with my +1.
>
> On 2021/10/29 17:30:03, L. C. Hsieh <[email protected]> wrote:
> > Hi all,
> >
> > I’d like to start a vote for SPIP: Storage Partitioned Join for Data
> Source V2.
> >
> > The proposal is to support a new type of join: storage partitioned join
> which
> > covers bucket join support for DataSourceV2 but is more general. The goal
> > is to let Spark leverage distribution properties reported by data
> sources and
> > eliminate shuffle whenever possible.
> >
> > Please also refer to:
> >
> >    - Previous discussion in dev mailing list: [DISCUSS] SPIP: Storage
> Partitioned Join for Data Source V2
> >    <
> https://lists.apache.org/thread.html/r7dc67c3db280a8b2e65855cb0b1c86b524d4e6ae1ed9db9ca12cb2e6%40%3Cdev.spark.apache.org%3E
> >
> >    .
> >    - JIRA: SPARK-37166 <
> https://issues.apache.org/jira/browse/SPARK-37166>
> >    - Design doc <
> https://docs.google.com/document/d/1foTkDSM91VxKgkEcBMsuAvEjNybjja-uHk-r3vtXWFE>
>
> >
> > Please vote on the SPIP for the next 72 hours:
> >
> > [ ] +1: Accept the proposal as an official SPIP
> > [ ] +0
> > [ ] -1: I don’t think this is a good idea because …
> >
> > ---------------------------------------------------------------------
> > To unsubscribe e-mail: [email protected]
> >
> >
>
> ---------------------------------------------------------------------
> To unsubscribe e-mail: [email protected]
>
>

Re: [VOTE] SPIP: Storage Partitioned Join for Data Source V2

Reply via email to