How about renaming it to `connectors` if docker is the only exception and will be moved out?
On Sat, Mar 19, 2022 at 6:18 PM Alkis Evlogimenos <alkis.evlogime...@databricks.com.invalid> wrote: > It looks like renaming the directory and moving components can be separate > steps. If there is consensus that connectors will move out, should the > directory be named misc for everything else until there is some direction > for the remaining modules? > > On Fri, 18 Mar 2022 at 03:03 Jungtaek Lim <kabhwan.opensou...@gmail.com> > wrote: > >> Avro reader is technically a connector. We eventually called data source >> implementation "connector" as well; the package name in the catalyst >> represents it. >> >> Docker is something I'm not sure fits with the name "external". It >> probably deserves a top level directory now, since we start to release an >> official docker image. That does not seem to be an experimental one. >> >> Except Docker, all modules in the external directory are "sort of" >> connectors. Ganglia metric sink is an exception, but it is still a kind of >> connector for Dropwizard. >> (It might be interesting to see how many users are still using >> kinesis-asl and ganglia-lgpl modules. We have had almost no updates for >> DStream for several years.) >> >> If we agree with my proposal for docker, remaining is going to be >> effectively a rename. I don't have a strong opinion, just wanted to avoid >> the external directory to become/remain miscellaneous one. >> >> On Fri, Mar 18, 2022 at 10:04 AM Sean Owen <sro...@gmail.com> wrote: >> >>> I sympathize, but might be less change to just rename the dir. There is >>> more in there like the avro reader; it's kind of miscellaneous. I think we >>> might want fewer rather than more top level dirs. >>> >>> On Thu, Mar 17, 2022 at 7:33 PM Jungtaek Lim < >>> kabhwan.opensou...@gmail.com> wrote: >>> >>>> We seem to just focus on how to avoid the conflict with the name >>>> "external" used in bazel. Since we consider the possibility of renaming, >>>> why not revisit the modules "external" contains? >>>> >>>> Looks like kinds of the modules external directory contains are 1) >>>> Docker 2) Connectors 3) Sink on Dropwizard metrics (only ganglia here, and >>>> it seems to be just that Ganglia is LGPL) >>>> >>>> Would it make sense if each kind deserves a top directory? We can >>>> probably give better generalized names, and as a side-effect we will no >>>> longer have "external". >>>> >>>> On Fri, Mar 18, 2022 at 5:45 AM Dongjoon Hyun <dongjoon.h...@gmail.com> >>>> wrote: >>>> >>>>> Thank you for posting this, Alkis. >>>>> >>>>> Before the question (1) and (2), I'm curious if the Apache Spark >>>>> community has other downstreams using Bazel. >>>>> >>>>> To All. If there are some Bazel users with Apache Spark code, could >>>>> you share your practice? If you are using renaming, what is your renamed >>>>> directory name? >>>>> >>>>> Dongjoon. >>>>> >>>>> >>>>> On Thu, Mar 17, 2022 at 11:56 AM Alkis Evlogimenos >>>>> <alkis.evlogime...@databricks.com.invalid> wrote: >>>>> >>>>>> AFAIK there is not. `external` has been baked in bazel since the >>>>>> beginning and there is no plan from bazel devs to attempt to fix this >>>>>> <https://github.com/bazelbuild/bazel/issues/4508#issuecomment-724055371> >>>>>> . >>>>>> >>>>>> On Thu, Mar 17, 2022 at 7:52 PM Sean Owen <sro...@gmail.com> wrote: >>>>>> >>>>>>> Just checking - there is no way to tell bazel to look somewhere else >>>>>>> for whatever 'external' means to it? >>>>>>> It's a kinda big ugly change but it's not a functional change. If >>>>>>> anything it might break some downstream builds that rely on the current >>>>>>> structure too. But such is life for developers? I don't have a strong >>>>>>> reason we can't. >>>>>>> >>>>>>> On Thu, Mar 17, 2022 at 1:47 PM Alkis Evlogimenos >>>>>>> <alkis.evlogime...@databricks.com.invalid> wrote: >>>>>>> >>>>>>>> Hi Spark devs. >>>>>>>> >>>>>>>> The Apache Spark repo has a top level external/ directory. This is >>>>>>>> a reserved name for the bazel build system and it causes all sorts of >>>>>>>> problems: some can be worked around and some cannot (for some details >>>>>>>> on >>>>>>>> one that cannot see >>>>>>>> https://github.com/hedronvision/bazel-compile-commands-extractor/issues/30 >>>>>>>> ). >>>>>>>> >>>>>>>> Some forks of Apache Spark use bazel as a build system. It would be >>>>>>>> nice if we can make this change in Apache Spark without resorting to >>>>>>>> complex renames/merges whenever changes are pulled from upstream. >>>>>>>> >>>>>>>> As such I proposed to rename external/ directory to want to rename >>>>>>>> the external/ directory to something else [SPARK-38569 >>>>>>>> <https://issues.apache.org/jira/browse/SPARK-38569>]. I also sent >>>>>>>> a tentative [PR-35874 <https://github.com/apache/spark/pull/35874>] >>>>>>>> that renames external/ to vendor/. >>>>>>>> >>>>>>>> My questions to you are: >>>>>>>> 1. Are there any objections to renaming external to X? >>>>>>>> 2. Is vendor a good new name for external? >>>>>>>> >>>>>>>> Cheers, >>>>>>>> >>>>>>>