I do not have a very informed opinion here, so take this with a grain of salt.

I'd say that we need to either commit a coherent version of this for
Spark 3, or not at all. If it doesn't have support, I'd back out the
existing changes.
I was initially skeptical about how much this needs to be in Spark vs
a third-party package, and that still stands.

The addition of another dependency isn't that big a deal IMHO, but,
yes, it does add something to the maintenance overhead. But that's all
the more true of a new module.

I don't feel strongly about it, but if this isn't obviously getting
support from any committers, can we keep it as a third party library
for now?


On Tue, Oct 15, 2019 at 8:53 PM Weichen Xu <weichen...@databricks.com> wrote:
>
> Hi Mats Rydberg,
>
> Although this dependency "org.opencypher:okapi-shade.okapi" was added into 
> spark, but Xiangrui raised two concerns (see above mail) about it, so we'd 
> better rethink on this and consider whether this is a good choice, so I call 
> this vote.
>
> Thanks!
>
> On Tue, Oct 15, 2019 at 10:56 PM Mats Rydberg <m...@neo4j.org.invalid> wrote:
>>
>> Hello Weichen, community
>>
>> I'm sorry, I'm feeling a little bit confused about this vote. Is this about 
>> the PR (https://github.com/apache/spark/pull/24490) that was merged in early 
>> June and introduced the spark-graph module including the okapi-shade 
>> dependency?
>>
>> Regarding the okapi-shade dependency which was developed as part of the 
>> above PR work, some advice was offered by Scala experts at TripleQuote which 
>> helped find a satisfactory solution. The shading mechanism used is standard 
>> and very comparable to a Java library shading solution.
>>
>> The PR you link (https://github.com/apache/spark/pull/24297) is not meant 
>> for merging. It is just a proof-of-concept branch containing a full 
>> implementation of the system, which is kept up-to-date with the API 
>> discussion on the currently proposed PR: 
>> https://github.com/apache/spark/pull/24851.
>>
>> Thank you
>> Mats
>>
>>
>> On Tue, Oct 15, 2019 at 10:38 AM Weichen Xu <weichen...@databricks.com> 
>> wrote:
>>>
>>> Hi everyone,
>>>
>>> I'd like to call a new vote on the issue: should we add dependency 
>>> "org.opencypher:okapi-shade.okapi" into spark ? The issue background is:
>>>
>>> Spark is going to add a big feature "Spark Graph", the prototypical 
>>> implementation is here
>>> https://github.com/apache/spark/pull/24297
>>> which will introduce dependency org.opencypher:okapi-shade.okapi
>>>
>>> Xiangrui already mentioned 2 concerns on this dependency change:
>>>>
>>>> On the technical side, my main concern is the runtime dependency on 
>>>> org.opencypher:okapi-shade.okapi depends on several Scala libraries. We 
>>>> came out with the solution to shade a few Scala libraries to avoid 
>>>> pollution. However, I'm not super confident that the approach is 
>>>> sustainable for two reasons: a) there exists no proper shading libraries 
>>>> for Scala, 2) We will have to wait for upgrades from those Scala libraries 
>>>> before we can upgrade Spark to use a newer Scala version. So it would be 
>>>> great if some Scala experts can help review the current implementation and 
>>>> help assess the risk.
>>>
>>>
>>> So let's discuss and vote whether this is a good choice.
>>> Before this spark graph feature to get into spark ASAP, this issue should 
>>> be resolved first.
>>>
>>> This vote is open until next Tuseday (Oct. 22).
>>>
>>> [ ] +1: Accept the proposal
>>> [ ] +0
>>> [ ] -1: I don't think this is a good idea because ...
>>>
>>> Thank you!
>>>
>>> Weichen
>>>

---------------------------------------------------------------------
To unsubscribe e-mail: dev-unsubscr...@spark.apache.org

Reply via email to