Hey Matt,

This is cool to see. To be clear, this is an implementation of Spark 
DataSourceV2 using Arrow Flight?

I think the questions I have are:

- Does this belong under Arrow, or under Spark - I lean towards it being closer 
to Spark than Arrow;
- How would we do the testing with Spark;
- What else do you envision as being part of 'productionize';
- Whether this should perhaps target a higher level protocol like Flight SQL or 
ADBC instead.

I'm also curious if you have a particular use case in mind for the connector. 
Finally,
this may need to go through the IP clearance cycle since it's been developed 
for a while outside of the Apache organization.

-David

On Mon, Oct 17, 2022, at 15:06, Matt Phelps wrote:
> Hi Arrow team,
>
> I’d like to make a contribution to the Arrow project by submitting an 
> existing Spark connector for Flight. Ryan Murray and Kyle Brooks wrote 
> a Flight implementation (“flight-spark”) of Spark’s DataSource V2 
> interface found here: https://github.com/rymurr/flight-spark-source. We 
> would like to productionize this effort and are looking for advice on 
> where it should live within Arrow. We had envisioned that this 
> connector would align well with the existing functionality under 
> Arrow’s java/flight.
>
> Thanks,
> Matt Phelps

Reply via email to