Re: [VOTE] SPIP: XML data source support

2023-07-28 Thread Jia Fan
+ 1 > 2023年7月29日 13:06,Adrian Pop-Tifrea 写道: > > +1, the more data source formats, the better, and if the solution is already > thoroughly tested, I say we should go for it. > > On Sat, Jul 29, 2023, 06:35 Xiao Li > wrote: >> +1 >> >> On Fri, Jul 28, 2023 at 15

Re: [VOTE] SPIP: XML data source support

2023-07-28 Thread Adrian Pop-Tifrea
+1, the more data source formats, the better, and if the solution is already thoroughly tested, I say we should go for it. On Sat, Jul 29, 2023, 06:35 Xiao Li wrote: > +1 > > On Fri, Jul 28, 2023 at 15:54 Sean Owen wrote: > >> +1 I think that porting the package 'as is' into Spark is probably >

Re: [VOTE] SPIP: XML data source support

2023-07-28 Thread Xiao Li
+1 On Fri, Jul 28, 2023 at 15:54 Sean Owen wrote: > +1 I think that porting the package 'as is' into Spark is probably > worthwhile. > That's relatively easy; the code is already pretty battle-tested and not > that big and even originally came from Spark code, so is more or less > similar alread

Re: [VOTE] SPIP: XML data source support

2023-07-28 Thread Sean Owen
+1 I think that porting the package 'as is' into Spark is probably worthwhile. That's relatively easy; the code is already pretty battle-tested and not that big and even originally came from Spark code, so is more or less similar already. One thing it never got was DSv2 support, which means XML re

[VOTE] SPIP: XML data source support

2023-07-28 Thread Sandip Agarwala
Dear Spark community, I would like to start the vote for "SPIP: XML data source support". XML is a widely used data format. An external spark-xml package ( https://github.com/databricks/spark-xml) is available to read and write XML data in spark. Making spark-xml built-in will provide a better us

Re: Apache Arrow integration issue with Spark involving Netty

2023-07-28 Thread Dane Pitkin
Update! Netty has reverted the affecting change in v4.1.96. See netty commit here[1] and arrow PR to upgrade here[2]. The upcoming release of arrow-memory-netty v13 should work with netty versions <4.1.94 and >=4.1.96. [1] https://github.com/netty/netty/commit/dc16c5818a5cd0711f17e0a966783cdc84c9

Time for Spark 3.3.3 release?

2023-07-28 Thread Yuming Wang
Hi Spark devs, Since Apache Spark 3.3.2 tag creation (Feb 11), 60 patches have arrived at branch-3.3. Shall we make a new release, Apache Spark 3.3.3, as the third release at branch-3.3? I'd like to volunteer as the release manager for