BTW the largest change to SS right now is probably the entire data source API v2 effort, which aims to unify streaming and batch from data source perspective, and provide a reliable, expressive source/sink API.
On Mon, Jan 14, 2019 at 5:34 PM, Reynold Xin < r...@databricks.com > wrote: > > There are a few things to keep in mind: > > > > 1. Structured Streaming isn't an independent project. It actually (by > design) depends on all the rest of Spark SQL, and virtually all > improvements to Spark SQL benefit Structured Streaming. > > > > 2. The project as far as I can tell is relatively mature for core ETL and > incremental processing purpose. I interact with a lot of users using it > everyday. We can always expand the use cases and add more, but that also > adds maintenance burden. In any case, it'd be good to get some activity > here. > > > > > > > > > On Mon, Jan 14, 2019 at 5:11 PM, Nicholas Chammas < nicholas. chammas@ gmail. > com ( nicholas.cham...@gmail.com ) > wrote: > >> As an observer, this thread is interesting and concerning. Is there an >> emerging consensus that Structured Streaming is somehow not relevant >> anymore? Or is it just that folks consider it "complete enough"? >> >> >> Structured Streaming was billed as the replacement to DStreams. If >> committers, generally speaking, have lost interest in Structured >> Streaming, does that mean the Apache Spark project is somehow no longer >> aiming to provide a "first-class" solution to the problem of stream >> processing? >> >> On Mon, Jan 14, 2019 at 3:43 PM Jungtaek Lim < kabhwan@ gmail. com ( >> kabh...@gmail.com ) > wrote: >> >> >>> Cody, I guess I already addressed your comments in the PR (#22138). The >>> approach was changed to address your concern, and after that Gabor helped >>> to review the PR. Please take a look again when you have time to get into. >>> >>> >>> >>> 2019년 1월 15일 (화) 오전 1:01, Cody Koeninger < cody@ koeninger. org ( >>> c...@koeninger.org ) >님이 작성: >>> >>> >>>> I feel like I've already said my piece on >>>> https:/ / github. com/ apache/ spark/ pull/ 22138 ( >>>> https://github.com/apache/spark/pull/22138 ) let me know if you have >>>> more questions. >>>> >>>> As for SS in general, I don't have a production SS deployment, so I'm >>>> less comfortable with reviewing large changes to it. But if no other >>>> committers are working on it... >>>> >>>> On Sun, Jan 13, 2019 at 5:19 PM Sean Owen < srowen@ gmail. com ( >>>> sro...@gmail.com ) > wrote: >>>> > >>>> > Yes you're preaching to the choir here. SS does seem somewhat >>>> > abandoned by those that have worked on it. I have also been at times >>>> > frustrated that some areas fall into this pattern. >>>> > >>>> > There isn't a way to make people work on it, and I personally am not >>>> > interested in it nor have a background in SS. >>>> > >>>> > I did leave some comments on your PR and will see if we can get >>>> > comfortable with merging it, as I presume you are pretty knowledgeable >>>> > about the change. >>>> > >>>> > On Sun, Jan 13, 2019 at 4:55 PM Jungtaek Lim < kabhwan@ gmail. com ( >>>> kabh...@gmail.com ) > wrote: >>>> > > >>>> > > Sean, this is actually a fail-back on pinging committers. I know who >>>> can review and merge in SS area, and pinged to them, didn't work. Even >>>> there's a PR which approach was encouraged by committer and reviewed the >>>> first phase, and no review. >>>> > > >>>> > > That's not the first time I have faced the situation, and I used the >>>> fail-back approach at that time. (You can see there was no response even >>>> in the mail thread.) Not sure which approach worked. >>>> > > https:/ / lists. apache. org/ thread. html/ >>>> > > c61f32249949b1ff1b265c1a7148c2ea7eda08891e3016fb24008561@ >>>> %3Cdev. spark. apache. org%3E ( >>>> https://lists.apache.org/thread.html/c61f32249949b1ff1b265c1a7148c2ea7eda08891e3016fb24008561@%3Cdev.spark.apache.org%3E >>>> ) >>>> > > >>>> > > I've observed that only (critical) bugfixes are being reviewed and >>>> merged in time for SS area. For other stuffs like new features and >>>> improvements, both discussions and PRs were pretty less popular from >>>> committers: though there was even participation/approve from non-committer >>>> community. I don't think SS is the thing to be turned into maintenance. >>>> > > >>>> > > I guess PMC members should try to resolve such situation, as it will >>>> (slowly and quietly) make some issues like contributors leaving, module >>>> stopped growing up, etc.. The problem will grow up like a snowball: >>>> getting bigger and bigger. I don't mind if there's no interest on both >>>> contributors and committers for such module, but SS is not. Maybe either >>>> other committers who weren't familiar with should try to get familiar and >>>> cover the area, or the area needs more committers. >>>> > > >>>> > > -Jungtaek Lim (HeartSaVioR) >>>> > > >>>> > > 2019년 1월 13일 (일) 오후 11:37, Sean Owen < srowen@ gmail. com ( >>>> sro...@gmail.com ) >님이 작성: >>>> > >> >>>> > >> Jungtaek, the best strategy is to find who wrote the code you are >>>> > >> modifying (use Github history or git blame) and ping them directly on >>>> >>>> > >> the PR. I don't know this code well myself. >>>> > >> It also helps if you can address why the functionality is important, >>>> > >> and describe compatibility implications. >>>> > >> >>>> > >> Most PRs are not merged, note. Not commenting on this particular one, >>>> >>>> > >> but it's not a 'bug' if it's not being merged. >>>> > >> >>>> > >> On Sun, Jan 13, 2019 at 12:29 AM Jungtaek Lim < kabhwan@ gmail. com ( >>>> kabh...@gmail.com ) > wrote: >>>> > >> > >>>> > >> > I'm sorry but let me remind this, as non-SS PRs are being reviewed >>>> accordingly, whereas many of SS PRs (regardless of who create) are still >>>> not reviewed and merged in time. >>>> > >> > >>>> > >> > 2019년 1월 3일 (목) 오전 7:57, Jungtaek Lim < kabhwan@ gmail. com ( >>>> kabh...@gmail.com ) >님이 작성: >>>> > >> >> >>>> > >> >> Spark devs, happy new year! >>>> > >> >> >>>> > >> >> I would like to remind this kindly, since there was actually no >>>> review after initiating the thread. >>>> > >> >> >>>> > >> >> Thanks, >>>> > >> >> Jungtaek Lim (HeartSaVioR) >>>> > >> >> >>>> > >> >> 2018년 12월 12일 (수) 오후 11:12, Vaclav Kosar < admin@ vaclavkosar. com >>>> ( ad...@vaclavkosar.com ) >님이 작성: >>>> > >> >>> >>>> > >> >>> I am also waiting for any finalization of my PR [3]. I seems that >>>> SS PRs are not being reviewed much these days. >>>> > >> >>> >>>> > >> >>> [3] https:/ / github. com/ apache/ spark/ pull/ 21919 ( >>>> https://github.com/apache/spark/pull/21919 ) >>>> > >> >>> >>>> > >> >>> >>>> > >> >>> On 12. 12. 18 14:37, Dongjin Lee wrote: >>>> > >> >>> >>>> > >> >>> If it is possible, could you review my PR on Kafka's header >>>> functionality[^1] also? It was added in Kafka 0.11.0.0 but still not >>>> supported in Spark. >>>> > >> >>> >>>> > >> >>> Thanks, >>>> > >> >>> Dongjin >>>> > >> >>> >>>> > >> >>> [^1]: https:/ / github. com/ apache/ spark/ pull/ 22282 ( >>>> https://github.com/apache/spark/pull/22282 ) >>>> > >> >>> [^2]: https:/ / issues. apache. org/ jira/ browse/ KAFKA-4208 ( >>>> https://issues.apache.org/jira/browse/KAFKA-4208 ) >>>> > >> >>> >>>> > >> >>> On Wed, Dec 12, 2018 at 6:43 PM Jungtaek Lim < kabhwan@ gmail. com >>>> ( kabh...@gmail.com ) > wrote: >>>> > >> >>>> >>>> > >> >>>> Hi devs, >>>> > >> >>>> >>>> > >> >>>> Would I kindly ask for reviewing on PRs for Structured >>>> Streaming? I have 5 open pull requests on SS side [1] (earliest PR was >>>> opened around 4 months so far), and there looks like couple of PR for >>>> others [2] which looks good to be reviewed, too. >>>> > >> >>>> >>>> > >> >>>> Thanks in advance, >>>> > >> >>>> Jungtaek Lim (HeartSaVioR) >>>> > >> >>>> >>>> > >> >>>> 1. https:/ / github. com/ apache/ spark/ >>>> > >> >>>> pulls?utf8=%E2%9C%93&q=is%3Aopen+is%3Apr+author%3AHeartSaVioR+%5BSS%5D >>>> ( >>>> https://github.com/apache/spark/pulls?utf8=%E2%9C%93&q=is%3Aopen+is%3Apr+author%3AHeartSaVioR+%5BSS%5D >>>> ) >>>> > >> >>>> 2. https:/ / github. com/ apache/ spark/ >>>> > >> >>>> pulls?utf8=%E2%9C%93&q=is%3Aopen+is%3Apr+%5BSS%5D+ >>>> ( >>>> https://github.com/apache/spark/pulls?utf8=%E2%9C%93&q=is%3Aopen+is%3Apr+%5BSS%5D+ >>>> ) >>>> > >> >>>> >>>> > >> >>> >>>> > >> >>> >>>> > >> >>> -- >>>> > >> >>> Dongjin Lee >>>> > >> >>> >>>> > >> >>> A hitchhiker in the mathematical world. >>>> > >> >>> >>>> > >> >>> github: github. com/ dongjinleekr ( http://github.com/dongjinleekr >>>> ) >>>> > >> >>> linkedin: kr. linkedin. com/ in/ dongjinleekr ( >>>> http://kr.linkedin.com/in/dongjinleekr ) >>>> > >> >>> speakerdeck: speakerdeck. com/ dongjin ( >>>> http://speakerdeck.com/dongjin ) >>>> > >>>> > --------------------------------------------------------------------- >>>> > To unsubscribe e-mail: dev-unsubscribe@ spark. apache. org ( >>>> dev-unsubscr...@spark.apache.org ) >>>> > >>> >>> >>> >> >> > >