Sure, I will do it. SPARK-40010 <https://issues.apache.org/jira/browse/SPARK-40010> is built to track progress.
Hyukjin Kwon gurwls...@gmail.com <http://mailto:gurwls...@gmail.com> 于2022年8月9日周二 10:58写道: Please go ahead. Would be very appreciated. > > On Tue, 9 Aug 2022 at 11:58, Qian SUN <qian.sun2...@gmail.com> wrote: > >> Hi Hyukjin >> >> I would like to do some work and pick up *Window.py *if possible. >> >> Thanks, >> Qian >> >> Hyukjin Kwon <gurwls...@gmail.com> 于2022年8月9日周二 10:41写道: >> >>> Thanks Khalid for taking a look. >>> >>> On Tue, 9 Aug 2022 at 00:37, Khalid Mammadov <khalidmammad...@gmail.com> >>> wrote: >>> >>>> Hi Hyukjin >>>> That's great initiative, here is a PR that address one of those issues >>>> that's waiting for review: https://github.com/apache/spark/pull/37408 >>>> >>>> Perhaps, it would be also good to track these pending issues somewhere >>>> to avoid effort duplication. >>>> >>>> For example, I would like to pick up *union* and *union all* if no >>>> one has already. >>>> >>>> Thanks, >>>> Khalid >>>> >>>> >>>> On Mon, Aug 8, 2022 at 1:44 PM Hyukjin Kwon <gurwls...@gmail.com> >>>> wrote: >>>> >>>>> Hi all, >>>>> >>>>> I am trying to improve PySpark documentation especially: >>>>> >>>>> - Make the examples self-contained, e.g., >>>>> >>>>> https://pandas.pydata.org/docs/reference/api/pandas.DataFrame.pivot.html >>>>> - Document Parameters >>>>> >>>>> https://pandas.pydata.org/docs/reference/api/pandas.DataFrame.pivot.html#pandas.DataFrame.pivot. >>>>> There are many API that misses parameters in PySpark, e.g., >>>>> DataFrame.union >>>>> >>>>> Here is one example PR I am working on: >>>>> https://github.com/apache/spark/pull/37437 >>>>> I can't do it all by myself. Any help, review, and contributions >>>>> would be welcome and appreciated. >>>>> >>>>> Thank you all in advance. >>>>> >>>> >> >> -- >> Best! >> Qian SUN >> > -- Best! Qian SUN