That sounds like a great suggestion.
From: Jungtaek Lim
Date: Tuesday, March 5, 2024, 10:46
To: Hyukjin Kwon
Cc: yangjie01, Dongjoon Hyun, dev, user
Subject: Re: [ANNOUNCE] Apache Spark 3.5.1 released
Yes, it's relevant to that PR. I wonder whether, if we want to expose a
version switcher, it should live in the versionless doc (spark-website)
rather than in a doc pinned to a specific version.
On Tue, Mar 5, 2024 at 11:18 AM Hyukjin Kwon wrote:
Is this related to https://github.com/apache/spark/pull/42428?
cc @Yang,Jie(INF)
On Mon, 4 Mar 2024 at 22:21, Jungtaek Lim wrote:
> Shall we revisit this functionality? The API doc is built with individual
> versions, and for each individual version we depend on other released
> versions. This
I have downloaded Amazon reviews for sentiment analysis from here. The file
is not particularly large (just over 500MB) but comes in the following
format
test.ft.txt.bz2.zip
So it is a text file that is compressed with bz2 and then with zip. Now I
would like to do all these operations in PySpark. In PySpa
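
For reference, the two decompression steps described above (zip on the outside, bz2 on the inside) can be sketched with the Python standard library alone. This is only a minimal sketch under assumptions: the helper name `iter_lines_from_zip_bz2` is hypothetical, the text is assumed to be UTF-8, and every member of the zip archive is assumed to be a bz2-compressed text file.

```python
import bz2
import io
import zipfile


def iter_lines_from_zip_bz2(zip_bytes, encoding="utf-8"):
    """Yield text lines from a zip archive whose members are bz2-compressed.

    Hypothetical helper for illustration; assumes UTF-8 text by default.
    """
    # Outer layer: open the zip archive from an in-memory byte string.
    with zipfile.ZipFile(io.BytesIO(zip_bytes)) as archive:
        for member in archive.namelist():
            if member.endswith("/"):
                continue  # skip directory entries
            # Inner layer: stream-decompress the bz2 member as text.
            with archive.open(member) as compressed:
                with bz2.open(compressed, mode="rt", encoding=encoding) as text:
                    for line in text:
                        yield line.rstrip("\n")
```

In PySpark, one possible way to apply such a helper is `sc.binaryFiles(path).flatMap(lambda kv: iter_lines_from_zip_bz2(kv[1]))`, where `sc` is an existing SparkContext and `path` points at the zip file. Note that `binaryFiles` hands each file to a single task as one byte string, so the archive is decompressed on one executor and is only spread across the cluster after a later repartition.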