commits
Thread
Date
Earlier messages
Later messages
Messages by Thread
svn commit: r82461 - dev/spark/v4.2.0-preview1-rc1-docs
dongjoon
svn commit: r82460 - dev/spark/v4.1.1-rc2-docs
dongjoon
svn commit: r82459 - dev/spark/v4.1.1-rc1-docs
dongjoon
svn commit: r82458 - dev/spark/v4.1.1-rc1-bin
dongjoon
(spark) branch master updated: [SPARK-55020][PYTHON][FOLLOW-UP] Move release into disable gc protection to prevent deadlock
gurwls223
(spark) branch master updated: [SPARK-54740][PYTHON] Start faulthandler early in daemon mode
gurwls223
(spark) branch master updated: [SPARK-54784][ML][DOCS] Document the security policy on ml models
dongjoon
(spark) branch branch-3.5 updated: [SPARK-55495][CORE] Fix `EventLogFileWriters.closeWriter` to handle `checkError`
dongjoon
(spark) branch branch-4.0 updated: [SPARK-55495][CORE] Fix `EventLogFileWriters.closeWriter` to handle `checkError`
dongjoon
(spark) branch master updated: [SPARK-55495][CORE] Fix `EventLogFileWriters.closeWriter` to handle `checkError`
dongjoon
(spark) branch branch-4.1 updated: [SPARK-55495][CORE] Fix `EventLogFileWriters.closeWriter` to handle `checkError`
dongjoon
(spark) branch master updated: [SPARK-55498][BUILD][TESTS] Upgrade `oracle-free` docker image to `23.26.1-slim`
dongjoon
(spark-kubernetes-operator) branch main updated: [SPARK-55499] Update `pi-with-eventlog` to generate multiple log files
dongjoon
(spark-kubernetes-operator) branch main updated: [SPARK-55486] Fix `StatusRecorder.patchAndStatusWithVersionLocked` not to log errors
dongjoon
(spark) branch master updated (5b80958c0b01 -> 17bdbde20acd)
ruifengz
(spark) branch master updated (59f3a16590d8 -> 5b80958c0b01)
ruifengz
(spark) branch master updated (15ca64ddc90c -> 59f3a16590d8)
chengpan
(spark) branch master updated (4e1cb88bba0c -> 15ca64ddc90c)
wenchen
(spark) branch master updated: [SPARK-54805][SS][PYTHON][FOLLOW-UP] Add test_tws_tester to modules
ruifengz
(spark) branch master updated (77980546e305 -> 2538cc832bdd)
gurwls223
(spark) branch branch-4.1 updated: [SPARK-52407][SQL][FOLLOW-UP] Remove Theta Sketch aggregation buffer re-wrapping
dtenedor
(spark) branch master updated (58fbd7f6b1b0 -> 6112a0bfc481)
dtenedor
(spark) branch master updated: [SPARK-54173][K8S][FOLLOWUP] Fix `spark.kubernetes.executor.podDeletionCost` config doc
dongjoon
(spark) branch master updated: [SPARK-55484][K8S] Simplify `KubernetesClusterSchedulerBackend` by reducing private class variables
dongjoon
(spark) branch master updated: [SPARK-55485][K8S] Add `Constants.POD_DELETION_COST` for reuse
dongjoon
(spark) branch branch-4.0 updated: [SPARK-55411][SQL][4.0] SPJ may throw ArrayIndexOutOfBoundsException when join keys are less than cluster keys
ptoth
(spark) branch master updated: [SPARK-55480][PYTHON] Remove all unused noqa for ruff
ruifengz
(spark) branch master updated: [MINOR] Remove unused `InMemoryRelation.convertToColumnarIfPossible` method
viirya
(spark) branch branch-4.1 updated (3b797bc169a0 -> d4d034699464)
ptoth
(spark) branch master updated: [MINOR][INFRA] Add `build_infra_images_cache` into `Build Pipeline Status`
ruifengz
(spark) branch branch-4.0 updated (9f7325c349de -> 964a3efa854d)
chengpan
[PR] [WIP] Document the security policy on ml models [spark-website]
via GitHub
Re: [PR] [SPARK-54784] Document the security policy on ml models [spark-website]
via GitHub
Re: [PR] [SPARK-54784] Document the security policy on ml models [spark-website]
via GitHub
Re: [PR] [SPARK-54784] Document the security policy on ml models [spark-website]
via GitHub
Re: [PR] [SPARK-54784] Document the security policy on ml models [spark-website]
via GitHub
(spark) branch master updated: [SPARK-55405][PYTHON][TESTS][FOLLOWUP] Skip PyArrow array cast tests when numpy < 2.0
ruifengz
(spark) branch master updated (6ced6b477625 -> 935e5cd146c8)
ruifengz
(spark) branch master updated: [SPARK-55475][BUILD] Disable Maven Parallel PUT
dongjoon
(spark) branch master updated: [SPARK-55020][PYTHON][FOLLOW-UP] Disable gc only when we communicate through gRPC for ExecutePlan
gurwls223
(spark) branch master updated: [SPARK-55473][PYTHON] Replace itertools.tee with chain in applyInPandasWithState
gurwls223
(spark) branch master updated: [SPARK-55395][SQL][FOLLOW-UP] Delete obsolete `withSequenceColumn`
ruifengz
(spark) branch master updated: [SPARK-55472][PS] Raise `AttributeError` from methods removed in pandas 3
ruifengz
(spark) branch master updated: [SPARK-55451][SQL] Cursors must start collecting results on OPEN, not first FETCH
gengliang
(spark-website) branch asf-site updated: improve build script and instruction (#675)
wenchen
(spark-kubernetes-operator) branch main updated: [SPARK-55470] Add a `Checkstyle` rule to enforce symbolic placeholder for logging
dongjoon
(spark) branch master updated: [SPARK-55411][SQL] SPJ may throw ArrayIndexOutOfBoundsException when join keys are less than cluster keys
ptoth
(spark-website) branch asf-site updated: Add Cheng Pan to committers (#673)
chengpan
[PR] Improve build script and instruction [spark-website]
via GitHub
Re: [PR] Improve build script and instruction [spark-website]
via GitHub
Re: [PR] Improve build script and instruction [spark-website]
via GitHub
Re: [PR] Improve build script and instruction [spark-website]
via GitHub
[PR] Update info for Connect Swift/Rust/.NET repo [spark-website]
via GitHub
Re: [PR] Update info for Connect Swift/Rust/.NET repo [spark-website]
via GitHub
Re: [PR] Update info for Connect Swift/Rust/.NET repo [spark-website]
via GitHub
(spark) branch master updated: [SPARK-55458][PYTHON][TESTS] Apply the new test pattern for newly added tests
ruifengz
(spark) branch master updated: [SPARK-55460][PYTHON] Remove E203 from ruff's ignore list
ruifengz
(spark-kubernetes-operator) branch main updated: [SPARK-55468] Log `Built-in Spark Version`
dongjoon
(spark) branch master updated (7a0abe4f0859 -> 8a74912251e3)
ruifengz
[PR] Add Cheng Pan to committers [spark-website]
via GitHub
Re: [PR] Add Cheng Pan to committers [spark-website]
via GitHub
Re: [PR] Add Cheng Pan to committers [spark-website]
via GitHub
Re: [PR] Add Cheng Pan to committers [spark-website]
via GitHub
(spark) branch master updated (238efa134ceb -> 7a0abe4f0859)
ruifengz
(spark) branch master updated: [SPARK-55459][PYTHON] Fix 3x performance regression in applyInPandas for large groups
ruifengz
(spark) branch master updated (d353b4706647 -> 3cf6e6b1020e)
dongjoon
(spark) branch master updated (deb09eec6176 -> d353b4706647)
ruifengz
(spark) branch master updated: [SPARK-55385][CORE][SQL][FOLLOWUP] Rename preservesDistribution to preservesPartitionSizes
ruifengz
(spark) branch master updated (378e74a9efe3 -> b4b8165b39d9)
gurwls223
(spark) branch master updated: [SPARK-55455][BUILD] Upgrade `RoaringBitmap` to 1.6.0
dongjoon
(spark) branch master updated: [SPARK-55402][SS] Move streamingSourceIdentifyingName from CatalogTable to DataSource
ashrigondekar
(spark-connect-swift) branch main updated: [SPARK-55454] Use `4.2.0-preview2` for Spark 4.2 integration tests
dongjoon
[PR] add Apache Iceberg to index.md; add alt text to logos [spark-website]
via GitHub
Re: [PR] add Apache Iceberg to index.md; add alt text to logos [spark-website]
via GitHub
Re: [PR] add Apache Iceberg to index.md; add alt text to logos [spark-website]
via GitHub
Re: [PR] add Apache Iceberg to index.md; add alt text to logos [spark-website]
via GitHub
(spark) branch master updated: [SPARK-55432][K8S] Support built-in K8s `ExecutorResizePlugin`
dongjoon
(spark) branch master updated (a6787fd8cc12 -> 6757f7877401)
ruifengz
(spark) branch master updated: [SPARK-55437][INFRA][R] Upgrade SparkR test image to Ubuntu 24.04
ruifengz
(spark) branch master updated: [SPARK-55229][SPARK-55231][PYTHON] Implement DataFrame.zipWithIndex in PySpark
ruifengz
(spark) branch master updated (2121a5a31d69 -> f3ad0f6db854)
yangjie01
(spark) branch master updated: [SPARK-55436][INFRA] Upgrade lint and doc test images to Ubuntu 24.04
gurwls223
(spark) branch master updated: [MINOR][INFRA] Use `lsb_release -a` to display the container os version
ruifengz
(spark) branch master updated: [SPARK-55366][SQL][PYTHON][FOLLOW-UP] Relax the duplicated field name check
ruifengz
(spark) branch master updated: [SPARK-55431][K8S] Set `resizePolicy` to `NotRequired` explicitly for executor pods
dongjoon
(spark) branch master updated: [SPARK-55408][PS] Handle unexpected keyword argument errors related to datetime with pandas 3
ruifengz
(spark) branch master updated (26384d7de53f -> 8d46ddb251b8)
ruifengz
(spark) branch master updated: [SPARK-55224][PYTHON][FOLLOWUP] Remove redundant `use_legacy_pandas_udf_conversion` condition in serializer setup
ruifengz
(spark) branch branch-3.5 updated: [SPARK-55434][INFRA] Add username and password at svn with rm at finalize step
gurwls223
(spark) branch branch-4.0 updated: [SPARK-55434][INFRA] Add username and password at svn with rm at finalize step
gurwls223
(spark) branch branch-4.1 updated: [SPARK-55434][INFRA] Add username and password at svn with rm at finalize step
gurwls223
(spark) branch master updated (ee58e0e17501 -> f6031fef94f3)
gurwls223
svn commit: r82371 - release/spark/spark-4.2.0-preview1
gurwls223
(spark) branch master updated: [SPARK-55433][INFRA] Remove labeler in GitHub Actions
gurwls223
svn commit: r82369 - dev/spark/v4.2.0-preview2-rc1-docs/_site release/spark/docs/4.2.0-preview2
gurwls223
svn commit: r82370 - dev/spark/v4.2.0-preview2-rc1-bin release/spark/spark-4.2.0-preview2
gurwls223
(spark) tag v4.2.0-preview2 created (now a2edb559299d)
gurwls223
(spark) branch master updated: [SPARK-54860][INFRA] Followup of the revert to set the permission correctly
gurwls223
(spark) branch master updated: Revert "[SPARK-54860][INFRA] Add JIRA Ticket Validating in GHA"
gurwls223
(spark) branch master updated: [SPARK-55429][K8S][TESTS] Improve `VolcanoTestsSuite` to use `Server-Side Apply` pattern
gurwls223
(spark) branch master updated: [SPARK-55424][PYTHON] Explicitly pass the series name in `convert_numpy`
gurwls223
(spark) branch master updated: [SPARK-55175][PYTHON][FOLLOW-UP] Remove unused `arrow_to_pandas` method
gurwls223
(spark) branch master updated: [SPARK-55414][PYTHON][INFRA] Upgrade Python 3.12 test images for classic-only and pandas 3 to Ubuntu 24.04
gurwls223
(spark) branch master updated: [SPARK-55358][PYTHON][INFRA][FOLLOW-UP] Do not apt-get install `python3-xxx`
gurwls223
(spark) branch master updated (d54498861119 -> 668b2c5860ed)
gurwls223
(spark) branch master updated (4c336897859c -> d54498861119)
gurwls223
(spark) branch master updated: [SPARK-55404][PYTHON] Always raise KeyboardInterrupt from SIGINT handler
gurwls223
(spark) branch master updated: [SPARK-55395][SQL] Disable RDD cache in `DataFrame.zipWithIndex`
gurwls223
(spark) branch master updated: [SPARK-55385][CORE][SQL] Mitigate the recomputation in `zipWithIndex`
gurwls223
(spark) branch master updated: [SPARK-55383][INFRA] Only send test report to codecov in coverage run
gurwls223
(spark) branch master updated: [SPARK-55413][PYTHON][INFRA] Upgrade Python minimum dep test images to Ubuntu 24.04
dongjoon
(spark) branch master updated (d1dbcdab1af9 -> e72ddacc568f)
dongjoon
(spark) branch master updated: [SPARK-55428][BUILD] Sync Netty Java options everywhere
dongjoon
(spark) branch master updated: [SPARK-55407][PYSPARK] Replace logger.warn with logger.warning
dongjoon
(spark) branch master updated: [SPARK-54881][SQL][FOLLOWUP] Extract simplifyNot method in BooleanSimplification
wenchen
(spark) branch branch-4.1 updated: [SPARK-55337][SS] Fix MemoryStream backward compatibility
wenchen
(spark) branch master updated: [SPARK-55337][SS] Fix MemoryStream backward compatibility
wenchen
(spark) branch pr-54140-update deleted (was 32afc45731d7)
dongjoon
(spark) branch master updated: [SPARK-55420][BUILD] Upgrade Netty to `4.2.10.Final`
dongjoon
(spark-connect-swift) branch main updated: [SPARK-55418] Add `create_spark_jira.py` script
dongjoon
(spark-kubernetes-operator) branch main updated: [SPARK-55417] Add `create_spark_jira.py` script
dongjoon
(spark) branch master updated (474e07efed0a -> a1c41a819f8d)
ruifengz
(spark) branch master updated (de345288830c -> 474e07efed0a)
ruifengz
(spark-connect-swift) branch main updated: [SPARK-55426] Set `strategy.max-parrallel` to 20 for all GitHub Action jobs
dongjoon
(spark-kubernetes-operator) branch main updated: [SPARK-55425] Set `strategy.max-parrallel` to 20 for all GitHub Action jobs
dongjoon
(spark) branch branch-3.5 updated: [SPARK-55423][INFRA] Set `strategy.max-parrallel` to 20 for all GitHub Action jobs
dongjoon
(spark) branch branch-4.0 updated: [SPARK-55423][INFRA] Set `strategy.max-parrallel` to 20 for all GitHub Action jobs
dongjoon
(spark) branch branch-4.1 updated: [SPARK-55423][INFRA] Set `strategy.max-parrallel` to 20 for all GitHub Action jobs
dongjoon
(spark) branch master updated: [SPARK-55423][INFRA] Set `strategy.max-parrallel` to 20 for all GitHub Action jobs
dongjoon
(spark) branch master updated: [SPARK-55410][K8S] Improve `SparkKubernetesDiagnosticsSetter` to use `patch` instead of `edit` API
dongjoon
(spark-kubernetes-operator) branch main updated: [SPARK-55422] Fix the default value of `readinessProbe.failureThreshold` to 1
dongjoon
(spark-kubernetes-operator) branch main updated: [SPARK-55421] Increase `livenessProbe.failureThreshold` to 3
dongjoon
(spark-kubernetes-operator) branch main updated: [SPARK-55419] Upgrade Netty to `4.2.10.Final`
dongjoon
(spark) branch master updated: [SPARK-55180][PYTHON][INFRA][FOLLOW-UP] Delete unused yml file
dongjoon
(spark) branch master updated: [MINOR][DOCS] Update Maven version and MAVEN_OPTS setting in `building-spark.md` docs
dongjoon
(spark) branch master updated: [SPARK-55401][PYTHON] Add retry logic and timeout handling to pyspark install download
yao
(spark) branch branch-4.0 updated: [SPARK-55387][CORE][UI] Fix DAG visualization not rendering due to malformed DOT label
yao
(spark) branch branch-4.1 updated: [SPARK-55387][CORE][UI] Fix DAG visualization not rendering due to malformed DOT label
yao
(spark) branch master updated: [SPARK-55387][CORE][UI] Fix DAG visualization not rendering due to malformed DOT label
yao
(spark) branch master updated: [SPARK-55394][PYTHON][INFRA] Upgrade Python 3.10 test image to Ubuntu 24.04
ruifengz
(spark) branch master updated: [SPARK-55393][PYTHON][INFRA] Upgrade Python 3.11 test image to Ubuntu 24.04
ruifengz
(spark) branch master updated: [SPARK-55392][PYTHON][INFRA] Upgrade Python 3.14 test image to Ubuntu 24.04
ruifengz
(spark) branch master updated: [SPARK-55391][PYTHON][INFRA] Upgrade Python 3.13 test image to Ubuntu 24.04
ruifengz
(spark) branch master updated: [SPARK-55399][K8S] Improve `KubernetesDriverEndpoint` to use `patch` instead of `edit` API
dongjoon
(spark) branch master updated: [SPARK-55304][SS][PYTHON] Introduce support of Admission Control and Trigger.AvailableNow in Python data source - streaming reader
kabhwan
(spark) branch master updated: [SPARK-55317][SQL] Add SequentialUnion logical plan node and planning rule
dtenedor
(spark) branch master updated: [SPARK-55131][SS] Change the default merge operator delimiter for RocksDB to empty string to concat without delimiter
kabhwan
(spark) branch pr-54140-update updated (76c8e0b10195 -> 32afc45731d7)
yao
(spark) 01/01: [SPARK-XXXXX][SQL] Add cost-based guard to CrossJoinArrayContainsToInnerJoin
yao
(spark) branch master updated: [SPARK-55334][PYTHON] Enable `TimestampType` and `TimestampNTZType` in `convert_numpy`
ruifengz
(spark) branch master updated (ee324696f916 -> ec29abb3033d)
ruifengz
(spark) branch master updated (861ba537250d -> ee324696f916)
ruifengz
(spark) branch master updated (f0d9f993fc3e -> 861ba537250d)
kabhwan
(spark) branch master updated: [SPARK-55386][INFRA] Run `Java 17/25` Maven install tests on PR build only
dongjoon
(spark) branch master updated: [SPARK-55376][PS] Make numeric_only argument in groupby functions accept only boolean with pandas 3
ruifengz
(spark) branch master updated: [SPARK-55382][CORE] Make `Executor` to log `Running Spark version`
dongjoon
(spark-connect-swift) branch main updated: [SPARK-55381] Use Spark `4.0.2` instead of `4.0.1`
dongjoon
(spark-kubernetes-operator) branch main updated: [SPARK-55380] Upgrade `Iceberg` example to use Spark 4.0.2
dongjoon
(spark) branch master updated: [MINOR][PYTHON] Add `tabulate` in `dev/requirements.txt`
gurwls223
(spark) branch master updated (a48e3131b1ea -> b28db406c357)
gurwls223
(spark) branch master updated (5de75d8ef5b2 -> a48e3131b1ea)
dongjoon
[PR] Redirect `4.1.0-preview*` docs to ASF archive service [spark-website]
via GitHub
Re: [PR] Redirect `4.1.0-preview*` docs to ASF archive service [spark-website]
via GitHub
Re: [PR] Redirect `4.1.0-preview*` docs to ASF archive service [spark-website]
via GitHub
Re: [PR] Redirect `4.1.0-preview*` docs to ASF archive service [spark-website]
via GitHub
Re: [PR] Redirect `4.1.0-preview*` docs to ASF archive service [spark-website]
via GitHub
Re: [PR] Redirect `4.1.0-preview*` docs to ASF archive service [spark-website]
via GitHub
Re: [PR] Redirect `4.1.0-preview*` docs to ASF archive service [spark-website]
via GitHub
Re: [PR] Redirect `4.1.0-preview*` docs to ASF archive service [spark-website]
via GitHub
(spark) branch master updated (509aa00ccf63 -> 5de75d8ef5b2)
ruifengz
[PR] Add 4.0.2 news [spark-website]
via GitHub
Re: [PR] Add 4.0.2 news [spark-website]
via GitHub
Re: [PR] Add 4.0.2 news [spark-website]
via GitHub
(spark-docker) branch master updated: [SPARK-55378] Publish Apache Spark 4.0.2 to docker registry (#104)
dongjoon
(spark) branch master updated (62824a8f0236 -> 509aa00ccf63)
ueshin
(spark) branch master updated: [SPARK-55368][PYTHON][TESTS] Make sure `worker_util.py` can only be imported in python workers
gurwls223
(spark) branch master updated: [SPARK-55366][SQL][PYTHON] Remove `errorOnDuplicatedFieldNames` from Python UDFs
gurwls223
(spark) branch master updated: [MINOR][PS] Convert loop append in Pyspark to list comprehension
gurwls223
(spark) branch pr-54140-update created (now 76c8e0b10195)
yao
(spark) 01/01: [SPARK-XXXXX][SQL] Add cost-based guard to CrossJoinArrayContainsToInnerJoin
yao
[PR] Add Spark 4.0.2 documentation [spark-website]
via GitHub
Re: [PR] Add Spark 4.0.2 documentation [spark-website]
via GitHub
Re: [PR] Add Spark 4.0.2 documentation [spark-website]
via GitHub
Re: [PR] Add Spark 4.0.2 documentation [spark-website]
via GitHub
svn commit: r82317 - dev/spark/v4.0.2-rc1-docs
dongjoon
svn commit: r82315 - dev/spark/v4.0.2-rc1-bin release/spark/spark-4.0.2
dongjoon
svn commit: r82316 - dev/spark/v4.0.2-rc1-docs/_site release/spark/docs/4.0.2
dongjoon
(spark) tag v4.0.2 created (now 7cc3b9bcdaab)
dongjoon
(spark) branch master updated: [SPARK-55370][K8S] Improve `annotateExecutorDeletionCost` to use `patch` instead of `edit` API
dongjoon
(spark-kubernetes-operator) branch main updated: [SPARK-55371] Increase `Gradle` retry setting to stablize CIs
dongjoon
(spark-kubernetes-operator) branch main updated: [SPARK-55374] Remove `vendor` requirement from Java toolchain
dongjoon
(spark-kubernetes-operator) branch main updated: MINOR: Add release version badge and link to `README.md`
dongjoon
(spark) branch master updated: [SPARK-55373][CONNECT] Improve noHandlerFoundForExtension error message
hvanhovell
(spark) branch master updated: [SPARK-55341][SQL] Add storage level flag for cached local relations
hvanhovell
(spark) branch master updated: [SPARK-55356][SQL] Support alias for PIVOT clause
wenchen
svn commit: r82303 - in dev/spark/v4.2.0-preview2-rc1-docs: . _site _site/api _site/api/R _site/api/R/articles _site/api/R/articles/sparkr-vignettes_files _site/api/R/articles/sparkr-vignettes_files/accessible-code-block-0.0.1 _site/api/R/deps _site/ap...
gurwls223
(spark-website) branch asf-site updated: Change Spark 4.2 release timeline (#668)
dongjoon
svn commit: r82302 - dev/spark/v4.2.0-preview2-rc1-bin
gurwls223
(spark) branch master updated: [SPARK-55365][PYTHON] Generalize the utils for arrow array conversion
ruifengz
(spark) branch master updated: [SPARK-55228][SPARK-55230][SQL][CONNECT] Implement Dataset.zipWithIndex in Scala API
ruifengz
Earlier messages
Later messages