uros-db commented on code in PR #46762:
URL: https://github.com/apache/spark/pull/46762#discussion_r1677116591
##
common/unsafe/src/main/java/org/apache/spark/sql/catalyst/util/CollationAwareUTF8String.java:
##
@@ -841,117 +842,255 @@ public static UTF8String translate(final UTF
uros-db commented on code in PR #46762:
URL: https://github.com/apache/spark/pull/46762#discussion_r1677116630
##
common/unsafe/src/main/java/org/apache/spark/sql/catalyst/util/CollationAwareUTF8String.java:
##
@@ -841,117 +842,255 @@ public static UTF8String translate(final UTF
uros-db commented on code in PR #46762:
URL: https://github.com/apache/spark/pull/46762#discussion_r1677116677
##
common/unsafe/src/main/java/org/apache/spark/sql/catalyst/util/CollationAwareUTF8String.java:
##
@@ -841,117 +842,255 @@ public static UTF8String translate(final UTF
zhengruifeng opened a new pull request, #47342:
URL: https://github.com/apache/spark/pull/47342
### What changes were proposed in this pull request?
Inspired by https://github.com/apache/spark/pull/47258, I am checking other
ML implementations, and find that we can also optimize `Tokenize
wayneguow opened a new pull request, #47343:
URL: https://github.com/apache/spark/pull/47343
### What changes were proposed in this pull request?
This PR aims to add some extra examples for `linearRegression` built-in
functions.
### Why are the changes needed?
wayneguow opened a new pull request, #47344:
URL: https://github.com/apache/spark/pull/47344
### What changes were proposed in this pull request?
This PR aims to upgrade `docker-java` to 3.4.0.
### Why are the changes needed?
There some improvements, such as:
uros-db commented on code in PR #47154:
URL: https://github.com/apache/spark/pull/47154#discussion_r1677169283
##
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/aggregate/Mode.scala:
##
@@ -106,11 +155,13 @@ case class Mode(
val collationAwareBuffer =
uros-db commented on code in PR #47154:
URL: https://github.com/apache/spark/pull/47154#discussion_r1677169498
##
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/aggregate/Mode.scala:
##
@@ -106,11 +155,13 @@ case class Mode(
val collationAwareBuffer =
wayneguow opened a new pull request, #47345:
URL: https://github.com/apache/spark/pull/47345
### What changes were proposed in this pull request?
Add a missing param in func docs of `partitioning.py`.
### Why are the changes needed?
- Make python api docs bett
uros-db commented on code in PR #47154:
URL: https://github.com/apache/spark/pull/47154#discussion_r1677169573
##
sql/catalyst/src/main/scala/org/apache/spark/sql/catalyst/expressions/aggregate/Mode.scala:
##
@@ -86,6 +75,66 @@ case class Mode(
buffer
}
+ private def
dongjoon-hyun commented on PR #47328:
URL: https://github.com/apache/spark/pull/47328#issuecomment-2227419001
Thank you for reverting this, Hyukjin.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go t
dongjoon-hyun commented on PR #47341:
URL: https://github.com/apache/spark/pull/47341#issuecomment-2227419448
Thank you, @HyukjinKwon and @zhengruifeng .
In the PR description, could you add specific JIRA issue links for the
following ?
> In order to leverage Catalyst optimizer and
dongjoon-hyun commented on code in PR #47325:
URL: https://github.com/apache/spark/pull/47325#discussion_r1677194257
##
sql/catalyst/src/main/scala/org/apache/spark/sql/internal/SQLConf.scala:
##
@@ -3229,6 +3229,15 @@ object SQLConf {
.booleanConf
.createWithDefault(f
dongjoon-hyun commented on PR #47325:
URL: https://github.com/apache/spark/pull/47325#issuecomment-2227463911
However, SPARK-48658 was merged as an improvement JIRA, @yaooqinn . Do you
mean we need to convert it as a bug fix?
, I
think we can only
LuciferYang commented on code in PR #47310:
URL: https://github.com/apache/spark/pull/47310#discussion_r1677272149
##
sql/core/benchmarks/DataSourceReadBenchmark-results.txt:
##
@@ -1,431 +1,438 @@
-DataSourceReadBenchmark-jdk21-results.txt===
LuciferYang commented on code in PR #47310:
URL: https://github.com/apache/spark/pull/47310#discussion_r1677272492
##
sql/core/benchmarks/DataSourceReadBenchmark-results.txt:
##
@@ -1,431 +1,438 @@
-DataSourceReadBenchmark-jdk21-results.txt===
LuciferYang commented on code in PR #47310:
URL: https://github.com/apache/spark/pull/47310#discussion_r1677272149
##
sql/core/benchmarks/DataSourceReadBenchmark-results.txt:
##
@@ -1,431 +1,438 @@
-DataSourceReadBenchmark-jdk21-results.txt===
HeartSaVioR commented on PR #47336:
URL: https://github.com/apache/spark/pull/47336#issuecomment-2227627161
Shall we follow the way we have been doing for versioning? We tend to
reserve the first line for version and use the format `s"v$version"` - this
applies to offset/commit log as well
HeartSaVioR commented on code in PR #47336:
URL: https://github.com/apache/spark/pull/47336#discussion_r1677278405
##
sql/core/src/main/scala/org/apache/spark/sql/execution/streaming/state/RocksDBFileManager.scala:
##
@@ -193,11 +208,18 @@ class RocksDBFileManager(
versio
HeartSaVioR commented on PR #47338:
URL: https://github.com/apache/spark/pull/47338#issuecomment-2227634609
Thanks! Merging to master.
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific
HeartSaVioR closed pull request #47338: [SPARK-4][SS] Remove snapshot
creation based on changelog ops size
URL: https://github.com/apache/spark/pull/47338
--
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above
1 - 100 of 137 matches
Mail list logo