[jira] [Created] (SPARK-52195) Fix initial state column dropping issue for Python TWS

2025-05-16 Thread Bo Gao (Jira)
Bo Gao created SPARK-52195: -- Summary: Fix initial state column dropping issue for Python TWS Key: SPARK-52195 URL: https://issues.apache.org/jira/browse/SPARK-52195 Project: Spark Issue Type: Task

[jira] [Created] (SPARK-51889) Fix MapState bug on clear() for TWS Python

2025-04-23 Thread Bo Gao (Jira)
Bo Gao created SPARK-51889: -- Summary: Fix MapState bug on clear() for TWS Python Key: SPARK-51889 URL: https://issues.apache.org/jira/browse/SPARK-51889 Project: Spark Issue Type: Task Com

[jira] [Created] (SPARK-51697) PySpark - Fix list state test failure for TWS

2025-04-02 Thread Bo Gao (Jira)
Bo Gao created SPARK-51697: -- Summary: PySpark - Fix list state test failure for TWS Key: SPARK-51697 URL: https://issues.apache.org/jira/browse/SPARK-51697 Project: Spark Issue Type: Task

[jira] [Created] (SPARK-51684) PySpark - Fix test failure for TWS

2025-04-01 Thread Bo Gao (Jira)
Bo Gao created SPARK-51684: -- Summary: PySpark - Fix test failure for TWS Key: SPARK-51684 URL: https://issues.apache.org/jira/browse/SPARK-51684 Project: Spark Issue Type: Task Components:

[jira] [Updated] (SPARK-51587) [PySpark] Fix an issue where timestamp cannot be used in ListState when multiple state data is involved

2025-03-21 Thread Bo Gao (Jira)
[ https://issues.apache.org/jira/browse/SPARK-51587?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bo Gao updated SPARK-51587: --- Description: Fix an issue where timestamp cannot be used in ListState when multiple state data is involved.

[jira] [Updated] (SPARK-51587) [PySpark] Fix an issue where timestamp cannot be used in ListState when multiple state data is involved

2025-03-21 Thread Bo Gao (Jira)
[ https://issues.apache.org/jira/browse/SPARK-51587?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bo Gao updated SPARK-51587: --- Description: Fix an issue where timestamp cannot be used in ListState when multiple state data is involved.

[jira] [Created] (SPARK-51587) [PySpark] Fix an issue where timestamp cannot be used in ListState when multiple state data is involved

2025-03-21 Thread Bo Gao (Jira)
Bo Gao created SPARK-51587: -- Summary: [PySpark] Fix an issue where timestamp cannot be used in ListState when multiple state data is involved Key: SPARK-51587 URL: https://issues.apache.org/jira/browse/SPARK-51587

[jira] [Updated] (SPARK-51506) Do not enforce users to implement close() in TransformWithStateInPandas

2025-03-13 Thread Bo Gao (Jira)
[ https://issues.apache.org/jira/browse/SPARK-51506?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bo Gao updated SPARK-51506: --- Summary: Do not enforce users to implement close() in TransformWithStateInPandas (was: Not enforce implemen

[jira] [Created] (SPARK-51506) Not enforce implement close() in TransformWithStateInPandas

2025-03-13 Thread Bo Gao (Jira)
Bo Gao created SPARK-51506: -- Summary: Not enforce implement close() in TransformWithStateInPandas Key: SPARK-51506 URL: https://issues.apache.org/jira/browse/SPARK-51506 Project: Spark Issue Type:

[jira] [Created] (SPARK-51147) Refactor some streaming related classes to a dedicated streaming directory

2025-02-10 Thread Bo Gao (Jira)
Bo Gao created SPARK-51147: -- Summary: Refactor some streaming related classes to a dedicated streaming directory Key: SPARK-51147 URL: https://issues.apache.org/jira/browse/SPARK-51147 Project: Spark

[jira] [Created] (SPARK-50540) [PySpark] Fix the issue for string schema in StatefulProcessHandle

2024-12-10 Thread Bo Gao (Jira)
Bo Gao created SPARK-50540: -- Summary: [PySpark] Fix the issue for string schema in StatefulProcessHandle Key: SPARK-50540 URL: https://issues.apache.org/jira/browse/SPARK-50540 Project: Spark Issue

[jira] [Updated] (SPARK-50341) PySpark - Use UDS for communication between JVM and Streaming Python worker

2024-11-19 Thread Bo Gao (Jira)
[ https://issues.apache.org/jira/browse/SPARK-50341?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bo Gao updated SPARK-50341: --- Summary: PySpark - Use UDS for communication between JVM and Streaming Python worker (was: PySpark - Use UD

[jira] [Created] (SPARK-49899) [PySpark] Support deleteIfExists

2024-10-07 Thread Bo Gao (Jira)
Bo Gao created SPARK-49899: -- Summary: [PySpark] Support deleteIfExists Key: SPARK-49899 URL: https://issues.apache.org/jira/browse/SPARK-49899 Project: Spark Issue Type: Task Components: S

[jira] [Created] (SPARK-49821) [PySpark] Support MapState

2024-09-27 Thread Bo Gao (Jira)
Bo Gao created SPARK-49821: -- Summary: [PySpark] Support MapState Key: SPARK-49821 URL: https://issues.apache.org/jira/browse/SPARK-49821 Project: Spark Issue Type: Task Components: Structu

[jira] [Created] (SPARK-49744) [PySpark] State TTL support - ListState

2024-09-20 Thread Bo Gao (Jira)
Bo Gao created SPARK-49744: -- Summary: [PySpark] State TTL support - ListState Key: SPARK-49744 URL: https://issues.apache.org/jira/browse/SPARK-49744 Project: Spark Issue Type: Task Compon

[jira] [Created] (SPARK-49463) [PySpark] Support ListState

2024-08-29 Thread Bo Gao (Jira)
Bo Gao created SPARK-49463: -- Summary: [PySpark] Support ListState Key: SPARK-49463 URL: https://issues.apache.org/jira/browse/SPARK-49463 Project: Spark Issue Type: Task Components: Struct

[jira] [Updated] (SPARK-48755) [PySpark] Base implementation and ValueState support

2024-08-19 Thread Bo Gao (Jira)
[ https://issues.apache.org/jira/browse/SPARK-48755?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bo Gao updated SPARK-48755: --- Summary: [PySpark] Base implementation and ValueState support (was: [Python State V2] Base implementation a

[jira] [Updated] (SPARK-49233) [PySpark] Classify internal and user facing errors in PySpark transformWithState

2024-08-19 Thread Bo Gao (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49233?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bo Gao updated SPARK-49233: --- Summary: [PySpark] Classify internal and user facing errors in PySpark transformWithState (was: [Python Sta

[jira] [Updated] (SPARK-49212) [PySpark] Implement schema evolution support

2024-08-19 Thread Bo Gao (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49212?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bo Gao updated SPARK-49212: --- Summary: [PySpark] Implement schema evolution support (was: [Python State V2] Implement schema evolution su

[jira] [Updated] (SPARK-49100) [PySpark] Add verification for result iterator of transformWithState UDF

2024-08-19 Thread Bo Gao (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bo Gao updated SPARK-49100: --- Summary: [PySpark] Add verification for result iterator of transformWithState UDF (was: [Python State V2] A

[jira] [Updated] (SPARK-49233) [Python State V2] Classify internal and user facing errors in PySpark transformWithState

2024-08-14 Thread Bo Gao (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49233?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bo Gao updated SPARK-49233: --- Summary: [Python State V2] Classify internal and user facing errors in PySpark transformWithState (was: [Py

[jira] [Updated] (SPARK-49233) [Python State V2] Classify internal and user facing errors in PySpark

2024-08-14 Thread Bo Gao (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49233?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bo Gao updated SPARK-49233: --- Summary: [Python State V2] Classify internal and user facing errors in PySpark (was: [Python State V2] Clas

[jira] [Created] (SPARK-49233) [Python State V2] Classify errors thrown by internal methods

2024-08-13 Thread Bo Gao (Jira)
Bo Gao created SPARK-49233: -- Summary: [Python State V2] Classify errors thrown by internal methods Key: SPARK-49233 URL: https://issues.apache.org/jira/browse/SPARK-49233 Project: Spark Issue Type:

[jira] [Created] (SPARK-49212) [Python State V2] Implement schema evolution support

2024-08-12 Thread Bo Gao (Jira)
Bo Gao created SPARK-49212: -- Summary: [Python State V2] Implement schema evolution support Key: SPARK-49212 URL: https://issues.apache.org/jira/browse/SPARK-49212 Project: Spark Issue Type: Task

[jira] [Updated] (SPARK-49146) Move assertion errors related to watermark missing in append mode streaming queries to error framework

2024-08-07 Thread Bo Gao (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49146?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bo Gao updated SPARK-49146: --- Summary: Move assertion errors related to watermark missing in append mode streaming queries to error framew

[jira] [Updated] (SPARK-49146) Move assertion errors related to watermarks to error framework

2024-08-07 Thread Bo Gao (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49146?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bo Gao updated SPARK-49146: --- Description: This is a followup for https://issues.apache.org/jira/browse/SPARK-45539. The errors added ther

[jira] [Created] (SPARK-49146) Move assertion errors related to watermarks to error framework

2024-08-07 Thread Bo Gao (Jira)
Bo Gao created SPARK-49146: -- Summary: Move assertion errors related to watermarks to error framework Key: SPARK-49146 URL: https://issues.apache.org/jira/browse/SPARK-49146 Project: Spark Issue Typ

[jira] [Updated] (SPARK-49100) [Python State V2] Add verification for result iterator of transformWithState UDF

2024-08-02 Thread Bo Gao (Jira)
[ https://issues.apache.org/jira/browse/SPARK-49100?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bo Gao updated SPARK-49100: --- Description: add verification that elements in result_iter for are indeed of type pd.DataFrame and confirm t

[jira] [Created] (SPARK-48755) [Python State V2] Base implementation and ValueState support

2024-06-28 Thread Bo Gao (Jira)
Bo Gao created SPARK-48755: -- Summary: [Python State V2] Base implementation and ValueState support Key: SPARK-48755 URL: https://issues.apache.org/jira/browse/SPARK-48755 Project: Spark Issue Type:

[jira] [Resolved] (SPARK-46963) Verify AQE is not enabled for Structured Streaming

2024-02-06 Thread Bo Gao (Jira)
[ https://issues.apache.org/jira/browse/SPARK-46963?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bo Gao resolved SPARK-46963. Resolution: Won't Do > Verify AQE is not enabled for Structured Streaming > --

[jira] [Created] (SPARK-46963) Verify AQE is not enabled for Structured Streaming

2024-02-02 Thread Bo Gao (Jira)
Bo Gao created SPARK-46963: -- Summary: Verify AQE is not enabled for Structured Streaming Key: SPARK-46963 URL: https://issues.apache.org/jira/browse/SPARK-46963 Project: Spark Issue Type: Task

[jira] [Created] (SPARK-44877) Support python protobuf functions for Spark Connect

2023-08-18 Thread Bo Gao (Jira)
Bo Gao created SPARK-44877: -- Summary: Support python protobuf functions for Spark Connect Key: SPARK-44877 URL: https://issues.apache.org/jira/browse/SPARK-44877 Project: Spark Issue Type: Task

[jira] [Created] (SPARK-44626) Followup on streaming query termination when client session is timed out for Spark Connect

2023-08-01 Thread Bo Gao (Jira)
Bo Gao created SPARK-44626: -- Summary: Followup on streaming query termination when client session is timed out for Spark Connect Key: SPARK-44626 URL: https://issues.apache.org/jira/browse/SPARK-44626 Projec

[jira] [Updated] (SPARK-44434) Add more tests for Scala foreachBatch and streaming listeners

2023-07-14 Thread Bo Gao (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44434?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bo Gao updated SPARK-44434: --- Summary: Add more tests for Scala foreachBatch and streaming listeners (was: Add more tests for Scala fore

[jira] [Created] (SPARK-44436) Session improvement for Scala foreachBatch

2023-07-14 Thread Bo Gao (Jira)
Bo Gao created SPARK-44436: -- Summary: Session improvement for Scala foreachBatch Key: SPARK-44436 URL: https://issues.apache.org/jira/browse/SPARK-44436 Project: Spark Issue Type: Task Com

[jira] [Updated] (SPARK-44400) Improve Scala StreamingQueryListener to provide users a way to access the Spark session for Spark Connect

2023-07-12 Thread Bo Gao (Jira)
[ https://issues.apache.org/jira/browse/SPARK-44400?page=com.atlassian.jira.plugin.system.issuetabpanels:all-tabpanel ] Bo Gao updated SPARK-44400: --- Description: Improve the Listener to provide users a way to access the Spark session and perform arbitrary a

[jira] [Created] (SPARK-44400) Improve Scala StreamingQueryListener to provide users a way to access the Spark session for Spark Connect

2023-07-12 Thread Bo Gao (Jira)
Bo Gao created SPARK-44400: -- Summary: Improve Scala StreamingQueryListener to provide users a way to access the Spark session for Spark Connect Key: SPARK-44400 URL: https://issues.apache.org/jira/browse/SPARK-44400

[jira] [Created] (SPARK-44201) Add support for Streaming Listener in Scala for Spark Connect

2023-06-26 Thread Bo Gao (Jira)
Bo Gao created SPARK-44201: -- Summary: Add support for Streaming Listener in Scala for Spark Connect Key: SPARK-44201 URL: https://issues.apache.org/jira/browse/SPARK-44201 Project: Spark Issue Type

[jira] [Created] (SPARK-44136) StateManager may get materialized in executor instead of driver in FlatMapGroupsWithStateExec

2023-06-21 Thread Bo Gao (Jira)
Bo Gao created SPARK-44136: -- Summary: StateManager may get materialized in executor instead of driver in FlatMapGroupsWithStateExec Key: SPARK-44136 URL: https://issues.apache.org/jira/browse/SPARK-44136 Pro

[jira] [Comment Edited] (SPARK-43511) Implemented State APIs for Spark Connect Scala

2023-06-12 Thread Bo Gao (Jira)
[ https://issues.apache.org/jira/browse/SPARK-43511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17722891#comment-17722891 ] Bo Gao edited comment on SPARK-43511 at 6/12/23 6:59 PM: - Create

[jira] [Commented] (SPARK-43511) Implemented State APIs for Spark Connect Scala

2023-05-15 Thread Bo Gao (Jira)
[ https://issues.apache.org/jira/browse/SPARK-43511?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17722891#comment-17722891 ] Bo Gao commented on SPARK-43511: Created PR https://github.com/apache/spark/pull/40959

[jira] [Created] (SPARK-43511) Implemented State APIs for Spark Connect Scala

2023-05-15 Thread Bo Gao (Jira)
Bo Gao created SPARK-43511: -- Summary: Implemented State APIs for Spark Connect Scala Key: SPARK-43511 URL: https://issues.apache.org/jira/browse/SPARK-43511 Project: Spark Issue Type: Task