user

Messages by Thread

[ANNOUNCE] Apache Kyuubi released 1.7.3 Zhen Wang
Spark Connect Multi-tenant Support Kezhi Xiong
Parallel write to different partitions Shrikant Prasad
- Re: Parallel write to different partitions Shrikant Prasad
Need to split incoming data into PM on time column and find the top 5 by volume of data [email protected]
- Re: Need to split incoming data into PM on time column and find the top 5 by volume of data Mich Talebzadeh
PySpark 3.5.0 on PyPI Kezhi Xiong
- Re: PySpark 3.5.0 on PyPI Sean Owen
- Re: PySpark 3.5.0 on PyPI Kezhi Xiong
[Spark 3.5.0] Is the protobuf-java JAR no longer shipped with Spark? Gijs Hendriksen
Create an external table with DataFrameWriterV2 Christophe Préaud
Spark streaming sourceArchiveDir does not move file to archive directory Yunus Emre G?rses
Discriptency sample standard deviation pyspark and Excel Helene Bøe
- Re: Discriptency sample standard deviation pyspark and Excel Sean Owen
- Re: Discriptency sample standard deviation pyspark and Excel Mich Talebzadeh
- Re: Discriptency sample standard deviation pyspark and Excel Sean Owen
- Re: Discriptency sample standard deviation pyspark and Excel Bjørn Jørgensen
- Re: Discriptency sample standard deviation pyspark and Excel Mich Talebzadeh
Urgent: Seeking Guidance on Kafka Slow Consumer and Data Skew Problem Karthick
- Re: Urgent: Seeking Guidance on Kafka Slow Consumer and Data Skew Problem Gowtham S
- Re: Urgent: Seeking Guidance on Kafka Slow Consumer and Data Skew Problem Karthick
getting emails in different order! Mich Talebzadeh
- Re: getting emails in different order! Sean Owen
- Re: getting emails in different order! Mich Talebzadeh
[ANNOUNCE] Apache Kyuubi released 1.7.2 Zhen Wang
About Peak Jvm Memory Onheap Nebi Aydin
Fwd: First Time contribution. ram manickam
- Re: First Time contribution. Denny Lee
- Re: First Time contribution. Haejoon Lee
[Spark Core]: How does rpc threads influence shuffle? Nebi Aydin
Re: Filter out 20% of rows Bjørn Jørgensen
- Re: Filter out 20% of rows Mich Talebzadeh
- Re: Filter out 20% of rows Bjørn Jørgensen
- Re: Filter out 20% of rows Mich Talebzadeh
- Re: Filter out 20% of rows Mich Talebzadeh
- Re: Filter out 20% of rows Bjørn Jørgensen
- Re: Filter out 20% of rows Bjørn Jørgensen
- Re: Filter out 20% of rows [email protected]
Spark stand-alone mode Ilango
- Re: Spark stand-alone mode Patrick Tucci
- Re: Spark stand-alone mode Sean Owen
- Re: Spark stand-alone mode Mich Talebzadeh
- Re: Spark stand-alone mode Bjørn Jørgensen
- Re: Spark stand-alone mode Ilango
- Re: Spark stand-alone mode Patrick Tucci
- Re: Spark stand-alone mode Ilango
Re: Data Duplication Bug Found - Structured Streaming Versions 3..4.1, 3.2.4, and 3.3.2 Craig Alfieri
- Re: Data Duplication Bug Found - Structured Streaming Versions 3..4.1, 3.2.4, and 3.3.2 Jerry Peng
- Re: Data Duplication Bug Found - Structured Streaming Versions 3..4.1, 3.2.4, and 3.3.2 russell . spitzer
- Re: Data Duplication Bug Found - Structured Streaming Versions 3..4.1, 3.2.4, and 3.3.2 Craig Alfieri
- Re: Data Duplication Bug Found - Structured Streaming Versions 3..4.1, 3.2.4, and 3.3.2 Jerry Peng
APACHE Spark adoption/growth chart Andrew Petersen
Write Spark Connection client application in Go bo yang
- Re: Write Spark Connection client application in Go Holden Karau
- Re: Write Spark Connection client application in Go Martin Grund
- Re: Write Spark Connection client application in Go bo yang
Feedback on Testing Guidelines for Data Stream Processing Applications Alexandre Strapacao Guedes Vianna
Re: IDEA compile fail but sbt test succeed Pasha Finkelshteyn
About /mnt/hdfs/current/BP directories Nebi Aydin
- Re: About /mnt/hdfs/current/BP directories Jack Wells
- Re: [External Email] Re: About /mnt/hdfs/current/BP directories Nebi Aydin
- Re: [External Email] Re: About /mnt/hdfs/current/BP directories Jack Wells
- Re: [External Email] Re: About /mnt/hdfs/current/BP directories Nebi Aydin
RE: Spark 3.4.1 and Hive 3.1.3 Agrawal, Sanket
- Re: Spark 3.4.1 and Hive 3.1.3 Yeachan Park
- Re: Spark 3.4.1 and Hive 3.1.3 Chao Sun
- RE: Spark 3.4.1 and Hive 3.1.3 Agrawal, Sanket
- Re: Spark 3.4.1 and Hive 3.1.3 Nagatomi Yasukazu
- RE: Spark 3.4.1 and Hive 3.1.3 Agrawal, Sanket
- RE: Spark 3.4.1 and Hive 3.1.3 Agrawal, Sanket
how can i use spark with yarn cluster in java BCMS
- Re: how can i use spark with yarn cluster in java Mich Talebzadeh
Change default timestamp offset on data load Jack Goodson
- Re: Change default timestamp offset on data load Mich Talebzadeh
- Re: Change default timestamp offset on data load Jack Goodson
- Re: Change default timestamp offset on data load Mich Talebzadeh
- Re: Change default timestamp offset on data load Jack Goodson
Seeking Professional Advice on Career and Personal Growth in the Apache Spark Community Varun Shah
- Re: Seeking Professional Advice on Career and Personal Growth in the Apache Spark Community Mich Talebzadeh
- Re: Seeking Professional Advice on Career and Personal Growth in the Apache Spark Community [email protected]
- Re: Seeking Professional Advice on Career and Personal Growth in the Apache Spark Community Mich Talebzadeh
pyspark.ml.recommendation is using the wrong python version Harry Jamison
- Re: pyspark.ml.recommendation is using the wrong python version Harry Jamison
- Re: pyspark.ml.recommendation is using the wrong python version Mich Talebzadeh
Running Spark Connect Server in Cluster Mode on Kubernetes Nagatomi Yasukazu
- Re: Running Spark Connect Server in Cluster Mode on Kubernetes Cleyson Barros
- Re: Running Spark Connect Server in Cluster Mode on Kubernetes Nagatomi Yasukazu
- Re: Running Spark Connect Server in Cluster Mode on Kubernetes Mich Talebzadeh
- Re: Running Spark Connect Server in Cluster Mode on Kubernetes Nagatomi Yasukazu
- Re: Running Spark Connect Server in Cluster Mode on Kubernetes Nagatomi Yasukazu
- Re: Re: Running Spark Connect Server in Cluster Mode on Kubernetes [email protected]
- Re: Re: Running Spark Connect Server in Cluster Mode on Kubernetes [email protected]
- Re: Re: Running Spark Connect Server in Cluster Mode on Kubernetes Nagatomi Yasukazu
[Spark Connect]Running Spark Connect Server in Cluster Mode on Kubernetes Nagatomi Yasukazu
Reg read json inference schema Manoj Babu
Okio Vulnerability in Spark 3.4.1 Agrawal, Sanket
- Re: Okio Vulnerability in Spark 3.4.1 Sean Owen
- RE: Okio Vulnerability in Spark 3.4.1 Agrawal, Sanket
- Re: Okio Vulnerability in Spark 3.4.1 Sean Owen
- Re: Okio Vulnerability in Spark 3.4.1 Bjørn Jørgensen
- Re: Okio Vulnerability in Spark 3.4.1 Bjørn Jørgensen
- Re: Okio Vulnerability in Spark 3.4.1 Bjørn Jørgensen
CommunityOverCode(CoC) 2023 Uma Maheswara Rao Gangumalla
Registration open for Community Over Code North America Rich Bowen
Two new tickets for Spark on K8s Mich Talebzadeh
Elasticsearch support for Spark 3.x Dipayan Dev
- Re: Elasticsearch support for Spark 3.x Holden Karau
- Re: Elasticsearch support for Spark 3.x Dipayan Dev
- Re: Elasticsearch support for Spark 3.x Koert Kuipers
- Re: Elasticsearch support for Spark 3.x Dipayan Dev
- Re: Elasticsearch support for Spark 3.x Dipayan Dev
- Re: Elasticsearch support for Spark 3.x Dipayan Dev
- Re: Elasticsearch support for Spark 3.x Dipayan Dev
- Re: Elasticsearch support for Spark 3.x Sean Owen
- Re: Elasticsearch support for Spark 3.x Dipayan Dev
- Re: Elasticsearch support for Spark 3.x Sean Owen
Spark 2.4.7 Harry Jamison
- Re: Spark 2.4.7 Varun Shah
- Re: Spark 2.4.7 Harry Jamison
- Re: Spark 2.4.7 Mich Talebzadeh
- Re: Spark 2.4.7 Mich Talebzadeh
mysterious spark.sql.utils.AnalysisException Union in spark 3.3.2, but not seen in 3.4.0+ Srivatsan vn
- Re: mysterious spark.sql.utils.AnalysisException Union in spark 3.3.2, but not seen in 3.4.0+ Mich Talebzadeh
Spark Connect: API mismatch in SparkSesession#execute Stefan Hagedorn
Fwd: 📅 Wednesday: Join 6 Members at "Ofir Press | Complementing Scale: Novel Guidance Methods for Improving LMs" Mich Talebzadeh
[no subject] ayan guha
- Re: leibnitz
$SPARK_HOME/sbin/start-worker.sh spark://{main_host}:{cluster_port} failing Jeremy Brent
- Re: $SPARK_HOME/sbin/start-worker.sh spark://{main_host}:{cluster_port} failing Mich Talebzadeh
Fwd: Recap on current status of "SPIP: Support Customized Kubernetes Schedulers" Mich Talebzadeh
[ANNOUNCE] Apache Spark 3.3.3 released Yuming Wang
error trying to save to database (Phoenix) Kal Stevens
- Re: error trying to save to database (Phoenix) Sean Owen
- Re: error trying to save to database (Phoenix) Kal Stevens
- Re: error trying to save to database (Phoenix) Sean Owen
- Re: error trying to save to database (Phoenix) Kal Stevens
- Re: error trying to save to database (Phoenix) Gera Shegalov
DataFrame cache keeps growing Varun .N
Spark doesn’t create SUCCESS file when external path is passed Dipayan Dev
k8s+ YARN Spark Крюков Виталий Семенович
- Re: k8s+ YARN Spark Mich Talebzadeh
Problem with spark 3.4.1 not finding spark java classes Kal Stevens
- Re: Problem with spark 3.4.1 not finding spark java classes Kal Stevens
- Re: Problem with spark 3.4.1 not finding spark java classes Mich Talebzadeh
- Problem with spark 3.4.1 not finding spark java classes Kal Stevens
- Re: Problem with spark 3.4.1 not finding spark java classes Bjørn Jørgensen
Probable Spark Bug while inserting into flat GCS bucket? Dipayan Dev
- Re: Probable Spark Bug while inserting into flat GCS bucket? Mich Talebzadeh
- Re: Probable Spark Bug while inserting into flat GCS bucket? Dipayan Dev
[Spark Core]: What's difference among spark.shuffle.io.threads Nebi Aydin
- Re: [Spark Core]: What's difference among spark.shuffle.io.threads Mich Talebzadeh
- Re: [External Email] Re: [Spark Core]: What's difference among spark.shuffle.io.threads Nebi Aydin
- Re: [External Email] Re: [Spark Core]: What's difference among spark.shuffle.io.threads Mich Talebzadeh
- Re: [External Email] Re: [Spark Core]: What's difference among spark.shuffle.io.threads Nebi Aydin
- [Spark Core]: What's difference among spark.shuffle.io.threads Nebi Aydin
[no subject] Dipayan Dev
read dataset from only one node in YARN cluster marc nicole
- Re: read dataset from only one node in YARN cluster Mich Talebzadeh
Managing python modules in docker for PySpark? Mich Talebzadeh
why advisoryPartitionSize <= maxShuffledHashJoinLocalMapThreshold ??????
- Re: why advisoryPartitionSize <= maxShuffledHashJoinLocalMapThreshold XiDuo You
Spark Vulnerabilities Sankavi Nagalingam
- Re: Spark Vulnerabilities Bjørn Jørgensen
- Re: Spark Vulnerabilities Sean Owen
- RE: Re: Spark Vulnerabilities Sankavi Nagalingam
- Re: Spark Vulnerabilities Cheng Pan
Spark-SQL - Query Hanging, How To Troubleshoot Patrick Tucci
- Re: Spark-SQL - Query Hanging, How To Troubleshoot Mich Talebzadeh
- Re: Spark-SQL - Query Hanging, How To Troubleshoot Patrick Tucci
- Re: Spark-SQL - Query Hanging, How To Troubleshoot Mich Talebzadeh
- Re: Spark-SQL - Query Hanging, How To Troubleshoot Mich Talebzadeh
- Re: Spark-SQL - Query Hanging, How To Troubleshoot Patrick Tucci
- Re: Spark-SQL - Query Hanging, How To Troubleshoot Stephen Coy
- Re: Spark-SQL - Query Hanging, How To Troubleshoot Mich Talebzadeh
- Re: Spark-SQL - Query Hanging, How To Troubleshoot Patrick Tucci
- Re: Spark-SQL - Query Hanging, How To Troubleshoot Mich Talebzadeh
- Re: Spark-SQL - Query Hanging, How To Troubleshoot Patrick Tucci
- Re: Spark-SQL - Query Hanging, How To Troubleshoot Mich Talebzadeh
- Re: Spark-SQL - Query Hanging, How To Troubleshoot Patrick Tucci
- Re: Spark-SQL - Query Hanging, How To Troubleshoot Mich Talebzadeh
- Re: Spark-SQL - Query Hanging, How To Troubleshoot Patrick Tucci
- Re: Spark-SQL - Query Hanging, How To Troubleshoot Mich Talebzadeh
- Re: Spark-SQL - Query Hanging, How To Troubleshoot Patrick Tucci
- Re: Spark-SQL - Query Hanging, How To Troubleshoot Mich Talebzadeh
- Re: Spark-SQL - Query Hanging, How To Troubleshoot Patrick Tucci
- Re: Spark-SQL - Query Hanging, How To Troubleshoot Mich Talebzadeh
- Re: Spark-SQL - Query Hanging, How To Troubleshoot Patrick Tucci
- Re: Spark-SQL - Query Hanging, How To Troubleshoot Mich Talebzadeh
- Re: Spark-SQL - Query Hanging, How To Troubleshoot Patrick Tucci
- Re: Spark-SQL - Query Hanging, How To Troubleshoot Mich Talebzadeh
[PySpark][UDF][PickleException] Sanket Sharma
- Re: [PySpark][UDF][PickleException] Bjørn Jørgensen
Spark Connect, Master, and Workers Kezhi Xiong
- Re: Spark Connect, Master, and Workers Brian Huynh
- Re: Spark Connect, Master, and Workers James Yu
dockerhub does not contain apache/spark-py 3.4.1 Mark Elliot
- Re: dockerhub does not contain apache/spark-py 3.4.1 Mich Talebzadeh
- Re: dockerhub does not contain apache/spark-py 3.4.1 Mich Talebzadeh
[PySpark] Failed to add file [file:///tmp/app-submodules.zip] specified in 'spark.submit.pyFiles' to Python path: lnxpgn
- Re: [PySpark] Failed to add file [file:///tmp/app-submodules.zip] specified in 'spark.submit.pyFiles' to Python path: Mich Talebzadeh