user

Messages by Thread

- Re: Issue with Spark Session Initialization in Kubernetes Deployment Mich Talebzadeh
Select Columns from Dataframe in Java PRASHANT L
- Re: Select Columns from Dataframe in Java Grisha Weintraub
- Re: Select Columns from Dataframe in Java PRASHANT L
- Re: Select Columns from Dataframe in Java Grisha Weintraub
Fwd: the life cycle shuffle Dependency yang chen
- Re: the life cycle shuffle Dependency murat migdisoglu
Pyspark UDF as a data source for streaming Поротиков Станислав Вячеславович
- Re: Pyspark UDF as a data source for streaming Mich Talebzadeh
- RE: Pyspark UDF as a data source for streaming Поротиков Станислав Вячеславович
- RE: Pyspark UDF as a data source for streaming Поротиков Станислав Вячеславович
- RE: Pyspark UDF as a data source for streaming Поротиков Станислав Вячеславович
- Re: Pyspark UDF as a data source for streaming Hyukjin Kwon
- Re: Pyspark UDF as a data source for streaming Mich Talebzadeh
- RE: Pyspark UDF as a data source for streaming Поротиков Станислав Вячеславович
- Re: Pyspark UDF as a data source for streaming Mich Talebzadeh
- Re: Pyspark UDF as a data source for streaming Mich Talebzadeh
- Re: Pyspark UDF as a data source for streaming Mich Talebzadeh
Re: Validate spark sql Nicholas Chammas
- Re: Validate spark sql Mich Talebzadeh
- Re: Validate spark sql ram manickam
- 回复：Validate spark sql tianlangstudio
- Re: Validate spark sql Mich Talebzadeh
- Re: Validate spark sql Bjørn Jørgensen
- Re: Validate spark sql Gourav Sengupta
- Re: Validate spark sql Bjørn Jørgensen
India Scala & Big Data Job Referral sri hari kali charan Tummala
About shuffle partition size Nebi Aydin
[ANNOUNCE] Apache Spark 3.3.4 released Dongjoon Hyun
Architecture of Spark Connect Nikhil Goyal
- Re: Architecture of Spark Connect Nikhil Goyal
- Re: Architecture of Spark Connect Kezhi Xiong
- Re: Architecture of Spark Connect Hyukjin Kwon
Does Spark support role-based authentication and access to Amazon S3? (Kubernetes cluster deployment) Patil, Atul
- Does Spark support role-based authentication and access to Amazon S3? (Kubernetes cluster deployment) Atul Patil
- Re: Does Spark support role-based authentication and access to Amazon S3? (Kubernetes cluster deployment) Koert Kuipers
- Re: Does Spark support role-based authentication and access to Amazon S3? (Kubernetes cluster deployment) Mich Talebzadeh
- Re: Does Spark support role-based authentication and access to Amazon S3? (Kubernetes cluster deployment) Mich Talebzadeh
Cluster-mode job compute-time/cost metrics Jack Wells
- Re: Cluster-mode job compute-time/cost metrics Jörn Franke
- Re: Cluster-mode job compute-time/cost metrics murat migdisoglu
Spark 3.1.3 with Hive dynamic partitions fails while driver moves the staged files Shay Elbaz
Spark on Java 17 Faiz Halde
- RE: Spark on Java 17 Luca Canali
- Re: Spark on Java 17 Faiz Halde
- Re: Spark on Java 17 Jörn Franke
- Re: Spark on Java 17 Jörn Franke
SSH Tunneling issue with Apache Spark Venkatesan Muniappan
- Re: SSH Tunneling issue with Apache Spark Venkatesan Muniappan
- Re: SSH Tunneling issue with Apache Spark Nicholas Chammas
- Re: SSH Tunneling issue with Apache Spark Venkatesan Muniappan
ordering of rows in dataframe Som Lima
- Re: ordering of rows in dataframe Enrico Minack
ML advice Zahid Rahman
Do we have any mechanism to control requests per second for a Kafka connect sink? Yeikel Santana
- Re: Do we have any mechanism to control requests per second for a Kafka connect sink? Yeikel Santana
Spark-Connect: Param `--packages` does not take effect for executors. Xiaolong Wang
- Re: Spark-Connect: Param `--packages` does not take effect for executors. Aironman DirtDiver
- Re: Spark-Connect: Param `--packages` does not take effect for executors. Holden Karau
[PySpark][Spark Dataframe][Observation] Why empty dataframe join doesn't let you get metrics from observation? Михаил Кулаков
- Re: [PySpark][Spark Dataframe][Observation] Why empty dataframe join doesn't let you get metrics from observation? Enrico Minack
- Re: [PySpark][Spark Dataframe][Observation] Why empty dataframe join doesn't let you get metrics from observation? Enrico Minack
- Re: [PySpark][Spark Dataframe][Observation] Why empty dataframe join doesn't let you get metrics from observation? Михаил Кулаков
ML using Spark Connect Faiz Halde
[FYI] SPARK-45981: Improve Python language test coverage Dongjoon Hyun
- Re: [FYI] SPARK-45981: Improve Python language test coverage Hyukjin Kwon
[Streaming (DStream) ] : Does Spark Streaming supports pause/resume consumption of message from Kafka? Saurabh Agrawal (180813)
- Re: [Streaming (DStream) ] : Does Spark Streaming supports pause/resume consumption of message from Kafka? Mich Talebzadeh
[ANNOUNCE] Apache Spark 3.4.2 released Dongjoon Hyun
- Re:[ANNOUNCE] Apache Spark 3.4.2 released beliefer
[sql] how to connect query stage to Spark job/stages? Chenghao Lyu
Tuning Best Practices Bryant Wright
- Re: Tuning Best Practices Jack Goodson
- Re: Tuning Best Practices Bryant Wright
Classpath isolation per SparkSession without Spark Connect Faiz Halde
- Re: Classpath isolation per SparkSession without Spark Connect Holden Karau
- Re: Classpath isolation per SparkSession without Spark Connect Faiz Halde
- Re: Classpath isolation per SparkSession without Spark Connect Pasha Finkelshtein
- Re: Classpath isolation per SparkSession without Spark Connect Faiz Halde
- Re: Classpath isolation per SparkSession without Spark Connect Pasha Finkelshtein
Re: Spark structured streaming tab is missing from spark web UI Jungtaek Lim
[Spark-sql 3.2.4] Wrong Statistic INFO From 'ANALYZE TABLE' Command Nick Luo
Query fails on CASE statement depending on order of summed columns Evgenii Ignatev
How exactly does dropDuplicatesWithinWatermark work? Perfect Stranger
- Re: How exactly does dropDuplicatesWithinWatermark work? Jungtaek Lim
Setting fs.s3a.aws.credentials.provider through a connect server. Leandro Martelli
Spark-submit without access to HDFS Eugene Miretsky
- Re: Spark-submit without access to HDFS [email protected]
- Re: [EXTERNAL] Re: Spark-submit without access to HDFS Eugene Miretsky
- Re: Re: [EXTERNAL] Re: Spark-submit without access to HDFS [email protected]
- Re: Spark-submit without access to HDFS Jörn Franke
- Re: Spark-submit without access to HDFS Mich Talebzadeh
- Re: [EXTERNAL] Re: Spark-submit without access to HDFS Eugene Miretsky
- Re: [EXTERNAL] Re: Spark-submit without access to HDFS Eugene Miretsky
- Re: [EXTERNAL] Re: Spark-submit without access to HDFS Mich Talebzadeh
- Re: [EXTERNAL] Re: [EXTERNAL] Re: Spark-submit without access to HDFS Eugene Miretsky
[Spark Structured Streaming] Two sink from Single stream Subash Prabanantham
The job failed when we upgraded from spark 3.3.1 to spark3.4.1 Hanyu Huang
- The job failed when we upgraded from spark 3.3.1 to spark3.4.1 Hanyu Huang
- RE: The job failed when we upgraded from spark 3.3.1 to spark3.4.1 Stevens, Clay
- The job failed when we upgraded from spark 3.3.1 to spark3.4.1 Hanyu Huang
Why create/drop/alter/rename partition does not post listener event in ExternalCatalogWithListener? 李响
Pass xmx values to SparkLauncher launched Java process Deepthi Sathia Raj
How grouping rows without shuffle Yoel Benharrous
help needed with SPARK-45598 and SPARK-45769 Maksym M
Storage Partition Joins only works for buckets? Arwin Tio
org.apache.ranger.authorization.hive.authorizer.RangerHiveAuthorizerFactory ClassNotFoundException Yi Zheng
[ANNOUNCE] Apache Kyuubi released 1.8.0 Cheng Pan
Spark master shuts down when one of zookeeper dies Kaustubh Ghode
- Re: Spark master shuts down when one of zookeeper dies Mich Talebzadeh
How to configure authentication from a pySpark client to a Spark Connect server ? Xiaolong Wang
[Spark SQL] [Bug] Adding `checkpoint()` causes "column [...] cannot be resolved" error Robin Zimmerman
Parser error when running PySpark on Windows connecting to GCS Richard Smith
- Re: Parser error when running PySpark on Windows connecting to GCS Mich Talebzadeh
Data analysis issues Jauru Lin
- Re: Data analysis issues Mich Talebzadeh
Spark / Scala conflict Harry Jamison
- Re: Spark / Scala conflict Aironman DirtDiver
- Re: Spark / Scala conflict Harry Jamison
Fixed byte array issue KhajaAsmath Mohammed
jackson-databind version mismatch moshik.vitas
- Re: jackson-databind version mismatch [email protected]
- Re: jackson-databind version mismatch Bjørn Jørgensen
- Re: jackson-databind version mismatch Bjørn Jørgensen
- Re: Re: jackson-databind version mismatch [email protected]
- RE: jackson-databind version mismatch moshik.vitas
Elasticity and scalability for Spark in Kubernetes Mich Talebzadeh
[Structured Streaming] Joins after aggregation don't work in streaming Andrzej Zera
- Re: [Structured Streaming] Joins after aggregation don't work in streaming Jungtaek Lim
- Re: [Structured Streaming] Joins after aggregation don't work in streaming Andrzej Zera
spark schema conflict behavior records being silently dropped Carlos Aguni
submitting tasks failed in Spark standalone mode due to missing failureaccess jar file [email protected]
Contribution Recommendations Phil Dakin
Maximum executors in EC2 Machine KhajaAsmath Mohammed
- Re: Maximum executors in EC2 Machine Riccardo Ferrari
automatically/dinamically renew aws temporary token Carlos Aguni
- Re: automatically/dinamically renew aws temporary token Jörn Franke
- Re: automatically/dinamically renew aws temporary token Pol Santamaria
- Re: automatically/dinamically renew aws temporary token Carlos Aguni
Spark join produce duplicate rows in resultset Meena Rajani
- Re: Spark join produce duplicate rows in resultset Patrick Tucci
- Re: Spark join produce duplicate rows in resultset Sadha Chilukoori
- Re: Spark join produce duplicate rows in resultset Bjørn Jørgensen
- Re: Spark join produce duplicate rows in resultset Meena Rajani
Error when trying to get the data from Hive Materialized View Siva Sankar Reddy
spark.stop() cannot stop spark connect session [email protected]
- [Resolved] Re: spark.stop() cannot stop spark connect session [email protected]
"Premature end of Content-Length" Error Sandhya Bala
hive: spark as execution engine. class not found problem Amirhossein Kabiri
- Re: hive: spark as execution engine. class not found problem Vijay Shankar
[ANNOUNCE] Apache Celeborn(incubating) 0.3.1 available Cheng Pan
[ SPARK SQL ]: PPER in WHERE condition is not working in Apache Spark 3.5.0 for Mysql ENUM Column Suyash Ajmera
- Re: [ SPARK SQL ]: UPPER in WHERE condition is not working in Apache Spark 3.5.0 for Mysql ENUM Column Suyash Ajmera
- Re: [ SPARK SQL ]: UPPER in WHERE condition is not working in Apache Spark 3.5.0 for Mysql ENUM Column Suyash Ajmera
Can not complete the read csv task Kelum Perera
- Fw: Can not complete the read csv task Kelum Perera
- Fwd: Fw: Can not complete the read csv task KP Youtuber
- Re: Can not complete the read csv task Khalid Mammadov
Autoscaling in Spark Kiran Biswal
- Re: Autoscaling in Spark Mich Talebzadeh
Log file location in Spark on K8s Agrawal, Sanket
- Re: Log file location in Spark on K8s Prashant Sharma
Clarification with Spark Structured Streaming [email protected]
- Re: Clarification with Spark Structured Streaming Mich Talebzadeh
- Re: Clarification with Spark Structured Streaming [email protected]
- Re: Clarification with Spark Structured Streaming Mich Talebzadeh
- Re: Clarification with Spark Structured Streaming Danilo Sousa
Spark Compatibility with Spring Boot 3.x Ahmed Albalawi
- Re: Spark Compatibility with Spring Boot 3.x Sean Owen
- Re: Spark Compatibility with Spring Boot 3.x Angshuman Bhattacharya
- RE: Re: Spark Compatibility with Spring Boot 3.x Guru Panda
Connection pool shut down in Spark Iceberg Streaming Connector Agrawal, Sanket
- Re: Connection pool shut down in Spark Iceberg Streaming Connector Prashant Sharma
- Re: Connection pool shut down in Spark Iceberg Streaming Connector Igor Calabria
[PySpark Structured Streaming] How to tune .repartition(N) ? Shao Yang Hong
- Re: [PySpark Structured Streaming] How to tune .repartition(N) ? Raghavendra Ganesh
- Re: [PySpark Structured Streaming] How to tune .repartition(N) ? Shao Yang Hong
- Re: [PySpark Structured Streaming] How to tune .repartition(N) ? Perez
- [PySpark Structured Streaming] How to tune .repartition(N) ? Shao Yang Hong
- Re: [PySpark Structured Streaming] How to tune .repartition(N) ? Mich Talebzadeh
[Spark Core]: Recomputation cost of a job due to executor failures Faiz Halde
Updating delta file column data Karthick Nk
- Re: Updating delta file column data Karthick Nk
- Re: Updating delta file column data Mich Talebzadeh
- Re: Updating delta file column data Mich Talebzadeh
using facebook Prophet + pyspark for forecasting - Dataframe has less than 2 non-NaN rows karan alang
Seeking Guidance on Spark on Kubernetes Secrets Configuration Jon Rodríguez Aranguren
- Re: Seeking Guidance on Spark on Kubernetes Secrets Configuration Jörn Franke
- Re: Seeking Guidance on Spark on Kubernetes Secrets Configuration Jayabindu Singh
- Re: Seeking Guidance on Spark on Kubernetes Secrets Configuration Mich Talebzadeh
- Re: Seeking Guidance on Spark on Kubernetes Secrets Configuration Jörn Franke
- Re: Seeking Guidance on Spark on Kubernetes Secrets Configuration Jon Rodríguez Aranguren
- Re: Seeking Guidance on Spark on Kubernetes Secrets Configuration Jörn Franke
Thread dump only shows 10 shuffle clients Nebi Aydin
Files io threads vs shuffle io threads Nebi Aydin
Inquiry about Processing Speed Haseeb Khalid
- Re: Inquiry about Processing Speed Deepak Goel
- Re: Inquiry about Processing Speed Jack Goodson
Reading Glue Catalog Views through Spark. Agrawal, Sanket
[PySpark][Spark logs] Is it possible to dynamically customize Spark logs? Ayman Rekik