spark-submit fails in kubernetes 1.24.x cluster

2022-12-23 Thread Thimme Gowda TP (Nokia)
Hello, We are facing issue with ingress during spark-submit with kubernetes cluster 1.24.x . We are using spark 3.3.0 to do spark-submit. # kubectl version WARNING: This version information is deprecated and will be replaced with the output from kubectl version --short. Use --output=yaml|json

Re: Check if shuffle is caused for repartitioned pyspark dataframes

2022-12-23 Thread Shivam Verma
Hi Gurunandan, Thanks for the reply! I do see the exchange operator in the SQL tab, but I can see it in both the experiments: 1. Using repartitioned dataframes 2. Using initial dataframes Does that mean that the repartitioned dataframes are not actually "co-partitioned"? If that's the case, I ha

Re: Check if shuffle is caused for repartitioned pyspark dataframes

2022-12-23 Thread Russell Jurney
This may not be good advice but... could you sort by the partition key to ensure the partitions match up? Thinking of olden times :) On Fri, Dec 23, 2022 at 4:42 AM Shivam Verma wrote: > Hi Gurunandan, > > Thanks for the reply! > > I do see the exchange operator in the SQL tab, but I can see it