date:20220627

[FINAL CALL] - Travel Assistance to ApacheCon New Orleans 2022

2022-06-27 Thread Gavin McDonald

To all committers and non-committers. This is a final call to apply for travel/hotel assistance to get to and stay in New Orleans for ApacheCon 2022. Applications have been extended by one week and so the application deadline is now the 8th July 2022. The rest of this email is a copy of what ha

Understanding about joins in spark

2022-06-27 Thread Sid

Hi Team, As per my understanding, assume it to be a large dataset. When we apply joins, data from different executors are shuffled in such a way that the same "keys" are landed in one partition. So, this is done for both the dataframes, right? For eg: Key A for df1 will be sorted and kept in one