[Spark SQL]:to calculate distance between four coordinates(Latitude1, Longtitude1, Latitude2, Longtitude2) in the pysaprk dataframe

2021-04-09 Thread Rao Bandaru
Hi All, I have a requirement to calculate distance between four coordinates(Latitude1, Longtitude1, Latitude2, Longtitude2) in the pysaprk dataframe with the help of from geopy import distance without using UDF (user defined function),Please help how to achieve this scenario and do the needful.

Re: [Spark SQL]:to calculate distance between four coordinates(Latitude1, Longtitude1, Latitude2, Longtitude2) in the pysaprk dataframe

2021-04-09 Thread Rao Bandaru
, April 9, 2021 6:11 PM To: ayan guha Cc: Rao Bandaru ; User Subject: Re: [Spark SQL]:to calculate distance between four coordinates(Latitude1, Longtitude1, Latitude2, Longtitude2) in the pysaprk dataframe This can be significantly faster with a pandas UDF, note, because you can vectorize the

Load Share point list(file) data to impala table using pyspark

2021-05-28 Thread Rao Bandaru
Hi Team, I have a requirement to load Share point list(file) data to impala table using pyspark Please help me how to achieve this type of requirement Thanks, Ankamma Rao

Re: Load Share point list(file) data to impala table using pyspark

2021-05-28 Thread Rao Bandaru
Hi Team, Could you please provide any update on below requirement Thanks, Ankamma Rao From: Rao Bandaru Sent: Friday, May 28, 2021 1:34 PM To: user@spark.apache.org Subject: Load Share point list(file) data to impala table using pyspark Hi Team, I have a