Re: API Problem

Enrico Minack Fri, 10 Jun 2022 04:39:24 -0700

Hi Sid,

||finalDF = finalDF.repartition(finalDF.rdd.getNumPartitions()).withColumn("status_for_batch", call_to_cust_bulk_api(policyUrl,to_json(struct(*colsListToBePassed)))) | |

You are calling ||withColumn|| with the result of||call_to_cust_bulk_api|| as the second argument. That result looks likeit is of type string. But ||withColumn|| expects type ||Column||. Youcan turn that string into a ||Column|| using ||lit||:

||finalDF = finalDF.repartition(finalDF.rdd.getNumPartitions()).withColumn("status_for_batch", lit(call_to_cust_bulk_api(policyUrl,to_json(struct(*colsListToBePassed))))) ||

You are saying that gives you an error of column not iterable. I reckonthe ||struct(*colsListToBePassed))|| is wrong.

Method ||struct|| requires a single string followed by a list ofstrings. Given your ||colsListToBePassed|| is a list of strings, thisdoes not work. Try:

|| struct(||||||colsListToBePassed.head,||colsListToBePassed.tail|||||: _*|))||


Alternatively, ||struct|| requires a list of ||Column||, so try this:

||  struct(||||||colsListToBePassed.map(col)|||||||: _*|))||

The API is pretty clear about the types it expects.


If you are still having errors, you better please paste the code and error.

Enrico



Am 09.06.22 um 21:31 schrieb Sid:

Hi Experts,
I am facing one problem while passing a column to the method. Theproblem is described in detail here:
https://stackoverflow.com/questions/72565095/how-to-pass-columns-as-a-json-record-to-the-api-method-using-pyspark

TIA,
Sid

Re: API Problem

Reply via email to