Hi,
 
I am trying to ad a column to a data frame that I created based on a JSON file 
like this:
 
val input =
hiveCtx.jsonFile("wasb://n...@cmwhdinsightdatastore.blob.core.windows.net/json/*").toDF().persist(StorageLevel.MEMORY_AND_DISK)
 
I have a function that is generating the values for the new column:
 
def determineDayPartID(evntStDate: String, evntStHour: String) : Int  = {


    val stFormat = new
java.text.SimpleDateFormat("yyMMdd")


    var stDateStr:String = evntStDate.substring(2,8)


    val stDate:Date = stFormat.parse(stDateStr)


    val stHour = evntStHour.substring(1,3).toDouble + 0.1


    var bucket = Math.ceil(stHour/3.0).toInt


    val cal:Calendar = Calendar.getInstance


    cal.setTime(stDate)


    var dayOfWeek = cal.get(Calendar.DAY_OF_WEEK)


    if (dayOfWeek == 1) dayOfWeek = 8


    if (dayOfWeek > 6) bucket = bucket + 8


    return bucket


  }


When I try:
 
input.withColumn("DayPartID", callUDF(determineDayPartID, IntegerType, 
col("StartDate"), col("EventStartHour")))
 
I am getting the error:
 
missing arguments for
method determineDayPartID in object rating; follow this method with `_' if you
want to treat it as a partially applied function

Can you please help?


Stefan Panayotov, PhD 
Home: 610-355-0919 
Cell: 610-517-5586 
email: spanayo...@msn.com 
spanayo...@outlook.com 
spanayo...@comcast.net
                                          

Reply via email to