Hi,
I am trying to run an ETL job on Spark that involves an expensive shuffle
operation. Basically, I need to perform a self-join on a Spark
DataFrame. The job runs fine for around 15 hours, and when the
stage (which performs the self-join) is about to complete, I get a
*"java.io.IOException:
Hi,
I was testing Spark to read data from Hive using HiveContext. I got the
following error when I used a simple query with constants in predicates.
I am using Spark 1.3. Has anyone encountered an error like this?
*Error:*
Exception in thread "main" org.apache.spark.sql.AnalysisException: