See https://issues.apache.org/jira/browse/SPARK-7301
--
View this message in context:
http://apache-spark-user-list.1001560.n3.nabble.com/Ambiguous-references-to-a-field-set-in-a-partitioned-table-AND-the-data-tp22325p25740.html
Sent from the Apache Spark User List mailing list archive at
d(InternalParquetRecordReader.java:116)
at
parquet.hadoop.InternalParquetRecordReader.nextKeyValue(InternalParquetRecordReader.java:193)
--
View this message in context:
http://apache-spark-user-list.1001560.n3.nabble.com/Ambiguous-references-to-a-field-set-in-a-partitioned-table-AND-the-data-tp22325.html
Sent from
Hi,
I save Parquet files in a partitioned table, so in /path/to/table/myfield=a/ .
But I also kept the field "myfield" in the Parquet data. Thus. when I query the
field, I get this error:
df.select("myfield").show(10)
"Exception in thread "main" org.apache.spark.sql.AnalysisException: Ambigu