Hi all,

Our data has lots of human-readable column names (names that include spaces). Is it possible to use these with Parquet and DataFrames?
When I try to write the DataFrame I get the following error (I am using PySpark):

`AnalysisException: Attribute name "Name with Space" contains invalid character(s) among " ,;{}()\n\t=". Please use alias to rename it.`

How can I alias that column name?

`df['Name with Space'] = df['Name with Space'].alias('Name')` doesn't work, as you can't assign to a DataFrame column.

`df.withColumnRenamed('Name with Space', 'Name')` overwrites the column and doesn't alias it.

Any ideas?

Thanks
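For reference, here is a minimal sketch of one possible workaround, assuming it is acceptable for the Parquet file to store sanitized names instead of the human-readable ones. The set of invalid characters is copied from the error message above. The helpers `sanitize`, `build_rename_map` and `with_parquet_safe_names` are hypothetical names made up for illustration, not part of PySpark; only `df.columns`, `df[...]`, `Column.alias` and `df.select` are real API calls.

```python
import re

# Characters rejected on write, copied from the AnalysisException message.
_INVALID = re.compile(r"[ ,;{}()\n\t=]")


def sanitize(name):
    """Replace each invalid character with an underscore."""
    return _INVALID.sub("_", name)


def build_rename_map(columns):
    """Map each original name to a sanitized name, keeping results unique."""
    mapping, used = {}, set()
    for old in columns:
        base = sanitize(old)
        candidate, n = base, 1
        # If two originals sanitize to the same name, append a counter.
        while candidate in used:
            n += 1
            candidate = f"{base}_{n}"
        used.add(candidate)
        mapping[old] = candidate
    return mapping


def with_parquet_safe_names(df):
    """Return a new DataFrame with every column aliased to a safe name.

    `df` is assumed to be a PySpark DataFrame; the original is not modified.
    """
    mapping = build_rename_map(df.columns)
    return df.select(*[df[old].alias(new) for old, new in mapping.items()])
```

With this, something like `with_parquet_safe_names(df).write.parquet(path)` might get past the error, where `path` is a placeholder for the output location. The mapping returned by `build_rename_map` could be kept around to restore the readable names after reading the file back.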