Re: How to load a big csv to dataframe in Spark 1.6

2017-01-03 Thread Steve Loughran
On 31 Dec 2016, at 16:09, Raymond Xie mailto:xie3208...@gmail.com>> wrote: Hello Felix, I followed the instruction and ran the command: > $SPARK_HOME/bin/spark-shell --packages com.databricks:spark-csv_2.11:1.5.0 and I received the following error message: java.lang.RuntimeException: java.net

Re: How to load a big csv to dataframe in Spark 1.6

2016-12-31 Thread Felix Cheung
___ From: Raymond Xie mailto:xie3208...@gmail.com>> Sent: Friday, December 30, 2016 6:46:11 PM To: user@spark.apache.org<mailto:user@spark.apache.org> Subject: How to load a big csv to dataframe in Spark 1.6 Hello, I see there is usually this way to load a c

Re: How to load a big csv to dataframe in Spark 1.6

2016-12-31 Thread Raymond Xie
ote: > Have you tried the spark-csv package? > > https://spark-packages.org/package/databricks/spark-csv > > > -- > *From:* Raymond Xie > *Sent:* Friday, December 30, 2016 6:46:11 PM > *To:* user@spark.apache.org > *Subject:* How to load a

Re: How to load a big csv to dataframe in Spark 1.6

2016-12-30 Thread Raymond Xie
-- > *From:* Raymond Xie > *Sent:* Friday, December 30, 2016 6:46:11 PM > *To:* user@spark.apache.org > *Subject:* How to load a big csv to dataframe in Spark 1.6 > > Hello, > > I see there is usually this way to load a csv to dataframe: > > sqlContext = SQLCo

Re: How to load a big csv to dataframe in Spark 1.6

2016-12-30 Thread Raymond Xie
- Original message From: Raymond Xie Date: 31/12/2016 10:46 (GMT+08:00) To: user@spark.apache.org Subject: How to load a big csv to dataframe in Spark 1.6 Hello, I see there is usually this way to load a csv to dataframe: sqlContext = SQLContext(sc) Employee_rdd = sc.textFile("\

Re: How to load a big csv to dataframe in Spark 1.6

2016-12-30 Thread theodondre
Re: How to load a big csv to dataframe in Spark 1.6 Hi Raymond, Your problem is to pass those 100 fields to .toDF() method?? Sent from my Samsung device Original message From: Raymond Xie Date: 31/12/2016 10:46 (GMT+08:00) To: user@spark.apache.org Subject: How to

Re: How to load a big csv to dataframe in Spark 1.6

2016-12-30 Thread Felix Cheung
Have you tried the spark-csv package? https://spark-packages.org/package/databricks/spark-csv From: Raymond Xie Sent: Friday, December 30, 2016 6:46:11 PM To: user@spark.apache.org Subject: How to load a big csv to dataframe in Spark 1.6 Hello, I see there is

Re: How to load a big csv to dataframe in Spark 1.6

2016-12-30 Thread write2sivakumar@gmail
Hi Raymond, Your problem is to pass those 100 fields to .toDF() method?? Sent from my Samsung device Original message From: Raymond Xie Date: 31/12/2016 10:46 (GMT+08:00) To: user@spark.apache.org Subject: How to load a big csv to dataframe in Spark 1.6 Hello, I

How to load a big csv to dataframe in Spark 1.6

2016-12-30 Thread Raymond Xie
Hello, I see there is usually this way to load a csv to dataframe: sqlContext = SQLContext(sc) Employee_rdd = sc.textFile("\..\Employee.csv") .map(lambda line: line.split(",")) Employee_df = Employee_rdd.toDF(['Employee_ID','Employee_name']) Employee_df.show() However in my cas