Re: Reading file with Unicode characters

2015-04-08 Thread Arun Lists
you may want to vote for this Jira: > https://issues.apache.org/jira/browse/MAPREDUCE-232 > > Yong > > -- > Date: Wed, 8 Apr 2015 10:35:18 -0700 > Subject: Reading file with Unicode characters > From: lists.a...@gmail.com > To: user@spark.apac

RE: Reading file with Unicode characters

2015-04-08 Thread java8964
Yong Date: Wed, 8 Apr 2015 10:35:18 -0700 Subject: Reading file with Unicode characters From: lists.a...@gmail.com To: user@spark.apache.org CC: lists.a...@gmail.com Hi, Does SparkContext's textFile() method handle files with Unicode characters? How about files in UTF-8 format? Going further,

Reading file with Unicode characters

2015-04-08 Thread Arun Lists
Hi, Does SparkContext's textFile() method handle files with Unicode characters? How about files in UTF-8 format? Going further, is it possible to specify encodings to the method? If not, what should one do if the files to be read are in some encoding? Thanks, arun