Any idea on this ? How to convert parquet file into JavaRDD<String> ?
*Thanks*, <https://in.linkedin.com/in/ramkumarcs31> On Thu, Mar 31, 2016 at 4:30 PM, Ramkumar V <ramkumar.c...@gmail.com> wrote: > Hi, > > Thanks for the reply. I tried this. It's returning JavaRDD<row> instead > of JavaRDD<String>. How to get JavaRDD<String> ? > > Error : > incompatible types: > org.apache.spark.api.java.JavaRDD<org.apache.spark.sql.Row> cannot be > converted to org.apache.spark.api.java.JavaRDD<java.lang.String> > > > > > > *Thanks*, > <https://in.linkedin.com/in/ramkumarcs31> > > > On Thu, Mar 31, 2016 at 2:57 PM, UMESH CHAUDHARY <umesh9...@gmail.com> > wrote: > >> From Spark Documentation: >> >> DataFrame parquetFile = sqlContext.read().parquet("people.parquet"); >> >> JavaRDD<String> jRDD= parquetFile.javaRDD() >> >> javaRDD() method will convert the DF to RDD >> >> On Thu, Mar 31, 2016 at 2:51 PM, Ramkumar V <ramkumar.c...@gmail.com> >> wrote: >> >>> Hi, >>> >>> I'm trying to read parquet log files in Java Spark. Parquet log files >>> are stored in hdfs. I want to read and convert that parquet file into >>> JavaRDD. I could able to find Sqlcontext dataframe api. How can I read if >>> it is sparkcontext and rdd ? what is the best way to read it ? >>> >>> *Thanks*, >>> <https://in.linkedin.com/in/ramkumarcs31> >>> >>> >> >