Hi Gene, Could you give some suggestions on that?
> On Jul 1, 2016, at 5:31 PM, Ted Yu <yuzhih...@gmail.com> wrote: > > The comment from zhangxiongfei was from a year ago. > > Maybe something changed since them ? > > On Fri, Jul 1, 2016 at 12:07 AM, Chanh Le <giaosu...@gmail.com > <mailto:giaosu...@gmail.com>> wrote: > Hi Ted, > I set sc.hadoopConfiguration.setBoolean("fs.hdfs.impl.disable.cache", true) > sc.hadoopConfiguration.setLong("fs.local.block.size", 268435456) > but It seems not working. > > <Screen_Shot_2016-07-01_at_2_06_27_PM.png> > > >> On Jul 1, 2016, at 11:38 AM, Ted Yu <yuzhih...@gmail.com >> <mailto:yuzhih...@gmail.com>> wrote: >> >> Looking under Alluxio source, it seems only "fs.hdfs.impl.disable.cache" is >> in use. >> >> FYI >> >> On Thu, Jun 30, 2016 at 9:30 PM, Deepak Sharma <deepakmc...@gmail.com >> <mailto:deepakmc...@gmail.com>> wrote: >> Ok. >> I came across this issue. >> Not sure if you already assessed this: >> https://issues.apache.org/jira/plugins/servlet/mobile#issue/SPARK-6921 >> <https://issues.apache.org/jira/plugins/servlet/mobile#issue/SPARK-6921> >> The workaround mentioned may work for you . >> >> Thanks >> Deepak >> >> On 1 Jul 2016 9:34 am, "Chanh Le" <giaosu...@gmail.com >> <mailto:giaosu...@gmail.com>> wrote: >> Hi Deepark, >> Thank for replying. The way to write into alluxio is >> df.write.mode(SaveMode.Append).partitionBy("network_id", >> "time").parquet("alluxio://master1:19999/FACT_ADMIN_HOURLY <>”) >> >> >> I partition by 2 columns and store. I just want when I write it automatic >> write a size properly for what I already set in Alluxio 512MB per block. >> >> >>> On Jul 1, 2016, at 11:01 AM, Deepak Sharma <deepakmc...@gmail.com >>> <mailto:deepakmc...@gmail.com>> wrote: >>> >>> Before writing coalesing your rdd to 1 . >>> It will create only 1 output file . >>> Multiple part file happens as all your executors will be writing their >>> partitions to separate part files. >>> >>> Thanks >>> Deepak >>> >>> On 1 Jul 2016 8:01 am, "Chanh Le" <giaosu...@gmail.com >>> <mailto:giaosu...@gmail.com>> wrote: >>> Hi everyone, >>> I am using Alluxio for storage. But I am little bit confuse why I am do set >>> block size of alluxio is 512MB and my file part only few KB and too many >>> part. >>> Is that normal? Because I want to read it fast? Is that many part effect >>> the read operation? >>> How to set the size of file part? >>> >>> Thanks. >>> Chanh >>> >>> >>> >>> >>> >>> <Screen_Shot_2016-07-01_at_9_24_55_AM.png> >> >> > >