Re: Hadoop Mapper Intermediate Result Storage with No Reducer

2012-04-25 Thread Harsh J
A map-only job does not write its output to local disk; it writes directly to the configured FileSystem the job is running on (usually HDFS). As far as I know it has always been so, since map-only jobs do no sorting or partitioning and hence need no transient storage. On Thu, Apr 26, 2012 a

Hadoop Mapper Intermediate Result Storage with No Reducer

2012-04-25 Thread Xun TANG
Hi, We are running Hadoop jobs with mappers only, no reducers. After each map computation, the mapper writes its result (i.e. the intermediate result) to local disk. How fast is this write compared with the write speed to HDFS? We used TestDFSIO to benchmark the write speed to HDFS, but we wonder how