Hi,

In that case, The atomic unit of split is a file. So, you need to
increase the number of files. or Use the TextInputFormat as below.

jobConf.setInputFormat(TextInputFormat.class);

On Wed, Apr 22, 2009 at 4:35 PM, nguyenhuynh.mr
<[email protected]> wrote:
> Hi all!
>
>
> I have a MR job use to import contents into HBase.
>
> The content is text file in HDFS. I used the maps file to store local
> path of contents.
>
> Each content has the map file. ( the map is a text file in HDFS and
> contain 1 line info).
>
>
> I created the maps directory used to contain map files. And the  this
> maps directory used to input path for job.
>
> When i run job, the number map task is same number map files.
> Ex: I have 5 maps file -> 5 map tasks.
>
> Therefor, the map phase is slowly :(
>
> Why the map phase is slowly if the number map task large and the number
> map task is equal number of files?.
>
> * p/s: Run jobs with: 3 node: 1 server and 2 slaver
>
> Please help me!
> Thanks.
>
> Best,
> Nguyen.
>
>
>



-- 
Best Regards, Edward J. Yoon
[email protected]
http://blog.udanax.org

Reply via email to