Edward J. Yoon wrote:

> Hi,
>
> In that case, The atomic unit of split is a file. So, you need to
> increase the number of files. or Use the TextInputFormat as below.
>
> jobConf.setInputFormat(TextInputFormat.class);
>
> On Wed, Apr 22, 2009 at 4:35 PM, nguyenhuynh.mr
> <[email protected]> wrote:
>   
>> Hi all!
>>
>>
>> I have a MR job use to import contents into HBase.
>>
>> The content is text file in HDFS. I used the maps file to store local
>> path of contents.
>>
>> Each content has the map file. ( the map is a text file in HDFS and
>> contain 1 line info).
>>
>>
>> I created the maps directory used to contain map files. And the  this
>> maps directory used to input path for job.
>>
>> When i run job, the number map task is same number map files.
>> Ex: I have 5 maps file -> 5 map tasks.
>>
>> Therefor, the map phase is slowly :(
>>
>> Why the map phase is slowly if the number map task large and the number
>> map task is equal number of files?.
>>
>> * p/s: Run jobs with: 3 node: 1 server and 2 slaver
>>
>> Please help me!
>> Thanks.
>>
>> Best,
>> Nguyen.
>>
>>
>>
>>     
>
>
>
>   
Current, I use TextInputformat to set InputFormat for map phase.

Reply via email to