Hi all!

I have a MR job use to import contents into HBase.

The content is text file in HDFS. I used the maps file to store local
path of contents.

Each content has the map file. ( the map is a text file in HDFS and
contain 1 line info).


I created the maps directory used to contain map files. And the  this
maps directory used to input path for job.

When i run job, the number map task is same number map files.
Ex: I have 5 maps file -> 5 map tasks.

Therefor, the map phase is slowly :(

Why the map phase is slowly if the number map task large and the number
map task is equal number of files?.

* p/s: Run jobs with: 3 node: 1 server and 2 slaver

Please help me!
Thanks.

Best,
Nguyen.


Reply via email to