Hi Mohammed

Internally In hive the processing is done using MapReduce. So like in mapreduce 
the splits are calculated on job submission and a mapper is assigned per split. 
So a mapper ideally process a split and not a row.

You can store data in various formats as text, sequence files, RC files etc. No 
restriction just on text files.


Regards
Bejoy KS

Sent from handheld, please excuse typos.

-----Original Message-----
From: Mohammad Tariq <[email protected]>
Date: Fri, 29 Jun 2012 00:17:05 
To: user<[email protected]>
Reply-To: [email protected]
Subject: Hive mapper creation

Hello list,

         Since Hive tables are assumed to be of text input format, is
it right to assume that a mapper is created per row of a particular
table??Please correct me if my understanding is wrong. Also let me
know how mappers are created corresponding to a Hive query. Many
thanks.

Regards,
    Mohammad Tariq

Reply via email to