Hi Mohammad,

Internally, Hive does its processing with MapReduce. As in any MapReduce job, the input splits are calculated at job submission and one mapper is assigned per split. So a mapper processes a whole split, not a single row.
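To make the split-to-mapper relationship concrete, here is a rough sketch in Python of how the number of mappers could be estimated from file sizes. It mirrors the split-size rule used by Hadoop's FileInputFormat, max(minSize, min(maxSize, blockSize)), and its 10% slack on the last split. The function names, the 64 MB block size, and the defaults for the minimum and maximum split size are assumptions for illustration only. Hive may also be configured with a combining input format that merges small files into fewer splits, which this sketch does not model.

```python
MB = 1024 * 1024

# FileInputFormat lets the final split be up to 10% larger than the
# split size instead of creating a tiny extra split.
SPLIT_SLOP = 1.1


def compute_split_size(block_size, min_size=1, max_size=2**63 - 1):
    """Split size rule: max(minSize, min(maxSize, blockSize)).

    min_size and max_size stand in for the configured minimum and
    maximum split sizes (illustrative defaults, not cluster values).
    """
    return max(min_size, min(max_size, block_size))


def count_splits(file_size, split_size):
    """Count the splits for one non-empty, splittable file."""
    splits = 0
    remaining = file_size
    # Carve off full-size splits while the remainder is clearly larger
    # than one split.
    while remaining / split_size > SPLIT_SLOP:
        splits += 1
        remaining -= split_size
    # Whatever is left becomes the final split.
    if remaining > 0:
        splits += 1
    return splits


def estimate_mappers(file_sizes, block_size=64 * MB):
    """One mapper per split, summed over all input files of the table."""
    split_size = compute_split_size(block_size)
    return sum(count_splits(size, split_size) for size in file_sizes)


if __name__ == "__main__":
    # A table stored as one 200 MB file and one 70 MB file.
    print(estimate_mappers([200 * MB, 70 * MB]))
```

Note that the row count never appears in the calculation: a 200 MB file gets the same number of splits whether it holds ten rows or ten million.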
You can store data in various formats, such as text, sequence files, RC files, etc. Hive is not restricted to text files.

Regards
Bejoy KS

-----Original Message-----
From: Mohammad Tariq <donta...@gmail.com>
Date: Fri, 29 Jun 2012 00:17:05
To: user<user@hive.apache.org>
Reply-To: user@hive.apache.org
Subject: Hive mapper creation

Hello list,

Since Hive tables are assumed to be of text input format, is it right to assume that a mapper is created per row of a particular table? Please correct me if my understanding is wrong. Also, let me know how mappers are created for a Hive query.

Many thanks.

Regards,
Mohammad Tariq