Hi Mohammed Internally In hive the processing is done using MapReduce. So like in mapreduce the splits are calculated on job submission and a mapper is assigned per split. So a mapper ideally process a split and not a row.
You can store data in various formats as text, sequence files, RC files etc. No restriction just on text files. Regards Bejoy KS Sent from handheld, please excuse typos. -----Original Message----- From: Mohammad Tariq <[email protected]> Date: Fri, 29 Jun 2012 00:17:05 To: user<[email protected]> Reply-To: [email protected] Subject: Hive mapper creation Hello list, Since Hive tables are assumed to be of text input format, is it right to assume that a mapper is created per row of a particular table??Please correct me if my understanding is wrong. Also let me know how mappers are created corresponding to a Hive query. Many thanks. Regards, Mohammad Tariq
