Hello Nitin, Bejoy,
Thanks a lot for the quick response. Could you please tell me
what the default criterion for split creation is? How are the splits for a
Hive query created? (Pardon my ignorance.)
Regards,
Mohammad Tariq
On Fri, Jun 29, 2012 at 12:22 AM, Bejoy KS <[email protected]> wrote:
> Hi Mohammad
>
> Internally, Hive does its processing using MapReduce. So, as in
> MapReduce, the splits are calculated at job submission and a mapper is
> assigned per split. A mapper therefore processes a whole split, not a row.
>
> You can store data in various formats such as text, sequence files, RC files, etc.
> Hive is not restricted to text files.
>
>
> Regards
> Bejoy KS
>
> Sent from handheld, please excuse typos.
>
> -----Original Message-----
> From: Mohammad Tariq <[email protected]>
> Date: Fri, 29 Jun 2012 00:17:05
> To: user<[email protected]>
> Reply-To: [email protected]
> Subject: Hive mapper creation
>
> Hello list,
>
> Since Hive tables are assumed to be of text input format, is
> it right to assume that a mapper is created per row of a particular
> table? Please correct me if my understanding is wrong. Also, let me
> know how mappers are created for a Hive query. Many
> thanks.
>
> Regards,
> Mohammad Tariq
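
For anyone following the thread: the point made above, that splits are calculated at job submission and one mapper is assigned per split, can be sketched roughly as below. This is only an illustration modelled on the split-size rule used by Hadoop's file-based input formats (split size = max(min size, min(max size, block size)), with a 10% slop on the last split). The function names and the sample sizes are made up for the example, and the real number of mappers for a Hive query also depends on which input format Hive is configured to use (a combining input format, for instance, can pack several small files into one split) and on whether the files are splittable.

```python
MB = 1024 * 1024

def compute_split_size(block_size, min_size, max_size):
    """Split size rule modelled on Hadoop's file-based input formats."""
    return max(min_size, min(max_size, block_size))

def count_splits(file_size, split_size, slop=1.1):
    """Rough count of splits (and hence mappers) for one splittable file.

    Assumption: a trailing chunk of up to 10% over the split size is
    folded into the last split instead of becoming its own split.
    Empty files are simply ignored in this sketch.
    """
    splits = 0
    remaining = file_size
    while remaining / split_size > slop:
        splits += 1
        remaining -= split_size
    if remaining > 0:
        splits += 1  # the final (possibly smaller or slightly larger) split
    return splits

if __name__ == "__main__":
    # Hypothetical numbers: 64 MB blocks, no effective min/max limits.
    split_size = compute_split_size(64 * MB, 1, 2**63 - 1)
    for size_mb in (70, 200, 1024):
        n = count_splits(size_mb * MB, split_size)
        print(f"{size_mb} MB file -> {n} split(s), one mapper each")
```

The takeaway is that the mapper count follows the data size and block/split settings, not the number of rows: each mapper reads all the rows that fall inside its split.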