map tasks, you
> just need to play with min and max split size properties.
>
> Regards
> Bejoy KS
>
> Sent from handheld, please excuse typos.
>
> -Original Message-
> From: Mohammad Tariq
> Date: Fri, 29 Jun 2012 00:55:54
> To: ;
> Reply-To: user@h
: ;
Reply-To: user@hive.apache.org
Subject: Re: Hive mapper creation
Thanks a lot for the valuable response Bejoy. Actually I wanted to
know if it is possible to set the size of filesplits or the criterion
on which filesplits are created (in turn controlling the creation of
mappers) for a Hive query
. Feel free to revert if you still have any queries.
>
>
> Regards
> Bejoy KS
>
> Sent from handheld, please excuse typos.
>
> -Original Message-
> From: Mohammad Tariq
> Date: Fri, 29 Jun 2012 00:29:13
> To: ;
> Reply-To: user@hive.apache.org
> Subje
if you still have any queries.
Regards
Bejoy KS
Sent from handheld, please excuse typos.
-Original Message-
From: Mohammad Tariq
Date: Fri, 29 Jun 2012 00:29:13
To: ;
Reply-To: user@hive.apache.org
Subject: Re: Hive mapper creation
Hello Nitin, Bejoy,
Thanks a lot for the
Hello Nitin, Bejoy,
Thanks a lot for the quick response. Could you please tell me
what is the default criterion of split creation??How the splits for a
Hive query are created??(Pardon my ignorance).
Regards,
Mohammad Tariq
On Fri, Jun 29, 2012 at 12:22 AM, Bejoy KS wrote:
> Hi Moha
Hi Mohammed
Internally In hive the processing is done using MapReduce. So like in mapreduce
the splits are calculated on job submission and a mapper is assigned per split.
So a mapper ideally process a split and not a row.
You can store data in various formats as text, sequence files, RC files
mappers are not created per row
but instead they are based on your query and hive configuration
if u set max input size etc
you can actually set max number of mappers you want to set as well to limit
how many maps are launched
On Fri, Jun 29, 2012 at 12:17 AM, Mohammad Tariq wrote:
> Hello list,