Re: Hive mapper creation

2012-06-28 Thread Mohammad Tariq
map tasks, you > just need to play with min and max split size properties. > > Regards > Bejoy KS > > Sent from handheld, please excuse typos. > > -Original Message- > From: Mohammad Tariq > Date: Fri, 29 Jun 2012 00:55:54 > To: ; > Reply-To: user@h

Re: Hive mapper creation

2012-06-28 Thread Bejoy KS
: ; Reply-To: user@hive.apache.org Subject: Re: Hive mapper creation Thanks a lot for the valuable response, Bejoy. Actually, I wanted to know whether it is possible to set the size of file splits, or the criterion by which file splits are created (which in turn controls the creation of mappers), for a Hive query

Re: Hive mapper creation

2012-06-28 Thread Mohammad Tariq
. Feel free to revert if you still have any queries. > > > Regards > Bejoy KS > > Sent from handheld, please excuse typos. > > -Original Message- > From: Mohammad Tariq > Date: Fri, 29 Jun 2012 00:29:13 > To: ; > Reply-To: user@hive.apache.org > Subje

Re: Hive mapper creation

2012-06-28 Thread Bejoy KS
if you still have any queries. Regards Bejoy KS Sent from handheld, please excuse typos. -Original Message- From: Mohammad Tariq Date: Fri, 29 Jun 2012 00:29:13 To: ; Reply-To: user@hive.apache.org Subject: Re: Hive mapper creation Hello Nitin, Bejoy, Thanks a lot for the

Re: Hive mapper creation

2012-06-28 Thread Mohammad Tariq
Hello Nitin, Bejoy, Thanks a lot for the quick response. Could you please tell me what the default criterion for split creation is? How are the splits for a Hive query created? (Pardon my ignorance.) Regards, Mohammad Tariq On Fri, Jun 29, 2012 at 12:22 AM, Bejoy KS wrote: > Hi Moha

Re: Hive mapper creation

2012-06-28 Thread Bejoy KS
Hi Mohammad, Internally, Hive does its processing using MapReduce. So, as in MapReduce, the splits are calculated at job submission and one mapper is assigned per split. A mapper therefore ideally processes a split, not a row. You can store data in various formats such as text, sequence files, RC files
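
To make the split-per-mapper point concrete, here is a rough sketch of how the min and max split size settings change the number of splits (and so mappers) for one file. It assumes the plain Hadoop FileInputFormat rule, split size = max(minSize, min(maxSize, blockSize)), and ignores the small slack Hadoop allows on the last split. It also does not model input formats that combine small files. The function names and the sizes used are made up for illustration; they are not taken from this thread.

```python
MB = 1024 * 1024

def split_size(block_size, min_size, max_size):
    """Split size under the assumed rule: max(minSize, min(maxSize, blockSize))."""
    return max(min_size, min(max_size, block_size))

def count_splits(file_size, block_size, min_size=1, max_size=2**63 - 1):
    """Approximate number of splits (one mapper each) for a single file."""
    if file_size == 0:
        return 0
    size = split_size(block_size, min_size, max_size)
    # Integer ceiling division: a trailing partial chunk still needs its own split.
    return -(-file_size // size)

if __name__ == "__main__":
    one_gb = 1024 * MB
    # Defaults: the split size falls back to the block size.
    print(count_splits(one_gb, 64 * MB))
    # Lowering the max split size below the block size gives more mappers.
    print(count_splits(one_gb, 64 * MB, max_size=32 * MB))
    # Raising the min split size above the block size gives fewer mappers.
    print(count_splits(one_gb, 64 * MB, min_size=128 * MB))
```

So, under these assumptions, a max split size smaller than the block size increases the mapper count, and a min split size larger than the block size reduces it.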

Re: Hive mapper creation

2012-06-28 Thread Nitin Pawar
Mappers are not created per row; instead, their number depends on your query and your Hive configuration. If you set the max input size etc., you can also set the maximum number of mappers, to limit how many maps are launched. On Fri, Jun 29, 2012 at 12:17 AM, Mohammad Tariq wrote: > Hello list,