Hi Raihan

The propery 'mapred.jobtracker.maxtasks.per.job' is a JobTracker level one and 
not a task level one. Hence you cannot override it at task level. You need to 
make modifications in mapred_site.xml  also you may need to rebounce the JT as 
well for the new value to come into effect.
 
Regards,
Bejoy KS


________________________________
 From: Raihan Jamal <jamalrai...@gmail.com>
To: user@hive.apache.org 
Sent: Thursday, October 4, 2012 5:24 AM
Subject: Re: org.apache.hadoop.ipc.RemoteException(java.io.IOException: 
java.io.IOException
 

Just to add here
SojTimestampToDate will return data in this format only 2012/02/29 17:01:43





Raihan Jamal



On Wed, Oct 3, 2012 at 4:46 PM, Raihan Jamal <jamalrai...@gmail.com> wrote:

This is still not working as in the XML file the final property has been set as 
true so that means I cannot override it. 
>And this below simple query is also throwing same exception-
>
>
>SELECT event.app_payload ['n'] AS changed_cguid
>FROM soj_session_container a LATERAL VIEW explode(a.events) t AS event
>WHERE a.dt = '20120918'
>AND unix_timestamp(SojTimestampToDate(event.event_timestamp), 'yyyy/MM/dd 
>HH:mm:ss') >= unix_timestamp('2012/09/18 00:00:00', 'yyyy/MM/dd HH:mm:ss')
>AND unix_timestamp(SojTimestampToDate(event.event_timestamp), 'yyyy/MM/dd 
>HH:mm:ss') <= unix_timestamp('2012/09/18 02:00:00', 'yyyy/MM/dd HH:mm:ss')
>
>
>Exception I am getting:-
>
>
>Job Submission failed with exception 
>'org.apache.hadoop.ipc.RemoteException(java.io.IOException: 
>java.io.IOException: The number of tasks for this job 2070929 exceeds the 
>configured limit 200000
> 
>
>
>
>Any other suggestion what should I do to overcome this problem? May be any 
>changes in the query can overcome this problem?
>
>
>Raihan Jamal
>
>
>
>On Wed, Oct 3, 2012 at 2:59 PM, Chalcy Raja <chalcy.r...@careerbuilder.com> 
>wrote:
>
>Hi Raihan,
>> 
>>You can set it in hive prompt like below,
>>set mapred.jobtracker.maxtasks.per.job=7777777; 
>> 
>>To see if it is set just type in hive prompt, set;  and you’ll see this 
>>parameter in the output.
>> 
>>Hope this helps,
>>Chalcy
>> 
>>From:Raihan Jamal [mailto:jamalrai...@gmail.com] 
>>Sent: Wednesday, October 03, 2012 5:51 PM
>>To: user@hive.apache.org
>>Subject: Re: org.apache.hadoop.ipc.RemoteException(java.io.IOException: 
>>java.io.IOException
>> 
>>Ok. Found the issue I guess.
>> 
>>This is the below settings we have in the  mapred-site.xml for the 
>>site-specific configuration in Hadoop. And that is the reason exception is 
>>getting thrown.
>> 
>><property>
>>    <!-- 10,000 is 100 tasks per node on a 100-node cluster -->
>>    <name>mapred.jobtracker.maxtasks.per.job</name>
>>    <value>200000</value>
>>    <final>true</final>
>>  </property>
>> 
>>How can I override these changes manually from the Hive prompt? Any 
>>suggestions?
>>
>>
>>
>>Raihan Jamal
>>
>>
>>
>>On Wed, Oct 3, 2012 at 2:19 PM, Raihan Jamal <jamalrai...@gmail.com> wrote:
>>Can anyone help me out here? What does the below error means? And this is the 
>>query I am using-
>> 
>>SELECT cguid,
>>event_item,
>>event_timestamp,
>>event_site_id
>>FROM (
>>SELECT event.app_payload ['n'] AS cguid,
>>event.app_payload ['itm'] AS event_item,
>>max(event.event_timestamp) AS event_timestamp,
>>event.site_id AS event_site_id
>>FROM soj_session_container a LATERAL VIEW explode(a.events) t AS event
>>WHERE a.dt = '20120917'
>>AND event.app_payload ['n'] IS NOT NULL
>>AND instr(event.app_payload ['itm'], '%') = 0
>>AND event.app_payload ['itm'] IS NOT NULL
>>AND (
>>event.page_type_id = '4340'
>>OR event.page_type_id = '2047675'
>>)
>>GROUP BY event.app_payload ['n'],
>>event.site_id,
>>event.app_payload ['itm']
>>ORDER BY cguid,
>>event_timestamp DESC
>>) m
>>LEFT OUTER JOIN (
>>SELECT event.app_payload ['n'] AS changed_cguid
>>FROM soj_session_container a LATERAL VIEW explode(a.events) t AS event
>>WHERE a.dt = '20120918'
>>AND unix_timestamp(SojTimestampToDate(event.event_timestamp), 'yyyy/MM/dd 
>>HH:mm:ss') >= unix_timestamp('2012/09/18 00:00:00', 'yyyy/MM/dd HH:mm:ss')
>>AND unix_timestamp(SojTimestampToDate(event.event_timestamp), 'yyyy/MM/dd 
>>HH:mm:ss') <= unix_timestamp('2012/09/18 02:00:00', 'yyyy/MM/dd HH:mm:ss')
>>) n ON m.cguid = n.changed_cguid
>>WHERE n.changed_cguid IS NULL
>>
>>
>>
>>Raihan Jamal
>>
>>
>>
>>On Wed, Oct 3, 2012 at 11:05 AM, Raihan Jamal <jamalrai...@gmail.com> wrote:
>>I am running a Hive query and I am getting this exception below-
>> 
>>Job Submission failed with exception 
>>'org.apache.hadoop.ipc.RemoteException(java.io.IOException: 
>>java.io.IOException: The number of tasks for this job 2072020 exceeds the 
>>configured limit 200000
>> 
>>I am not sure what does this error means? Can anyone help me out here?
>>
>>
>>
>>Raihan Jamal
>> 
>> 
>> 
>

Reply via email to