Re: Hive on Tez: Tez taking nX more containers than Mapreduce for union all

2017-03-20 Thread Ravi Teja Chilukuri
Hi Gopal, Please find the answers inline. On Fri, Mar 17, 2017 at 9:01 PM, Gopal Vijayaraghavan wrote: > > > We are using a query with union all and groupby and same table is read > multiple times in the union all subquery. > … > > When run with Mapreduce, the job is run in one stage consuming

Hive on Tez: Tez taking nX more containers than Mapreduce for union all

2017-03-16 Thread Ravi Teja Chilukuri
Hi, We are migrating our hive queries from Mapreduce to Tez . We are using a query with union all and groupby and same table is read multiple times in the union all subquery. We have noticed a issue with tez here, it runs with kX times more tasks than MR where k is the number of union alls in the

Re: [Hive on Tez] Running queries in tez non-session mode not working

2017-03-14 Thread Ravi Teja Chilukuri
Hi Gopal, I have tried setting by both the means, setting after launching cli via set command and setting before launching cli using hive --hiveconf tez.am.mode.session=false. Both didn't work. Thanks, Ravi On Wed, Mar 15, 2017 at 12:44 AM, Gopal Vijayaraghavan wrote: > > > > by setting tez.am

[Hive on Tez] Running queries in tez non-session mode not working

2017-03-14 Thread ravi teja
Hi, We wanted to run our hive queries in non-session mode as we have a problem tracking the app ids. Until our tracking is fixed we wanted to run each dag in a separate app master. I have read that running tez in non-session mode will achieve this functionality, by setting *tez.am.mode.session*=f

Re: Quota for rogue ad-hoc queries

2016-09-09 Thread ravi teja
vi On Fri, Sep 2, 2016 at 2:35 PM, ravi teja wrote: > Hi Gopal, > > We are using MR not Tez. > I feel since the adhoc queries data output size is something we can > determine, rather than the time the job takes, I was wondering more from > output size/number of rows quota. >

Re: Quota for rogue ad-hoc queries

2016-09-02 Thread ravi teja
Hi Gopal, We are using MR not Tez. I feel since the adhoc queries data output size is something we can determine, rather than the time the job takes, I was wondering more from output size/number of rows quota. Thanks, Ravi On Fri, Sep 2, 2016 at 2:57 AM, Gopal Vijayaraghavan wrote: > > > Are t

Re: Quota for rogue ad-hoc queries

2016-08-31 Thread ravi teja
ata or any other property which may arise > from relying on this email's technical content is explicitly disclaimed. > The author will in no case be liable for any monetary damages arising from > such loss, damage or destruction. > > > > On 31 August 2016 at 13:42, ravi teja wrot

Quota for rogue ad-hoc queries

2016-08-31 Thread ravi teja
Hi Community, Many users run adhoc hive queries on our platform. Some rogue queries managed to fill up the hdfs space and causing mainstream queries to fail. We wanted to limit the data generated by these adhoc queries. We are aware of strict param which limits the data being scanned, but it is o

Empty file getting included while reading from external table.

2016-01-06 Thread ravi teja
Hi, Sometimes, In the stage 1 of a query which is reading from an external partitioned table, empty file is being read and the job is failing. We are not sure why this is happening , please help us with this issue, we are really stuck. MR Logs: Processing split: Paths: /be0d0d33-1101-4116-b5ae-6

Re: Tables missing on the file system

2015-09-15 Thread ravi teja
file in > > hadoop dfs -ls hdfs://aa:port/location > > > ca...@korea.com > cazen@samsung.com > http://www.Cazen.co.kr <http://www.cazen.co.kr/> > > 2015-09-15 20:03 GMT+09:00 ravi teja : > >> Its an external table, hence I am only creating the table

Re: Tables missing on the file system

2015-09-15 Thread ravi teja
e exists at >>> the time of *'MSCK REPAIR'*. This error occurs, if location does not >>> exists on fs. >>> >>> Thanks, >>> Chetna Chaudhari >>> >>> On 15 September 2015 at 12:03, ravi teja wrote: >>> >>>> Hi

Re: Tables missing on the file system

2015-09-15 Thread ravi teja
ists on fs. > > Thanks, > Chetna Chaudhari > > On 15 September 2015 at 12:03, ravi teja wrote: > >> Hi, >> I am getting this exception when I repair a table. >> Not sure what this means, didnt get any info while searching also. >> >> Can someone

Tables missing on the file system

2015-09-14 Thread ravi teja
Hi, I am getting this exception when I repair a table. Not sure what this means, didnt get any info while searching also. Can someone guide , what this means? CREATE EXTERNAL TABLE IF NOT EXISTS test_table OK Time taken: 0.124 seconds MSCK REPAIR TABLE test_table OK Tables missing o

Re: Repair table doesnt update the transient_lastDdlTime of updated partitions.

2015-08-25 Thread ravi teja
ue, Aug 25, 2015 at 6:43 PM, ravi teja wrote: > >> Sorry For the incomplete mail, sent bymistake >> >> I am working towards a incremental solution on hive based on the >> transient_lastDdlTime of the partitions. >> We mostly deal with hive external tables. >>

Repair table doesnt update the transient_lastDdlTime of updated partitions.

2015-08-25 Thread ravi teja
Hi, I am working towards a incremental solution on hive based on the transient_lastDdlTime of the partitions. If the we in Thanks, Ravi

Re: Repair table doesnt update the transient_lastDdlTime of updated partitions.

2015-08-25 Thread ravi teja
ws an already exists exception. Is there any other way? Thanks in advance. Thanks, Ravi On Tue, Aug 25, 2015 at 9:02 PM, ravi teja wrote: > Hi, > > I am working towards a incremental solution on hive based on the > transient_lastDdlTime of the partitions. > If the we in > > Thanks, > Ravi >

Re: [blocker] ArrayIndexoutofbound in a hive query

2015-07-31 Thread ravi teja
problems. On Fri, Jul 31, 2015 at 12:47 PM, Nitin Pawar wrote: > is there a different output format or the output table bucketed? > can you try putting a not null condition on join columns > > On Fri, Jul 31, 2015 at 12:45 PM, ravi teja wrote: > >> Hi Nithin, >> Thanks

Re: [blocker] ArrayIndexoutofbound in a hive query

2015-07-31 Thread ravi teja
possible share the query and underlying file formats with some sample > data *I cant really share the query.* > > On Fri, Jul 31, 2015 at 12:14 PM, ravi teja wrote: > >> Hi, >> >> We are facing issue with our hive query with ArrayIndexoutofbound >> exception.

[blocker] ArrayIndexoutofbound in a hive query

2015-07-30 Thread ravi teja
Hi, We are facing issue with our hive query with ArrayIndexoutofbound exception. I have tried googling out and I see many users facing the same error, but no solution yet. This is a blocker for our production and we really need help on this. We are using Hive version : 1.3.0. Our query is doing