Thank you Nitin for reply. I am using mysql database ,and also I can see new 
row created for the partition ,but all values are zero . I think explicitly 
giving mysql data base wont make a difference .

From: Nitin Pawar [mailto:nitinpawar...@gmail.com]
Sent: Tuesday, July 22, 2014 11:05 PM
To: user@hive.apache.org
Subject: Re: Hive Statistics

by default hive stores the statistics in derby database.

If you want a persistent look at column statistics, you may want to create 
mysql based database for column statistics.

Your queries look fine

On Tue, Jul 22, 2014 at 10:50 PM, Navdeep Agrawal 
<navdeep_agra...@symantec.com<mailto:navdeep_agra...@symantec.com>> wrote:

Hi ,



i am trying to compute statistics on ORC File but i am unable see any changes 
in PART_COL_STATS as well on using

set hive.compute.query.using.stats=true;
set hive.stats.reliable=true;
set hive.stats.fetch.column.stats=true;
set hive.stats.fetch.partition.stats=true;
set hive.cbo.enable=true;

to get max value of a column it is running full Map reduce on column ..
what i want to use is max value stored in meta store ,but i am unable to catch 
these statistics .

my table desc is
load_inst_id int
src_filename string
server_date date

my analyze query is
analyze table mytable partition(server_date=’2013-11-30′) compute statistics 
for columns load_inst_id;

i am always getting 0 as loadinstant id ,i have to turn off my 
hive.compute.query.using.stats to get correct result(through map reduce 
max(load_inst_id))




--
Nitin Pawar

Reply via email to