FW: Table statistics

2015-12-15 Thread peter.mar...@baesystems.com
Sorry, wrong list. Z From: peter.mar...@baesystems.com [mailto:peter.mar...@baesystems.com] Sent: 15 December 2015 09:39 To: user@hive.apache.org Subject: Table statistics Hi, I was wondering if there is any "recognized" way to obtain table statistics. Ideally, given a Key range I wou

Table statistics

2015-12-15 Thread peter.mar...@baesystems.com
Hi, I was wondering if there is any "recognized" way to obtain table statistics. Ideally, given a Key range I would like to know the number of distinct rowids, entries and amount of data (in bytes) in that key range. I assume that Accumulo holds at least some of this information

Guaranteeing Partition Statistics & Deprecating Table Statistics

2013-02-13 Thread Bhushan Mandhani
Hi All, In HIVE-3959, I'm working actively on guaranteeing accuracy of physical stats. For context, the status quo in Hive is that both Table stats and Partition stats exist but are quite unreliable (even with hive.stats.reliable set to true). Either stats should be reliable or they should not

Re: Table Statistics In Hive

2012-04-03 Thread Nitin Pawar
I am not sure of this, but I am just making a guess from what I have learned. For getting the row count I always run select count(1) from table. desc extended table shows the entire table metadata information but nothing about the records inside the table. On Mon, Apr 2, 2012 at 9:20 PM, Ladda, Anand wrote:

Table Statistics In Hive

2012-04-02 Thread Ladda, Anand
I've tried to collect statistics on an existing table in Hive using the commands mentioned in this wiki page - https://cwiki.apache.org/confluence/display/Hive/StatsDev ANALYZE TABLE [TABLENAME] PARTITION(partcol1=..., partcol2=) COMPUTE STATISTICS. But when I do a DESCRIBE EXTENDED [TABLENAME