Re: num_rows is always 0 in statistics

2012-08-29 Thread Hiroyuki Yamada
Hi, Sorry, it works now. Thank you. But, the value is not correct. (about half of real number of rows.) Is this sampled value ? It seems counting every row as far as i checked TableScanOperator.java . Thanks, Hiroyuki On Wed, Aug 29, 2012 at 5:39 PM, Hiroyuki Yamada wrote: > Hi, > >

Re: num_rows is always 0 in statistics

2012-08-29 Thread Hiroyuki Yamada
sue with following way. > > Configure > "hive.stats.dbconnectionstring=jdbc:derby:;databaseName=/home/TempStore". > This works only in single node cluster. > > > Please check HIVE-3324. > > > -Original Message- > From: Hiroyuki Yamada [mailto:mo

num_rows is always 0 in statistics

2012-08-28 Thread Hiroyuki Yamada
Hi, I have run "analyse table" command several times to get statistics, but I always get num_rows=0 like below. (also, raw_data_size is 0) - hive> analyze table lineitem compute statistics; Total MapReduce jobs = 1 Launching Job 1 out of 1 Number of reduce tasks is set to 0 since there's no r

How to replace a input table name inside Hive (How Hive pass input file names to Hadoop ?)

2011-10-27 Thread Hiroyuki Yamada
Hello, I am trying to understand how hive compiles and opmizes HiveQL queries for future development. I would like to know how to replace a input table name in the compilation process. For example, the following HiveQL is queried, SELECT l_orderkey FROM lineitem WHERE l_shipdate < '1993-01-01';

How to see the intermediate results between AST and optimized logical query plan.

2011-10-19 Thread Hiroyuki Yamada
Hello, I have been trying to learn the Hive query compiler and I am wondering if there is a way to see the result of semantic analysis (query block tree) and non-optimized logical query plan. I know we can get AST and optimized logical query plan with "explain", but I want to know the intermediate