[ https://issues.apache.org/jira/browse/HIVE-3324?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13426345#comment-13426345 ]
rohithsharma commented on HIVE-3324:
------------------------------------

There are two problems with the analyze command.

1. JDBCStatsPublisher fails to connect to the database because the Derby jar is not on the classpath. The YarnChild logs show the exception below.
{noformat}
2012-08-01 10:25:17,954 ERROR [main] org.apache.hadoop.hive.ql.stats.jdbc.JDBCStatsPublisher: Error during instantiating JDBC driver org.apache.derby.jdbc.EmbeddedDriver.
java.lang.ClassNotFoundException: org.apache.derby.jdbc.EmbeddedDriver
	at java.net.URLClassLoader$1.run(URLClassLoader.java:202)
	at java.security.AccessController.doPrivileged(Native Method)
	at java.net.URLClassLoader.findClass(URLClassLoader.java:190)
	at java.lang.ClassLoader.loadClass(ClassLoader.java:306)
	at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301)
	at java.lang.ClassLoader.loadClass(ClassLoader.java:247)
	at java.lang.Class.forName0(Native Method)
	at java.lang.Class.forName(Class.java:169)
	at org.apache.hadoop.hive.ql.stats.jdbc.JDBCStatsPublisher.connect(JDBCStatsPublisher.java:69)
	at org.apache.hadoop.hive.ql.exec.TableScanOperator.publishStats(TableScanOperator.java:236)
{noformat}
>>> The above problem can be resolved by setting "hive.aux.jars.path" to include the Derby jar.

2. Hive and YarnChild run in different JVMs, so the dbconnectionstring should point to a common location. Even when it is set to a common location, there is still a problem if the NodeManager and Hive are running on different machines.

> analyze command is not gathering "num_rows" present in the table.
> -----------------------------------------------------------------
>
>                 Key: HIVE-3324
>                 URL: https://issues.apache.org/jira/browse/HIVE-3324
>             Project: Hive
>          Issue Type: Bug
>          Components: Statistics
>    Affects Versions: 0.10.0, 0.9.1
>            Reporter: rohithsharma
>
> When the analyze command is executed, "collectableStats", i.e. num_rows and
> raw_data_size, are always zero even though the table contains data.
> bq. [num_partitions: 0, num_files: 1, num_rows: 0, total_size: 5812, raw_data_size: 0]
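
For anyone trying to confirm problem 1 on a task node, here is a minimal sketch of a classpath check. The stack trace shows the failure happening in Class.forName, so the sketch makes the same call. DriverCheck and isLoadable are hypothetical names used only for illustration; they are not part of Hive. The check only says something about the classpath of the JVM it is run in, so it would have to be run with the same classpath as the YarnChild process to be meaningful.

```java
// Hypothetical helper, not part of Hive: reports whether a class (for example
// a JDBC driver) can be loaded by the classloader of the current JVM.
class DriverCheck {

    // Returns true if the named class can be loaded and initialized.
    // Class.forName is the call that fails in the stack trace above.
    static boolean isLoadable(String className) {
        try {
            Class.forName(className);
            return true;
        } catch (ClassNotFoundException | LinkageError e) {
            // Class is missing from the classpath or could not be linked.
            return false;
        }
    }

    public static void main(String[] args) {
        // Defaults to the driver named in the error message; another class
        // name can be passed as the first argument.
        String driver = args.length > 0
                ? args[0]
                : "org.apache.derby.jdbc.EmbeddedDriver";
        System.out.println(driver + " loadable: " + isLoadable(driver));
    }
}
```

If this reports false when run with the task classpath, the Derby jar is not visible to that JVM, which would be consistent with the ClassNotFoundException in the YarnChild log.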