[ 
https://issues.apache.org/jira/browse/HIVE-3324?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=13426345#comment-13426345
 ] 

rohithsharma commented on HIVE-3324:
------------------------------------

There are 2 problems for analyze command.

1. JDBCStastPublisher fails to connect to database since derby jar is not in 
classpath.In YarnChild logs we get below exception.
{noformat}
2012-08-01 10:25:17,954 ERROR [main] 
org.apache.hadoop.hive.ql.stats.jdbc.JDBCStatsPublisher: Error during 
instantiating JDBC driver org.apache.derby.jdbc.EmbeddedDriver. 
java.lang.ClassNotFoundException: org.apache.derby.jdbc.EmbeddedDriver
        at java.net.URLClassLoader$1.run(URLClassLoader.java:202)
        at java.security.AccessController.doPrivileged(Native Method)
        at java.net.URLClassLoader.findClass(URLClassLoader.java:190)
        at java.lang.ClassLoader.loadClass(ClassLoader.java:306)
        at sun.misc.Launcher$AppClassLoader.loadClass(Launcher.java:301)
        at java.lang.ClassLoader.loadClass(ClassLoader.java:247)
        at java.lang.Class.forName0(Native Method)
        at java.lang.Class.forName(Class.java:169)
        at 
org.apache.hadoop.hive.ql.stats.jdbc.JDBCStatsPublisher.connect(JDBCStatsPublisher.java:69)
        at 
org.apache.hadoop.hive.ql.exec.TableScanOperator.publishStats(TableScanOperator.java:236)
{noformat}

>>> Above problem can be resolved by setting "hive.aux.jars.path" for derby jar.

2. JVM of Hive and YarnChild are different.So dbconnectionstring should be 
common location.If it is set to common location , then there is problme if the 
NodeManager and Hive are running in differetn machine.
                
> analyze command is not gathering "num_rows" present in the table.
> -----------------------------------------------------------------
>
>                 Key: HIVE-3324
>                 URL: https://issues.apache.org/jira/browse/HIVE-3324
>             Project: Hive
>          Issue Type: Bug
>          Components: Statistics
>    Affects Versions: 0.10.0, 0.9.1
>            Reporter: rohithsharma
>
> When analyze command is executed, "collectableStats" i.e num_rows and 
> raw_data_size is always zero even though table contains data.
> bq. [num_partitions: 0, num_files: 1, num_rows: 0, total_size: 5812, 
> raw_data_size: 0]

--
This message is automatically generated by JIRA.
If you think it was sent incorrectly, please contact your JIRA administrators: 
https://issues.apache.org/jira/secure/ContactAdministrators!default.jspa
For more information on JIRA, see: http://www.atlassian.com/software/jira

        

Reply via email to