I tried the default settings, and with those it works (at least it doesn't throw an error). What's weird is that it collects data on the number of files, file sizes, etc., but it doesn't compute the row count. Do you have any idea why that could be? The table is based on a text file and is handled as a managed table. The log is clean when I use the default parameters. The stats information is stored in one of the MySQL tables.

On 03/05/2011 11:09 AM, Ning Zhang wrote:
Can you search your /tmp/<username>/hive.log for 'Stats' and see if there is any error message? You can also log on to mysql and see if the database you specified in the JDBC URI has been created and if there is any table in the database.

On Mar 5, 2011, at 7:35 AM, Anja Gruenheid wrote:

I also tried using the original parameters from the wiki (for derby), but it gives me the same error...

On 03/04/2011 05:18 PM, Anja Gruenheid wrote:
I fixed the XML problem and wrote everything into hive-site.xml. The update error still exists though.

Anja

On 03/04/2011 09:47 AM, Ajo Fod wrote:
The good news is that this is a simple XML section, and this looks like an XML read error.

Try copying one of the existing property sections and pasting just the name and value strings from the message over it.

Cheers,
Ajo
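
A quick way to localize this kind of read error before starting Hive is to run the file through any XML parser. The following is a minimal sketch in Python, not part of Hive; the function name check_well_formed and the two sample documents are made up for illustration. It assumes, as one possible cause, that a <property> block was pasted after the closing </configuration> tag, which is consistent with the "following the root element" wording of the error but is not confirmed anywhere in this thread.

```python
import xml.etree.ElementTree as ET


def check_well_formed(xml_text):
    """Return None if xml_text parses, else a 'line:column: message' string."""
    try:
        ET.fromstring(xml_text)
    except ET.ParseError as err:
        line, column = err.position
        return f"{line}:{column}: {err}"
    return None


# Hypothetical hive-site.xml with the property inside the root element.
GOOD = """<?xml version="1.0"?>
<configuration>
  <property>
    <name>hive.stats.dbconnectionstring</name>
    <value>jdbc:mysql://localhost/mstore</value>
  </property>
</configuration>
"""

# Hypothetical broken variant: the property sits after </configuration>.
BAD = """<?xml version="1.0"?>
<configuration>
</configuration>
<property>
  <name>hive.stats.dbconnectionstring</name>
  <value>jdbc:mysql://localhost/mstore</value>
</property>
"""

if __name__ == "__main__":
    for label, text in (("good", GOOD), ("bad", BAD)):
        problem = check_well_formed(text)
        print(label, "->", problem or "well-formed")
```

To check a real file, read it and pass the contents to check_well_formed; the reported line and column should point at the first element that falls outside the root.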

On Fri, Mar 4, 2011 at 6:40 AM, Anja Gruenheid <anja.gruenh...@gatech.edu> wrote:

    Hi!

    When I add this to hive-site.xml, I get the following exception
    when starting Hive:

    [Fatal Error] hive-site.xml:31:2: The markup in the document following the root element must be well-formed.
    Exception in thread "main" java.lang.RuntimeException: org.xml.sax.SAXParseException: The markup in the document following the root element must be well-formed.
           at org.apache.hadoop.conf.Configuration.loadResource(Configuration.java:1168)
           at org.apache.hadoop.conf.Configuration.loadResources(Configuration.java:1040)
           at org.apache.hadoop.conf.Configuration.getProps(Configuration.java:980)
           at org.apache.hadoop.conf.Configuration.get(Configuration.java:382)
           at org.apache.hadoop.hive.conf.HiveConf.initialize(HiveConf.java:618)
           at org.apache.hadoop.hive.conf.HiveConf.<init>(HiveConf.java:550)
           at org.apache.hadoop.hive.cli.CliDriver.main(CliDriver.java:431)
           at sun.reflect.NativeMethodAccessorImpl.invoke0(Native Method)
           at sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
           at sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
           at java.lang.reflect.Method.invoke(Method.java:597)
           at org.apache.hadoop.util.RunJar.main(RunJar.java:156)
    Caused by: org.xml.sax.SAXParseException: The markup in the document following the root element must be well-formed.

    It does not matter what the value is, it always throws this
    exception.

    Anja




    On 03/04/2011 01:48 AM, Ning Zhang wrote:

        The Hive CLI interprets ';' as the end of a command. You should
        put this property in hive-site.xml:

        <property>
        <name>hive.stats.dbconnectionstring</name>
        <value>jdbc:mysql://localhost/mstore</value>
        <description>The JDBC connection URL. For example, jdbc:mysql:localhost/stats_db?createDatabaseIfNotExist=true&amp;user=stat_u;password=pass</description>
        </property>
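
To see why a connection string containing ';' breaks when typed as a set command, here is a minimal sketch in Python. It assumes the CLI cuts its input at every ';' without looking at quotes; that is a simplification for illustration, not Hive's actual parsing code. The function name split_commands and the user and password values in the URL are hypothetical.

```python
def split_commands(line):
    """Naively split a CLI input line into commands at every ';'."""
    return [part.strip() for part in line.split(";") if part.strip()]


# Hypothetical set command whose value contains a ';' inside the JDBC URL.
cmd = (
    'set hive.stats.dbconnectionstring='
    '"jdbc:mysql://localhost/stats_db?user=stat_u;password=pass";'
)

parts = split_commands(cmd)

if __name__ == "__main__":
    # The URL is cut in two: the first piece is an incomplete set command,
    # the second piece is not a valid command at all.
    for i, part in enumerate(parts, start=1):
        print(i, part)
```

Under that assumption the value never reaches the configuration intact, which is why placing it in hive-site.xml, where ';' has no special meaning, avoids the problem.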

        On Mar 3, 2011, at 7:28 PM, Anja Gruenheid wrote:

            Hi!

            I'm trying to gather statistics for tables by using the
            autogather functionality. It works for the size of the
            table and the number of files, but when I use the
            ANALYZE command, it tells me 'could not update stats'
            and no row counts are computed. I followed the
            instructions on the wiki and set autogather to true. I
            also replaced the parameters like this:

            set hive.stats.dbclass=jdbc:mysql;

            set hive.stats.dbconnectionstring="jdbc:mysql://localhost/mstore";

            set hive.stats.jdbcdriver="org.apache.mysql.jdbc.EmbeddedDriver";

            The problem with the second parameter was that whenever
            I specified something after the ';', as suggested in
            the wiki, it threw an error. Does anyone have
            suggestions about what might be wrong?

            Thanks,
            Anja






