I tried to use the default settings and with that it works (at least it
doesn't throw an error). What's weird is that it collects the data on
files/files size etc., but it doesn't compute the row count. Do you have
any idea why that could be? The table is based on a textfile and is
handled as managed table. The log is clean when I use the default
parameters. The stats information is stored in one of the MySQL tables.
On 03/05/2011 11:09 AM, Ning Zhang wrote:
Can you search your /tmp/<username>/hive.log for 'Stats' and see if
there is any error message? You can also log on to mysql and see if
the database you specified in the JDBC URI has been created and if
there is any table in the database.
On Mar 5, 2011, at 7:35 AM, Anja Gruenheid wrote:
I also tried using the original parameters from the wiki (for derby),
but it gives me the same error...
On 03/04/2011 05:18 PM, Anja Gruenheid wrote:
I fixed the XML problem and wrote everything into hive-site.xml. The
update error still exists though.
Anja
On 03/04/2011 09:47 AM, Ajo Fod wrote:
The good news is that this is a simple XML section .. and this
looks like a XML read error.
Try to copy-paste one of the existing properties sections and
pasting over just the name and value strings from the message.
Cheers,
Ajo
On Fri, Mar 4, 2011 at 6:40 AM, Anja Gruenheid
<anja.gruenh...@gatech.edu <mailto:anja.gruenh...@gatech.edu>> wrote:
Hi!
When I add this to hive-site.xml, I get the following exception
when starting Hive:
[Fatal Error] hive-site.xml:31:2: The markup in the document
following the root element must be well-formed.
Exception in thread "main" java.lang.RuntimeException:
org.xml.sax.SAXParseException: The markup in the document
following the root element must be well-formed.
at
org.apache.hadoop.conf.Configuration.loadResource(Configuration.java:1168)
at
org.apache.hadoop.conf.Configuration.loadResources(Configuration.java:1040)
at
org.apache.hadoop.conf.Configuration.getProps(Configuration.java:980)
at
org.apache.hadoop.conf.Configuration.get(Configuration.java:382)
at
org.apache.hadoop.hive.conf.HiveConf.initialize(HiveConf.java:618)
at
org.apache.hadoop.hive.conf.HiveConf.<init>(HiveConf.java:550)
at
org.apache.hadoop.hive.cli.CliDriver.main(CliDriver.java:431)
at sun.reflect.NativeMethodAccessorImpl.invoke0(Native
Method)
at
sun.reflect.NativeMethodAccessorImpl.invoke(NativeMethodAccessorImpl.java:39)
at
sun.reflect.DelegatingMethodAccessorImpl.invoke(DelegatingMethodAccessorImpl.java:25)
at java.lang.reflect.Method.invoke(Method.java:597)
at org.apache.hadoop.util.RunJar.main(RunJar.java:156)
Caused by: org.xml.sax.SAXParseException: The markup in the
document following the root element must be well-formed.
It does not matter what the value is, it always throws this
exception.
Anja
On 03/04/2011 01:48 AM, Ning Zhang wrote:
Hive CLI interprete ';' as the end of a command. You should
put this property in hive-site.xml:
<property>
<name>hive.stats.dbconnectionstring</name>
<value>jdbc:mysql://localhost/mstore</value>
<description>The JDBC conneciton URL. For example,
jdbc:mysql:localhost/stats_db?createDatabaseIfNotExist=true&user=stat_u;password=pass</description>
</property>
On Mar 3, 2011, at 7:28 PM, Anja Gruenheid wrote:
Hi!
I'm trying to gather statistics for tables by using the
autogather functionality. It works for the size of the
table and the number of files, but when I use the
ANALYZE command, it tells me 'could not update stats'
and no row counts are computed. I followed the
instructions on the wiki and set autogather to true, I
also replaced the parameters like this:
set hive.stats.dbclass=jdbc:mysql;
set
hive.stats.dbconnectionstring="jdbc:mysql://localhost/mstore";
set
hive.stats.jdbcdriver="org.apache.mysql.jdbc.EmbeddedDriver";
The problem with the second parameter was that whenever
I specified something after the ';' like suggested in
the wiki, it threw an error. Does anyone have
suggestions what might be wrong?
Thanks,
Anja