Re: ORC tables loading

2015-11-17 Thread James Pirz
ad and write, at least in this case, the time stays roughly > the same. > > Alan. > > James Pirz > November 16, 2015 at 21:23 > Hi, > > I am using Hive 1.2 with ORC tables on Hadoop 2.6 on a cluster. > I load data into an ORC table by reading the data from an external

ORC tables loading

2015-11-16 Thread James Pirz
Hi, I am using Hive 1.2 with ORC tables on Hadoop 2.6 on a cluster. I load data into an ORC table by reading the data from an external table on raw text files and using insert statement: INSERT into TABLE myorctab SELECT * FROM mytxttab; I ran a simple scale-up test to find out how the loading t

Re: Getting dot files for DAGs

2015-10-01 Thread James Pirz
Franke wrote: > Why not use tez ui? > > Le jeu. 1 oct. 2015 à 2:29, James Pirz a écrit : > >> I am using Tez 0.7.0 on Hadopp 2.6 to run Hive queries. >> I am interested in checking DAGs for my queries visually, and I realized >> that I can do that by graphviz once I

Re: Getting dot files for DAGs

2015-09-30 Thread James Pirz
to find the contents of the .dot file within them. > This approach only works if you have YARN log aggregation enabled. > > thanks > — Hitesh > > > On Sep 30, 2015, at 5:29 PM, James Pirz wrote: > > > I am using Tez 0.7.0 on Hadopp 2.6 to run Hive queries. > >

Getting dot files for DAGs

2015-09-30 Thread James Pirz
I am using Tez 0.7.0 on Hadopp 2.6 to run Hive queries. I am interested in checking DAGs for my queries visually, and I realized that I can do that by graphviz once I can get "dot" files of my DAGs. My issue is I can not find those files, they are not in the log directory of Yarn or Hadoop or under

Checking the number of Readers

2015-09-11 Thread James Pirz
I am using Hive 1.2.0 on Hadoop 2.6 (on a cluster with 10 machines) and I am trying to understand the performance of a full-table scan. I am running the following query: SELECT * FROM LINEITEM WHERE L_LINENUMBER < 0; and I am measuring its performance in different scenarios: using "MR vs. Tez" an

Re: Aggregated Expression not in GROUP BY key

2015-07-29 Thread James Pirz
failed to fix it for HiveQL. On Wed, Jul 29, 2015 at 5:03 PM, James Pirz wrote: > Hi, > > I am using Hive 1.2, and I am trying to run some queries based on TPCH > schema. My original query is: > > SELECT N_NAME, AVERAGE(C_ACCTBAL) > FROM customer JOIN nation > on C_NATIONKE

Aggregated Expression not in GROUP BY key

2015-07-29 Thread James Pirz
Hi, I am using Hive 1.2, and I am trying to run some queries based on TPCH schema. My original query is: SELECT N_NAME, AVERAGE(C_ACCTBAL) FROM customer JOIN nation on C_NATIONKEY=N_NATIONKEY GROUP BY N_NAME; for which I get: FAILED: SemanticException [Error 10025]: Line 1:15 Expression not in G

Re: Hive 1.2.0 Unable to start metastore

2015-06-08 Thread James Pirz
metastore server. On Mon, Jun 8, 2015 at 1:11 PM, Slava Markeyev wrote: > Sounds like you ran into this: > https://issues.apache.org/jira/browse/HIVE-9198 > > > On Mon, Jun 8, 2015 at 1:06 PM, James Pirz wrote: > >> Thanks ! >> There was a similar problem: Conflict

Re: Hive 1.2.0 Unable to start metastore

2015-06-08 Thread James Pirz
rue > > On Mon, Jun 8, 2015 at 11:52 AM, James Pirz wrote: > >> I am trying to run Hive 1.2.0 on Hadoop 2.6.0 (on a cluster, running >> CentOS). I am able to start Hive CLI and run queries. But once I try to >> start Hive's metastore (I trying to use the builtin der

Hive 1.2.0 Unable to start metastore

2015-06-08 Thread James Pirz
I am trying to run Hive 1.2.0 on Hadoop 2.6.0 (on a cluster, running CentOS). I am able to start Hive CLI and run queries. But once I try to start Hive's metastore (I trying to use the builtin derby) using: hive --service metastore I keep getting Class Not Found Exceptions for "org.apache.derby.j

Re: Hive 1.2.0 fails on Hadoop 2.6.0

2015-06-07 Thread James Pirz
oClassDefFoundError? > > -Slava > > On Fri, Jun 5, 2015 at 7:28 PM, James Pirz wrote: > >> I am trying to run Apache Hive 1.2.0 on Hadoop 2.6.0 on a cluster. My >> hadoop cluster comes up fine (I start hdfs and yarn) and then I create >> required tmp and warehouse dire

Hive 1.2.0 fails on Hadoop 2.6.0

2015-06-05 Thread James Pirz
I am trying to run Apache Hive 1.2.0 on Hadoop 2.6.0 on a cluster. My hadoop cluster comes up fine (I start hdfs and yarn) and then I create required tmp and warehouse directories in HDFS and I try to start Hive CLI (I do not do anything with HCatalog or Hiveserver2) but I keep getting errors relat