ListBucketing feature does not support uppercase string.

2016-05-05 Thread Jim Green
Hi Team, I found that when an uppercase string is used as the skew value, ListBucketing does not work. https://issues.apache.org/jira/browse/HIVE-13697 has been filed. For example: 1. This is good: CREATE TABLE testskew (id INT, a STRING) SKEWED BY (a) ON ('abc', 'xyz') STORED AS DIRECTORIES; set hive.mapre

Any way to show current user name on beeline

2016-02-11 Thread Jim Green
Hi Team, I could not find a way to show current logged-on user on beeline. Is there any way to show that? Something like: Show current_user; ? -- Thanks, www.openkb.info (Open KnowledgeBase for Hadoop/Database/OS/Network/Tool)

Difference between hive.mapjoin.smalltable.filesize and hive.auto.convert.join.noconditionaltask.size

2016-01-31 Thread Jim Green
Sharing an article about the difference between hive.mapjoin.smalltable.filesize and hive.auto.convert.join.noconditionaltask.size, both of which can control the behavior of map join. http://www.openkb.info/2016/01/difference-between-hivemapjoinsmalltabl.html -- Thanks, www.openkb.info

Which user should start the local task if Hive impersonation is enabled

2015-11-10 Thread Jim Green
Hi Team, I am trying to understand what the expected behavior is when Hive impersonation is enabled. Say the HiveServer2 process is running as userA, and userB is connecting through beeline. If userB creates a table, the table file should be owned by userB because impersonation is enabled. However, if userB is

Re: Table level stats are not shown after insert starting in Hive 0.13?

2015-09-03 Thread Jim Green
Adding Dev user list. Could somebody help take a look? On Thu, Sep 3, 2015 at 12:25 PM, Jim Green wrote: > Also tried Hive 1.0, and the result is the same as Hive 0.13. > Is there any reason why we do not print the table level stats for > partition table? > > On Thu, Sep 3, 2

Re: Table level stats are not shown after insert starting in Hive 0.13?

2015-09-03 Thread Jim Green
Also tried Hive 1.0, and the result is the same as Hive 0.13. Is there any reason why we do not print the table level stats for partition table? On Thu, Sep 3, 2015 at 10:41 AM, Jim Green wrote: > *Hive 0.12:* > After insert SQL: > Partition default.mytablepar{id=111} stats: [num

Table level stats are not shown after insert starting in Hive 0.13?

2015-09-03 Thread Jim Green
*Hive 0.12:* After insert SQL: Partition default.mytablepar{id=111} stats: [num_files: 1, num_rows: 0, total_size: 4, raw_data_size: 0] Table default.mytablepar stats: [num_partitions: 1, num_files: 1, num_rows: 0, total_size: 4, raw_data_size: 0] *Hive 0.13:* After insert SQL: Partition default.m
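When comparing insert output across Hive versions, the stats lines quoted above can be checked mechanically. The following is a minimal Python sketch, assuming the log lines have exactly the `Partition <name> stats: [...]` / `Table <name> stats: [...]` shape shown in the Hive 0.12 output; the function names `parse_stats_line` and `has_table_level_stats` are hypothetical helpers, not part of Hive.

```python
import re

# Matches the stats lines as they appear in the Hive 0.12 output quoted above.
# Assumption: one stats entry per line, with integer values only.
STATS_LINE = re.compile(r"^(Partition|Table) (\S+) stats: \[(.*)\]$")


def parse_stats_line(line):
    """Return (scope, name, stats_dict), or None if the line is not a stats line."""
    match = STATS_LINE.match(line.strip())
    if match is None:
        return None
    scope, name, body = match.groups()
    stats = {}
    for pair in body.split(","):
        key, value = pair.split(":")
        stats[key.strip()] = int(value)
    return scope, name, stats


def has_table_level_stats(lines):
    """True if at least one 'Table ... stats' line is present in the output."""
    parsed = (parse_stats_line(line) for line in lines)
    return any(item is not None and item[0] == "Table" for item in parsed)


if __name__ == "__main__":
    hive_012_output = [
        "Partition default.mytablepar{id=111} stats: "
        "[num_files: 1, num_rows: 0, total_size: 4, raw_data_size: 0]",
        "Table default.mytablepar stats: "
        "[num_partitions: 1, num_files: 1, num_rows: 0, total_size: 4, raw_data_size: 0]",
    ]
    print(has_table_level_stats(hive_012_output))
```

Running the same check against output captured from a later Hive version would show whether the table-level line is still printed.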

Re: Wrong results from join query in Hive 0.13 and also 1.0 with reproduce.

2015-09-02 Thread Jim Green
oid this bug if you > rewrite your query such that it violates one of the requirements (listed > above) to trigger the bug. > > Ashutosh > > > On Wed, Sep 2, 2015 at 10:19 AM, Jim Green wrote: > >> Hi Ashutosh, >> >> Is Hive-10841 related? from the title of that

Re: Wrong results from join query in Hive 0.13 and also 1.0 with reproduce.

2015-09-02 Thread Jim Green
> Thanks, > Ashutosh > > On Tue, Sep 1, 2015 at 6:00 PM, Jim Green wrote: > >> Seems Hive 1.2 fixed this issue. But not sure what is the JIRA related >> and the possibility to backport this fix into Hive 0.13? >> >> >> On Tue, Sep 1, 2015 at 5:35 PM, Jim Green

Re: Wrong results from join query in Hive 0.13 and also 1.0 with reproduce.

2015-09-01 Thread Jim Green
Seems Hive 1.2 fixed this issue. But not sure what is the JIRA related and the possibility to backport this fix into Hive 0.13? On Tue, Sep 1, 2015 at 5:35 PM, Jim Green wrote: > Hi Team, > > Below is the minimum reproduce of wrong results in Hive 0.13: > > *1. Create 4 t

Wrong results from join query in Hive 0.13 and also 1.0 with reproduce.

2015-09-01 Thread Jim Green
Hi Team, Below is the minimum reproduce of wrong results in Hive 0.13: *1. Create 4 tables* CREATE EXTERNAL TABLE testjoin1( joincol string ); CREATE EXTERNAL TABLE testjoin2( anothercol string , joincol string); CREATE EXTERNAL TABLE testjoin3( anothercol string); CREATE EXTERNAL TABLE t

Re: Reading tables will lock entire database in Hive 0.13

2015-08-31 Thread Jim Green
I found the answer…hive.txn.manager is added starting in hive 0.13. Thanks all. On Mon, Aug 31, 2015 at 3:54 PM, Jim Green wrote: > Hi Team, > > This issue only happens on hive 0.13, not in Hive 0.12. > Does anybody know what is the change between 0.12 and 0.13 which caused

Re: Reading tables will lock entire database in Hive 0.13

2015-08-31 Thread Jim Green
sue. > > Thanks > > On Thu, Feb 26, 2015 at 1:52 PM, Jim Green wrote: > >> Hi Chaoyu, >> >> Thanks for the information. >> Actually by default, if we set hive.support.concurrency=true in Hive >> 0.13: >> >> hive.lock.manager=org.apac

Tez: Any way to avoid creating subdirectories by "Insert with union all"?

2015-08-19 Thread Jim Green
Hi Team, Below insert with union-all will create sub-directories: set hive.execution.engine=tez; create table h1_passwords_target like h1_passwords; insert overwrite table h1_passwords_target select * from (select * from h1_passwords limit 1 union all select * from h1_passwords limit 2 ) sub

UDTF fails with java.lang.ClassCastException

2015-08-06 Thread Jim Green
Hi Team, One UDTF fails in Hive 1.0 with below stacktrace: Caused by: java.lang.ClassCastException: org.apache.hadoop.hive.serde2.lazy.LazyString cannot be cast to org.apache.hadoop.io.Text at org.apache.hadoop.hive.serde2.objectinspector.primitive.WritableStringObjectInspector.getPrimitiveJavaOb

Re: Hive on Tez query failed with "wrong key class"

2015-07-27 Thread Jim Green
ove IFile should be specific to Tez. Hive does not have that code to check the key class and value class. Does anyone know why Tez checks the key and value classes when sorting? Thanks. On Tue, Jul 21, 2015 at 5:26 PM, Jim Green wrote: > > Sample stacktrace is : > [Error: F

Re: Hive on Tez query failed with "wrong key class"

2015-07-21 Thread Jim Green
(HiveContextAwareRecordReader.java:358) ... 21 more ], On Tue, Jul 21, 2015 at 11:26 AM, Bikas Saha wrote: > A full stack trace would help determine if this is a Tez issue or Hive > issue. > > > > *From:* Jim Green [mailto:openkbi...@gmail.com] > *Sent:* Tuesday, J

Hive on Tez query failed with "wrong key class"

2015-07-21 Thread Jim Green
Hi Team, Env: Hive 1.0 on Tez 0.5.3 Query is a simple group-by on top of sequence table. It fails with below error on tez mode: *java.lang.RuntimeException: org.apache.hadoop.hive.ql.metadata.HiveException: * *java.io.IOException: java.io.IOException: wrong key class: org.apache.hadoop.io.BytesWr

Re: Hive Tez support matrix

2015-07-07 Thread Jim Green
entation for that. The source of truth is > probably the release pom.xml :) > > > > Bikas > > > > *From:* Jim Green [mailto:openkbi...@gmail.com] > *Sent:* Tuesday, July 07, 2015 2:58 PM > *To:* user@hive.apache.org > *Cc:* u...@tez.apache.org > *Subject:* Re

Re: Hive Tez support matrix

2015-07-07 Thread Jim Green
, Bikas Saha wrote: > That would be in the hive documentation because it’s the dependent > project and determines its compatibility with downstream projects like Tez. > > > > *From:* Jim Green [mailto:openkbi...@gmail.com] > *Sent:* Tuesday, July 07, 2015 10:38 AM > *To:* u

Re: Hive Tez support matrix

2015-07-07 Thread Jim Green
.x are compatible with > Tez-0.5.x. > > I believe Hive 0.13 is compatible only with Tez 0.4. > For Hive 0.14 onwards (including the Hive-1.x releases), they should > work with anything in the range of Tez versions: 0.5.2 <= x <= 0.7.x. > > thanks > -- Hitesh >
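The version ranges given in this reply can be written down as a small lookup. This is only a sketch of the rule as stated in the thread (Hive 0.13 with Tez 0.4; Hive 0.14 and later, including 1.x, with Tez 0.5.2 through 0.7.x), not an official support matrix. The function names `parse_version` and `tez_supported` are hypothetical, and combinations the reply does not cover return `None`.

```python
def parse_version(text):
    """Turn a dotted version such as '0.5.3' into a tuple (0, 5, 3)."""
    return tuple(int(part) for part in text.split("."))


def tez_supported(hive_version, tez_version):
    """Apply the ranges quoted in the thread.

    Returns True or False for Hive versions the reply covers,
    and None for Hive versions it does not mention.
    """
    hive = parse_version(hive_version)
    tez = parse_version(tez_version)

    # "Hive 0.13 is compatible only with Tez 0.4."
    if hive[:2] == (0, 13):
        return tez[:2] == (0, 4)

    # "For Hive 0.14 onwards ... Tez versions: 0.5.2 <= x <= 0.7.x"
    if hive >= (0, 14):
        return tez >= (0, 5, 2) and tez[:2] <= (0, 7)

    # Older Hive releases are not covered by the reply.
    return None


if __name__ == "__main__":
    for hive, tez in [("1.0", "0.5.3"), ("0.13", "0.4.1"), ("1.2", "0.8.1")]:
        print(hive, tez, tez_supported(hive, tez))
```

As another reply in the thread suggests, the release pom.xml is probably the more reliable source for a specific Hive release.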

Hive Tez support matrix

2015-07-07 Thread Jim Green
Hi Team, Is there any Hive <-> Tez support matrix? For example, which version of Tez should Hive 1.2 run on? Which versions of Hive does Tez 0.5.3 support? etc… My understanding is that it does not matter which version of Hive and which version of Tez. -- Thanks, www.openkb.info (Open KnowledgeBas

Re: Is hive 0.13 index working fine on partition tables?

2015-07-06 Thread Jim Green
Does anyone know the JIRAs related to this issue? On Mon, Jun 29, 2015 at 2:35 PM, Jim Green wrote: > Hi Team, > > On hive 0.13, I have a minimum reproduce for index on partition table > issue: > CREATE TABLE test_partition_index( > id1 bigint, > id2 bigint, > id3 bigint)

Hive 1.0 on Tez 0.5.3 reproduced HIVE-8204

2015-07-01 Thread Jim Green
Hi Team, Is HIVE-8204 really fixed or not? Below test can reproduce exactly the same stacktrace when dynamic partition pruning is enabled. *1. Create test tables* create table h1_fact(id int, id2 string) PARTITIONED BY (mmdd string) row format delimited fields terminated by ','; create table

Is hive 0.13 index working fine on partition tables?

2015-06-29 Thread Jim Green
Hi Team, On hive 0.13, I have a minimum reproduce for index on partition table issue: CREATE TABLE test_partition_index( id1 bigint, id2 bigint, id3 bigint) PARTITIONED BY ( dt string) row format delimited fields terminated by ','; cat sampledata 111,222,333 LOAD DATA LOCAL INPATH 'sampledata' O

Hive transaction feature in Hive 1.0

2015-06-12 Thread Jim Green
Hi Team, Sharing the article which explains the Hive transaction features in Hive 1.0: Hive transaction feature in Hive 1.0 -- Thanks, www.openkb.info (Open KnowledgeBase for Hadoop/Database/OS/Network/Tool)

Understanding Hive Outer Join Behavior

2015-06-05 Thread Jim Green
Sharing one article about how to understand Hive Outer Join Behavior. -- Thanks, www.openkb.info (Open KnowledgeBase for Hadoop/Database/OS/Network/Tool)

Re: Why hive 0.13 will initialize derby database if the metastore parameters are not set in hive-site.xml?

2015-03-06 Thread Jim Green
Also, starting from Hive 0.13, hive.stats.dbclass=fs by default, so I think autostats may not be related here. Some other feature may need Derby…. On Fri, Mar 6, 2015 at 6:37 PM, Jim Green wrote: > Hi Edward, > > No matter if I set hive.stats.autogather=false in hive-si

Re: Why hive 0.13 will initialize derby database if the metastore parameters are not set in hive-site.xml?

2015-03-06 Thread Jim Green
:34 metastore_db Thanks. On Fri, Mar 6, 2015 at 6:17 PM, Edward Capriolo wrote: > Make sure hive autogather stats is false. Or set up the stats db > > > On Friday, March 6, 2015, Jim Green wrote: > >> Hi Team, >> >> Starting from hive 0.13, if the metastore parameters

Why hive 0.13 will initialize derby database if the metastore parameters are not set in hive-site.xml?

2015-03-06 Thread Jim Green
Hi Team, Starting from hive 0.13, if the metastore parameters are not set in hive-site.xml, but we set in .hiverc, hive will try to initialize derby database in current working directory. This behavior did not exist in hive 0.12. Is it a known bug? or behavior change? I have the repro as below:

Re: Reading tables will lock entire database in Hive 0.13

2015-02-26 Thread Jim Green
s > > Chaoyu > > On Wed, Feb 25, 2015 at 11:57 PM, Alan Gates wrote: > >> What lock manager are you using? >> >> Alan. >> >> Jim Green >> February 25, 2015 at 17:24 >> Hi Team, >> >> Here is old thread about this issue: >>

Reading tables will lock entire database in Hive 0.13

2015-02-25 Thread Jim Green
Hi Team, Here is an old thread about this issue: http://qnalist.com/questions/5091386/doubt-about-locking-mechanism-in-hive I hit the same issue: when reading one table in session A, we cannot create a table in the same database in session B. Basically it causes a database-level lock in Hive 0.13

Regarding configuration of Storage based authorization in Hive 0.12

2015-01-16 Thread Jim Green
Hi Team, Per the documentation for Hive Storage based authorization in Hive, it has 2 uses: 1. Metastore side authorization 2. Client side authorization I have some questions because after enabling client side authorization by adding below configurations in hive-site.xml on client side, I got Nul

Fwd: Question on ORC file stripe size.

2014-12-02 Thread Jim Green
Hi Team, I am creating this table: CREATE TABLE IF NOT EXISTS orctest2 ( id string, id2 string, id3 string, id4 string ) STORED AS ORC tblproperties ("orc.stripe.size"="1048576","orc.row.index.stride"="”); The stripe size is set to 1MB. After loading data, the table file is about 60MB: -rwxr-