RE: Possible Bug: to_date("2015-01-15") returns a string

2016-06-30 Thread Ryan Harris
FWIW, the wiki states that the function returns a string https://cwiki.apache.org/confluence/display/Hive/LanguageManual+UDFhttps://cwiki.apache.org/confluence/display/Hive/LanguageManual+UDF From: Long, Andrew [loand...@amazon.com] Sent: Thursday, June 30, 2016 5:31 PM To: user@hive.apache.org Su

Re: Hash table in map join - Hive

2016-06-30 Thread Gopal Vijayaraghavan
> But, I got a comment from the author that, the patch wouldn't affect -- >hive.tez.auto.reducer.parallelism=true. > Am I missing something? No, I've linked to the wrong JIRA :( Cheers, Gopal

Re: Hash table in map join - Hive

2016-06-30 Thread Lalitha MV
Also, a couple of follow up questions: 1. The grace hash has to reload/ rebuild the hash table for a new split, only if it has spilled the hash table because of lack of memory space right? How does the regular hash join handle the case when the hash table cannot fit into memory? Does it create

Re: Hash table in map join - Hive

2016-06-30 Thread Lalitha MV
Hi, I was following this thread. I tried adding the patch of the jira manually ( https://issues.apache.org/jira/browse/TEZ-3287 ) [referenced in the above reply for auto reducer optimization in shuffle hash join case]. I added it to 0.8.3 while the patch was for the master. But, I got a comment fr

Re: Possible Bug: to_date("2015-01-15") returns a string

2016-06-30 Thread Gopal Vijayaraghavan
> I ran into this unusual behavior while converting a date string into a >date. If you're on Hive-1.x, this couldn't be fixed due to backwards compatible requirements. If I remember correctly, to_date() pre-dates the Date type in Hive. Marked incompatible for backport -

Possible Bug: to_date("2015-01-15") returns a string

2016-06-30 Thread Long, Andrew
Hello Everyone, I ran into this unusual behavior while converting a date string into a date. I was surprised to find out that to_date will occasionally return a string. Does this make sense? Cheers Andrew hive> CREATE TEMPORARY TABLE datebug > AS SELECT to_date("2015-01-10"); Query ID =

Re: Hash table in map join - Hive

2016-06-30 Thread Gopal Vijayaraghavan
> 1. In the query plan, it still says Map Join Operator (Would have >expected it to be named as Reduce side operator). The "Map" in that case refers really to Map rather the hadoop version. An unambigous name is if it were called the HashJoinOperator. This is one of the optimizations of Tez righ

Upgrading w/Custom Storage Handler

2016-06-30 Thread Lavelle, Shawn
Hello, I'm working on a custom storage handler for Cassandra that was built in HIVE 0.11 and run on SPARK 0.9.2 to HIVE 1.2.1 and SPARK 1.6. I appreciate your help with three questions: 1)Hive 0.11 populated a parameter in the jobConf, TableScanDesc.FILTER_TEXT_CONF_STR, and it appears

RE: Query Performance Issue : Group By and Distinct and load on reducer

2016-06-30 Thread Markovitz, Dudu
1. This works. I’ve recalled that the CAST is needed since FLOOR defaults to FLOAT. select (cast (floor(r*100) as bigint)+ 1) + 100L * (row_number () over (partition by (cast (floor(r*100) as bigint) + 1) order by null) - 1) as ETL_ROW_ID from(select *,rand() as r fro