RE: Specifying a double precision in HiveQL

2011-02-24 Thread Paul Yang
Hacky, but maybe something like select concat( cast(num as int), '.' , cast(abs(num)*100 as int) % 100) from (select 1.234 as num from src limit 1) a; ? -Original Message- From: Aurora Skarra-Gallagher [mailto:aur...@yahoo-inc.com] Sent: Thursday, February 24, 2011 11:31 AM To: user@hi

RE: OutOfMemory errors on joining 2 large tables.

2011-02-22 Thread Paul Yang
Have you taken a look at the distribution of your join keys? If there are a couple join keys that occur much more frequently than others, the reducers handling those keys will have more load and may be subject to an OOM. -Original Message- From: Bennie Schut [mailto:bsc...@ebuddy.com] S

RE: Only a single expression in the SELECT clause is supported with UDTF's

2010-11-08 Thread Paul Yang
In your original query, I think if you put parenthesis around p,k it should have worked: select taxonDensityUDTF(kingdom_concept_id, phylum_concept_id) as (p,k) ... -Original Message- From: Tim Robertson [mailto:timrobertson...@gmail.com] Sent: Monday, November 08, 2010 5:53 AM To: user

RE: Search the newest partition of one table in view

2010-10-20 Thread Paul Yang
I don't think it is possible to use just a view to get that effect. If you're generating the query programmatically, it'd be possible to either use the Thrift service or process the output of 'show partitions' to get the latest partition date. From: lei liu [mailto:liulei...@gmail.com] Sent: We

RE: USING .. AS column names

2010-10-13 Thread Paul Yang
For insert overwrite, the column names don't matter - the order of the columns dictate how they are inserted into the table so the behavior is not specific to the transform clause. Also, when you use AS with transform, you're just assigning column aliases to the output of the transform. For exa