Hive and XML

2016-05-22 Thread Maciek
fill out this brief survey https://mamont914.typeform.com/to/Ewo4aD -- Thank you, Kind Regards ~Maciek

Re: Bucketing in Hive

2016-01-26 Thread Maciek
of optimisations. Pruning is a good example - available for Partitioned but not for Bucketed tables. You can track the full list here: https://issues.apache.org/jira/browse/HIVE-9523 Thank you, Kind Regards ~Maciek On 26 January 2016 at 21:44, Mich Talebzadeh wrote: > Hi, > > > > T

Re: Default schema

2015-04-14 Thread Maciek
"hive --database > **" > > --Bala G. > > On Tue, Apr 14, 2015 at 1:30 PM, Maciek wrote: > >> Is it possible to customize the schema user logs on to? >> I was thinking of setting some bash environment variable >> or setting param file (like hive-env.sh, hiverc or hive-site.xml…)? >> > >

Default schema

2015-04-14 Thread Maciek
Is it possible to customize the schema a user logs on to? I was thinking of setting some bash environment variable or setting a param file (like hive-env.sh, hiverc or hive-site.xml…)?

Can CLUSTERED BY/SORTED BY speed up processing aggregations or inter-row calculations?

2014-11-18 Thread Maciek
Can the Hive optimizer take advantage of tables CLUSTERED BY … SORTED BY … when performing aggregations or analytic functions (window aggregations)? If so, how? More details on SO: optimize Hive table storage for subsequent Aggregations and/or Window Analytic Functions

UNBOUNDED PRECEDING AND 1 PRECEDING

2014-11-10 Thread Maciek
Hi, Any reason the ROWS BETWEEN UNBOUNDED PRECEDING AND 1 PRECEDING clause gives me an error? FAILED: SemanticException End of a WindowFrame cannot be UNBOUNDED PRECEDING
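
For readers unfamiliar with the frame in question, here is a minimal Python sketch of what ROWS BETWEEN UNBOUNDED PRECEDING AND 1 PRECEDING is meant to compute for a SUM: each row aggregates every row strictly before it, and the first row has an empty frame. It only illustrates the intended semantics and says nothing about why Hive rejects the clause; `sum_over_preceding` is a hypothetical helper, and the input values are made up.

```python
def sum_over_preceding(values):
    """For each position i, return the sum of values[0:i].

    The first row has no preceding rows, so its frame is empty and the
    result is None (the analogue of SQL NULL for SUM over an empty frame).
    """
    results = []
    running = 0
    for i, value in enumerate(values):
        results.append(running if i > 0 else None)
        running += value  # the current row only counts for later rows
    return results


print(sum_over_preceding([10, 20, 30, 40]))  # [None, 10, 30, 60]
```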

Re: CREATE (PARTITIONED) TABLE AS error

2014-11-10 Thread Maciek
Could I ask any of you who are on Hive v0.14 to check if such CTAS syntax works? Many thanks! On Thu, Nov 6, 2014 at 10:32 AM, Maciek wrote: > What I'm referring to is CTAS statement and as per documentation it > doesn't seem to work (Hive 0.13). > I

Re: CREATE (PARTITIONED) TABLE AS error

2014-11-06 Thread Maciek
that > takes in as first argument, the number of rows to insert N (in this case > N=1) followed by "NK" arguments, where K is the number of columns you have. > > > Best Regards, > Nishant Kelkar > > On Wed, Nov 5, 2014 at 9:29 AM, Maciek wrote: > >> Based

Re: from_unixtime() and epoch definition

2014-11-06 Thread Maciek
timezone option to >> from_unixtime(). >> As you mentioned, Hive doesn't really do any timezone handling, just >> assumes things are in the system's local timezone. It will be a bit of a >> bigger project to add better time zone handling to Hive timestamps. >>

CREATE (PARTITIONED) TABLE AS error

2014-11-05 Thread Maciek
Based on the documentation https://cwiki.apache.org/confluence/display/Hive/DynamicPartitions the following CTAS should work: CREATE TABLE dropme(key int, value string) PARTITIONED BY (yr int, mth int) AS SELECT 2 key, 'val' value, 2014 yr, 5 mth FROM singlerow; but instead it gives me the error:

Re: from_unixtime() and epoch definition

2014-11-05 Thread Maciek
> $ TZ="GMT" date -r 0 > Thu Jan 1 00:00:00 GMT 1970 > > $ TZ="UTC" date -r 0 > Thu Jan 1 00:00:00 UTC 1970 > > $ TZ="Europe/London" date -r 0 > Thu Jan 1 01:00:00 BST 1970 > > $ TZ="Europe/Dublin" date -r 0 > Thu Jan 1

Re: from_unixtime() and epoch definition

2014-11-03 Thread Maciek
I'd consider this behaviour as a bug and would like to raise it as such. Is there anyone to confirm it's the same on Hive 0.14? On Fri, Oct 31, 2014 at 3:41 PM, Maciek wrote: > Actually confirmed! It's down to the timezone settings > I've moved temporarily server/cl

Re: from_unixtime() and epoch definition

2014-10-31 Thread Maciek
upport for TimeZones? On Fri, Oct 31, 2014 at 3:25 PM, Maciek wrote: > Thought about that myself based on my prior (bad) experience when I tried > working with timezones in Hive (functionality pretty much doesn't exist) > That shouldn't be the case here though, here's wh

Re: from_unixtime() and epoch definition

2014-10-31 Thread Maciek
ck by changing the timezone to UTC on your > machine? > >> >> On Fri, Oct 31, 2014 at 12:00 PM, Maciek wrote: >> >>> Any reason why >>> >>> select from_unixtime(0) t0 FROM … >>> >>> gives >>> >>> 1970-01-01 01:00:00 >>> >>> ? >>> >>> By all available definitions (epoch, from_unixtime etc..) I would expect >>> it to be 1970-01-01 00:00:00…? >>> >>

Re: from_unixtime() and epoch definition

2014-10-31 Thread Maciek
meant 1970-01-01 00:00:00 of course… On Fri, Oct 31, 2014 at 12:00 PM, Maciek wrote: > Any reason why > > select from_unixtime(0) t0 FROM … > > gives > > 1970-01-01 01:00:00 > > ? > > By all available definitions (epoch, from_unixtime etc..) I would expe

from_unixtime() and epoch definition

2014-10-31 Thread Maciek
Any reason why select from_unixtime(0) t0 FROM … gives 1970-01-01 01:00:00 ? By all available definitions (epoch, from_unixtime etc..) I would expect it to be 1970-01-01 01:00:00…?
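
The replies in this thread trace the result to timezone settings. As a rough illustration outside Hive, this Python sketch formats the same epoch second 0 once in UTC and once in a zone one hour ahead. The fixed +01:00 offset is an assumption standing in for a server whose local zone was at UTC+1 in January 1970 (as the Europe/London `date -r 0` output quoted in the thread suggests), and `render_epoch` is a hypothetical helper, not a Hive function.

```python
from datetime import datetime, timedelta, timezone


def render_epoch(seconds, tz):
    """Format a Unix timestamp as 'YYYY-MM-DD HH:MM:SS' in the given zone."""
    return datetime.fromtimestamp(seconds, tz).strftime("%Y-%m-%d %H:%M:%S")


utc = timezone.utc
plus_one = timezone(timedelta(hours=1))  # assumed stand-in for the server's local zone

print(render_epoch(0, utc))       # 1970-01-01 00:00:00
print(render_epoch(0, plus_one))  # 1970-01-01 01:00:00
```

The instant is the same in both cases; only the zone used for rendering differs, which matches the explanation given in the replies that Hive assumes the system's local timezone.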

Re: Hive splits/adds rows when outputting dataset with new lines

2014-10-07 Thread Maciek
This …works! Quite surprised: as per the steps I outlined, the issue manifested even without CTAS (regular SELECT). I still don't see how that could be related …or are those two separate issues? Also, maybe you know - is there any way to make it work for TextFile? Thank you, Maciek On Tue,

Hive splits/adds rows when outputting dataset with new lines

2014-10-06 Thread Maciek
e> select * from corrupted; OK world NULL of NULL hostels worldofhostels Time taken: 0.029 seconds, Fetched: 3 row(s) Apparently, the same happens - the new table is split into multiple rows, and the columns following the one in question (like wordsmerged) become NULLs. Am I doing something wrong here? Regards, Maciek
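
A minimal Python sketch of how a line-oriented text format can produce exactly this kind of output. It is not Hive code: it assumes rows are terminated by a newline and fields are separated by Ctrl-A (`\x01`, Hive's default TextFile field delimiter), and the helpers `write_rows` and `read_rows` are hypothetical. A value containing newlines is written as-is, so a reader that splits on newlines sees three rows, with the missing trailing column filled with None (NULL).

```python
FIELD_SEP = "\x01"  # assumption: Hive's default TextFile field delimiter (Ctrl-A)
ROW_SEP = "\n"      # rows are terminated by a newline


def write_rows(rows):
    """Serialise rows naively, without escaping newlines inside values."""
    return "".join(FIELD_SEP.join(row) + ROW_SEP for row in rows)


def read_rows(text, num_cols):
    """Split on the row separator first, then on the field separator.

    Columns missing from a short line are filled with None.
    """
    result = []
    for line in text.split(ROW_SEP):
        if line == "":
            continue  # skip the empty piece after the final row terminator
        fields = line.split(FIELD_SEP)[:num_cols]
        fields += [None] * (num_cols - len(fields))
        result.append(fields)
    return result


original = [["world\nof\nhostels", "worldofhostels"]]
for row in read_rows(write_rows(original), num_cols=2):
    print(row)
# ['world', None]
# ['of', None]
# ['hostels', 'worldofhostels']
```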