RE: Last value for a column

2012-01-27 Thread Steven Wong
Other than writing a custom UDAF or TRANSFORM script, a somewhat ugly way is something like: SELECT user_id, split(max(concat(time, '_', colour)), '_')[1] FROM T GROUP BY user_id From: mdefoinplatel@orange.com [mailto:mdefoinplatel@orange.com] Sent: Thursday, January 26, 2012 3:24 AM To

Re: Sequence generated Id in Hive

2012-01-27 Thread Mapred Learn
Hive 0.8.0 has row_sequence UDF but it generates unique seq ids only per mapper, not across the job. Sent from my iPhone On Jan 27, 2012, at 8:37 AM, Anson Abraham wrote: > Does Hive support automated sequence Id generation, or does a "udf" have to > be created per each object that is created

Re: Last value for a column

2012-01-27 Thread ameet chaubal
Just recently, a new way of doing windowing functionality was posted at: https://github.com/hbutani/SQLWindowing This is quite comprehensive and includes about 16 functions. This is an approach to solve HIVE-896 which is the issue about Lag/Lead etc functions. There is a detailed document about

RE: modify Hive history file location

2012-01-27 Thread Black, Brent
That fixed it. Thanks! Brent Black From: Chinna Rao Lalam [mailto:chinna...@huawei.com] Sent: Friday, January 27, 2012 12:52 AM To: user@hive.apache.org Subject: RE: modify Hive history file location Hi, Change the value of this property "hive.querylog.location" . By default value is "/t

jobtracker url(Critical)

2012-01-27 Thread hadoop hive
Hey folks, i m facing a problem, with job Tracker URL, actually i added a node to the cluster and after sometime i restart the cluster, then i found that my job tracker is showing recent added node in *nodes * but rest of nodes are not available not even in *blacklist. * * * can any1 have any i

Re: Reading compressed files (external tables) from hive using DeprecatedLzoTextInputFormat

2012-01-27 Thread alo alt
SET hive.exec.compress.output=true; SET io.seqfile.compression.type=BLOCK; SET mapred.output.compression.codec = com.hadoop.compression.lzo.LzopCodec; CREATE EXTERNAL TABLE tmp_hive(domain string,url string) ROW FORMAT DELIMITED FIELDS TERMINATED BY '\t' STORED AS INPUTFORMAT "com.hadoop.mapre

Re: rainstor

2012-01-27 Thread alo alt
sounds interesting, but more as a website I didn't found.. - Alex -- Alexander Lorenz http://mapredit.blogspot.com On Jan 26, 2012, at 9:49 AM, Dalia Sobhy wrote: > > www.rainstor.com > Subject: Re: rainstor > From: swil...@monetate.com > Date: Wed, 25 Jan 2012 22:14:12 -0500 > To: user@hive

Re: Map Red SequenceFile output to Hive table

2012-01-27 Thread alo alt
How you described the table? And how do you load them into? - Alex -- Alexander Lorenz http://mapredit.blogspot.com On Jan 26, 2012, at 11:00 PM, rk vishu wrote: > Hello All, > > I have a mapred job that does transfermation and outputs to a compresses > SequenceFile (by using org.apache.had

RE: modify Hive history file location

2012-01-27 Thread Chinna Rao Lalam
Hi, Change the value of this property "hive.querylog.location" . By default value is "/tmp/{user.name}" Hope It Helps, Chinna Rao Lalam From: Black, Brent [brent.bl...@disney.com] Sent: Friday, January 27, 2012 7:40 AM To: 'user@hive.apache.org' Subject: m

RE: Problem with Hive/HBase integration

2012-01-27 Thread Chinna Rao Lalam
Hi, In the below table space is the problem in columns.mapping WITH SERDEPROPERTIES("hbase.columns.mapping" = "cf_cdr:caller_name, cf_cdr:caller_number") Remove the space between 2 columns like WITH SERDEPROPERTIES("hbase.columns.mapping" = "cf_cdr:caller_name,cf_cdr:caller_number")