Re: Exploding Array of String in Hive

2012-08-10 Thread Techy Teck
Basically How can I get each String outside, something like *favorite_result [3] should return *"VAN%2C%20Under%20£1%2C000%2E00%20Ending%20within%202%20days", On Fri, Aug 10, 2012 at 8:04 PM, Techy Teck wrote: > Below is the data in *favorite_result (datatype: array) *column. &

Re: (Get the current date -1) in Hive

2012-08-07 Thread Techy Teck
carla.stae...@nokia.com > To: user@hive.apache.org > Subject: RE: (Get the current date -1) in Hive > Date: Mon, 6 Aug 2012 23:26:46 + > > > In the case here it literally is taking the UNIX timestamp, formatting > it in -mm-dd format and then subtracting the speci

Re: Custom UserDefinedFunction in Hive

2012-08-07 Thread Techy Teck
Then that means I don't need to create that userdefinedfunction right? On Tue, Aug 7, 2012 at 11:32 AM, Jan Dolinár wrote: > Hi Jamal, > > date is standard linux/unix tool, see the manual page: > http://linux.die.net/man/1/date. > > The $(...) tells the shell to execute the command and insert

Re: Some Weird Behavior

2012-08-07 Thread Techy Teck
t;count(*)" and see > if that makes any difference. [1] > > [1] https://issues.apache.org/jira/browse/HIVE-287 > > On Tue, Aug 7, 2012 at 1:07 PM, Techy Teck > wrote: > > > I am running Hive 0.6. > > > > > > > > > > > > On Tue,

Re: Some Weird Behavior

2012-08-07 Thread Techy Teck
I am running Hive 0.6. On Tue, Aug 7, 2012 at 11:04 AM, kulkarni.swar...@gmail.com < kulkarni.swar...@gmail.com> wrote: > What is the hive version that you are using? > > > On Tue, Aug 7, 2012 at 12:57 PM, Techy Teck wrote: > >> I am not sure about the data, but wh

Re: Some Weird Behavior

2012-08-07 Thread Techy Teck
ould be getting NULL record back right? But I am not getting anything back and that is the reason it is making me more confuse. On Tue, Aug 7, 2012 at 10:31 AM, Yue Guan wrote: > Just in case, all Record is null when uid is null? > > On Tue, Aug 7, 2012 at 1:14 PM, Techy Teck >

Some Weird Behavior

2012-08-07 Thread Techy Teck
SELECT count(*) from data_realtime where dt='20120730' and uid is null I get the count as *1509* So that means If I will be doing SELECT * from data_realtime where dt='20120730' and uid is null I should be seeing those records in which uid is null? right? But I get zero record back wit

Re: Caused by: java.io.EOFException

2012-08-06 Thread Techy Teck
build that file mannually? > > > On Tue, Aug 7, 2012 at 6:01 AM, Techy Teck wrote: > >> Yup that makes sense. But when I tried opening that file using- >> >> hadoop fs -text >> /apps/hdmi-technology/b_apdpds/real-time_new/20120731/PDS_HADOOP_REALTIME_EXPORT-part-

Re: Caused by: java.io.EOFException

2012-08-06 Thread Techy Teck
/hdmi-technology/b_apdpds/real-time_new/20120731/PDS_HADOOP_REALTIME_EXPORT-part-3-2 > > > > > Regards > Bejoy KS > > Sent from handheld, please excuse typos. > -- > *From: * Techy Teck > *Date: *Mon, 6 Aug 2012 14:53:57 -0700 > *To: * >

Caused by: java.io.EOFException

2012-08-06 Thread Techy Teck
I am writing a simple query on our hive table and I am getting some exception- select count(*) from table1 where dt='20120731'; java.io.IOException: IO error in map input file hdfs://ares-nn/apps/hdmi-technology/b_apdpds/real-time_new/20120731/PDS_HADOOP_REALTIME_EXPORT-part-3-2 at org

Re: (Get the current date -1) in Hive

2012-08-06 Thread Techy Teck
guess you can use sub_date, but you have to get today by some outside > script. > > On 08/06/2012 02:10 PM, Techy Teck wrote: > > Is there any way to get the current date -1 in Hive means yesterdays > > date always? > > > > > > > >

Add Yesterday's date at runtime

2012-08-06 Thread Techy Teck
I am running *Hive 0.6 *and below is the content I have in *hivetest1.hql*file. set mapred.job.queue.name=hdmi-technology; set mapred.output.compress=true; set mapred.output.compression.type=BLOCK; set mapred.output.compression.codec=org.apache.hadoop.io.compress.LzoCodec; add jar UserDefinedFunct

(Get the current date -1) in Hive

2012-08-06 Thread Techy Teck
Is there any way to get the current date -1 in Hive means yesterdays date always?

Re: Passing date as command line arguments

2012-08-04 Thread Techy Teck
on hive 0.9 and it was working good for me. > > I guess you should recommend an upgrade of hive in your cluster as well. > Hive has gone far too ahead after 0.6 . :) > Regards > Bejoy KS > > Sent from handheld, please excuse typos. > ------ > *Fro

Re: Passing date as command line arguments

2012-08-04 Thread Techy Teck
27;" QUERY_ID="rjamal_2012080414_3719afde-117d-4c6b-a322-0cff6e4b4c5d" QUERY_RET_CODE="0" QUERY_NUM_TASKS="0" TIME="1344065629745"* *Counters plan="{"queryId":"rjamal_2012080414_3719afde-117d-4c6b-a322-0cff6e4b4c5d","queryTy

Re: Passing date as command line arguments

2012-08-04 Thread Techy Teck
ll. 0.9 should be good. > > > Regards > Bejoy KS > > Sent from handheld, please excuse typos. > -- > *From: * Techy Teck > *Date: *Sat, 4 Aug 2012 00:26:44 -0700 > *To: *; > *ReplyTo: * user@hive.apache.org > *Subject: *Re: Passing da

Re: Passing date as command line arguments

2012-08-04 Thread Techy Teck
nd, It is working fine for me without any > issues. > > By de way the data type for 'dt' is String itself right? > Regards > Bejoy KS > > Sent from handheld, please excuse typos. > -- > *From: * Techy Teck > *Date: *Sat, 4 Au

Re: Passing date as command line arguments

2012-08-04 Thread Techy Teck
ssues. > > By de way the data type for 'dt' is String itself right? > Regards > Bejoy KS > > Sent from handheld, please excuse typos. > -- > *From: * Techy Teck > *Date: *Sat, 4 Aug 2012 00:21:37 -0700 > *To: *; > *ReplyTo: * user

Re: Passing date as command line arguments

2012-08-04 Thread Techy Teck
t * from lip_data_quality where dt='20120709'; > Regards > Bejoy KS > > Sent from handheld, please excuse typos. > -- > *From: * Techy Teck > *Date: *Sat, 4 Aug 2012 00:07:07 -0700 > *To: *; Bejoy Ks > *ReplyTo: * user@hive.apache.org > *S

Re: Passing date as command line arguments

2012-08-04 Thread Techy Teck
t;* *Counters plan="{"queryId":"rjamal_20120804135959_58e245e8-086a-47c2-9ed3-6b952e4a5486","queryType":null,"queryAttributes":{"queryString":"select * from lip_data_quality where dt=20120709"},"queryCo* *unters":"null&

Re: Passing date as command line arguments

2012-08-03 Thread Techy Teck
executed? > > Regards, > Bejoy KS > > -- > *From:* Techy Teck > *To:* user@hive.apache.org > *Cc:* Vijay Dirisala > *Sent:* Saturday, August 4, 2012 12:11 PM > *Subject:* Re: Passing date as command line arguments > > Thanks Vijay for the suggestion. I also trie

Re: Passing date as command line arguments

2012-08-03 Thread Techy Teck
ug 3, 2012 at 11:06 PM, Techy Teck > wrote: > > Can anyone help me out on this? > > > > I am stuck on this and I have no clue what wrong I am doing? > > > > > > > > > > On Fri, Aug 3, 2012 at 3:48 PM, Techy Teck > wrote: > >> > >

Re: Passing date as command line arguments

2012-08-03 Thread Techy Teck
Can anyone help me out on this? I am stuck on this and I have no clue what wrong I am doing? On Fri, Aug 3, 2012 at 3:48 PM, Techy Teck wrote: > I have my below query in *test1.hql file. *I am trying to pass the date * > (dt)* as the command line argument. > > >

Re: Passing date as command line arguments

2012-08-03 Thread Techy Teck
I tried that. But it's not working for me. I didn't got any result. Any other suggestion? On Fri, Aug 3, 2012 at 7:41 PM, Matt Tucker wrote: > Yes, that should do the trick for you. > > > > On Aug 3, 2012, at 10:17 PM, Techy Teck wrote: > > So you are saying,

Sequence File with Block Compression

2012-08-03 Thread Techy Teck
I need to enable Sequence File with Block Compression data. Below is the table which will be stored as SequenceFile. *create table lip_data_quality* *( buyer_id bigint,* * total_chkout bigint,* * total_errpds bigint* * )* * partitioned by (dt string)* *row format deli

Re: decompress the file that has been compressed in LzoCodec format

2012-08-03 Thread Techy Teck
Thanks Bejoy. That worked. On Fri, Aug 3, 2012 at 12:05 AM, Bejoy Ks wrote: > Hi Techy > > Try using hadoop fs -text > > That should give the output in some readable format. > > Regards, > Bejoy KS > > ------ > *From:* Techy Teck >

decompress the file that has been compressed in LzoCodec format

2012-08-02 Thread Techy Teck
I am using the below command to compress the output in *Block Compression*and I have created my table as a *SequenceFile* *set mapred.output.compress=true;* *set mapred.output.compression.type=BLOCK;* *set mapred.output.compression.codec=org.apache.hadoop.io.compress.LzoCodec; * So after the

Re: schema of hive database

2012-08-02 Thread Techy Teck
ported in 0.6 but I could be >> wrong. >> >> >> On Thu, Aug 2, 2012 at 3:59 PM, Techy Teck wrote: >> >>> And I got this error too- >>> >>> FAILED: Parse Error: line 1:19 mismatched input 'table1' expecting EOF >>> >&

Re: schema of hive database

2012-08-02 Thread Techy Teck
And I got this error too- FAILED: Parse Error: line 1:19 mismatched input 'table1' expecting EOF On Thu, Aug 2, 2012 at 12:44 PM, Techy Teck wrote: > What format you used? I tried like this and it doesn't works for me- > > *describe formatted table1* > > I am r

Re: schema of hive database

2012-08-02 Thread Techy Teck
What format you used? I tried like this and it doesn't works for me- *describe formatted table1* I am running Hive 0.6. Anything else I need to do before running this query? On Thu, Aug 2, 2012 at 12:13 PM, Igor Tatarinov wrote: > Try > describe formatted > > igor > decide.com > > On Thu,

FAILED: Execution Error, return code 1 from org.apache.hadoop.hive.ql.exec.MoveTask

2012-08-01 Thread Techy Teck
I am trying to load data in to the date partition, so my data got succesfully loaded for 20120709 but when I tried to load the data for *20120710, * then I am seeing the below exception. Can anyone suggest me why is it happening like this? *Loading data to table data_quality partition (ds=2012071

Efficiently Store data in Hive

2012-08-01 Thread Techy Teck
How can I efficiently store data in Hive and also store and retrieve compressed data in hive? Currently I am storing it as a TextFile. I was going through Bejoy article ( http://kickstarthadoop.blogspot.com/2011/10/how-to-efficiently-store-data-in-hive.html) and I found that LZO compression will

Difference between storing data as a TextFile and SequenceFile

2012-08-01 Thread Techy Teck
What is the difference between storing the data as a TextFile and SequenceFile? And which will be faster while doing Hive queries. I am creating a table like this- create table quality ( id bigint, total_chkout bigint, total_errpds bigint ) partitioned by (ds string) row format delimited fi

Re: count the characters in a string

2012-07-27 Thread Techy Teck
t > https://cwiki.apache.org/confluence/display/Hive/LanguageManual. You can > also read > https://cwiki.apache.org/confluence/display/Hive/LanguageManual+UDF, it > lists a lot of functions that are possible in hive. > > On Fri, Jul 27, 2012 at 4:19 PM, Techy Teck wrote: > >>

count the characters in a string

2012-07-27 Thread Techy Teck
I have a column in Table1 named as - *character *with String datatype. I want to find all the records from the tables if total words in *character*column is less than 32 characters. Something like below. select * from Table1 where *count_characters_in_character_column* < 32; Is it possible t

Re: /apache/hive/bin/hive: line 107: [: too many arguments error

2012-07-27 Thread Techy Teck
le with windows. Look though the open jira issues and see if > any of the issues matches this problem. If not feel free to create > one. For now getting hive to run on windows is going to be a little > rocky. > > On 7/27/12, Techy Teck wrote: > > I am trying to execute the below sh

/apache/hive/bin/hive: line 107: [: too many arguments error

2012-07-27 Thread Techy Teck
I am trying to execute the below shell scripts using PLINK on MachineB from MachineA(Windows Machine) *#!/bin/bash* *HIVE_OPTS="$HIVE_OPTS -hiveconf mapred.job.queue.name=hdmi-technology"* *export HIVE_OPTS* *hive -S -e 'SELECT count(*) from testingtable1' > attachment22.txt* * * *Below is the wa

Re: Group By with rollup in HiveQL?

2012-07-26 Thread Techy Teck
> *select id2, SUM(count2),SUM(coalesce(count1, 0)) as count1 * > > *from table2 * > > * left outer join table1 * > > * on id1=id2 * > > * group by id2 * > > * with rollup * > > > > Whenever I am trying to run the above query, I am getting always as > `FAILED: Parse Error: line mismatched inp

Re: Invalid Table Alias

2012-07-22 Thread Techy Teck
Can anyone give some sort of pointers or suggestions why the below query is giving me exception and the same query works fine in SQL server. Below is the SQL fiddle that works in SQL Server- http://sqlfiddle.com/#!3/d87b2/4 On Fri, Jul 20, 2012 at 6:47 PM, Techy Teck wrote: > In hive

Re: Converting timestamp to date format

2012-07-20 Thread Techy Teck
Thanks for the suggestion. I will use BIGINT now. On Fri, Jul 20, 2012 at 3:48 PM, kulkarni.swar...@gmail.com < kulkarni.swar...@gmail.com> wrote: > BIGINT is 8 bytes whereas INT is 4 bytes. Timestamps are usually of "long" > type. To avoid loss of precision, I would recommend BIGINT. > > > On

Re: Invalid Table Alias

2012-07-20 Thread Techy Teck
In hive we cannot calculate the difference between dates in minutes? On Fri, Jul 20, 2012 at 6:06 PM, Techy Teck wrote: > Whenever I am running the below query > > SELECT TT.BUYER_ID , COUNT(*) FROM > (SELECT testingtable1.buyer_id, testingtable1.item_id, > testingtable1.cr

Re: Both Left and Right Aliases Encountered in Join

2012-07-20 Thread Techy Teck
t from handheld, please excuse typos. > ------ > *From: * Techy Teck > *Date: *Fri, 20 Jul 2012 15:44:36 -0700 > *To: * > *ReplyTo: * user@hive.apache.org > *Subject: *Both Left and Right Aliases Encountered in Join > > I have a join query on which I

Both Left and Right Aliases Encountered in Join

2012-07-20 Thread Techy Teck
I have a join query on which I am joining like this below. And it is working fine ON (table1.item = prod.product *AND* table1.buyer = prod.user *AND* UNIX_TIMESTAMP(table1.tt_time) = prod.timestamps) But when I tried to modify the above same thing. The thing that is in red color will work or not