Re: Inconsistent results from INSERT OVERWRITE TABLE

2011-05-11 Thread Tim Spence
Thank you. Are there tools for parsing the Hive logs for errors? If not, can you talk about the strategy used at Facebook to deal with detection and resolution of MR errors? Perhaps I can write a script to identify errors. First I have to solve the mystery of why there are no logs on my hadoop

Re: Inconsistent results from INSERT OVERWRITE TABLE

2011-05-11 Thread Ning Zhang
Hive queries are compiled to different types tasks (MapReduce, MoveTask, etc), so a successful MR task as indicated in the JT doesn't mean the whole query succeeded. So you need to examine the status of the hive query to see if it succeeded or not. You can also check the hive's log file under /

Inconsistent results from INSERT OVERWRITE TABLE

2011-05-11 Thread Tim Spence
I've been using Hive in production for two months now. We're mainly using it for processing server logs, about 1-2GB per day (2-2.5 million requests). Typically we import a day's worth of logs at once. That said, sometimes we decide to tweak a calculated column. When that happens, we modify our

Re: Strategy for Loading Apache Logs

2011-05-11 Thread Ashish Thusoo
you could always have another sub partition under the daily partition. This sub partition could be the timestamp on when you did the load. So when you run the statement you would create a new sub partition within the date partition and in effect you end up doing an append to the Hive partition.

Re: Implementing conditional and control statements in Hive

2011-05-11 Thread Ashish Thusoo
With streaming, UDF or UDTFs you would get almost any kind of control flow you want without having those features implemented in Hive proper. For udf, udaf or udtf you use java for implementation. In streaming you can use any language of your choice. Not sure if this addresses stuff? Ashish On

Re: Question about how to compile Hive project off-line?

2011-05-11 Thread Alex.Wang
Thank you very much! I am going to use your approach at once. Hope I can get some luck. :-D Best regards. 2011/5/11 Jov : > I do it like this: > > 1.down load the hive src > 2.compile hive in eclipse from my windows,which can connect the net.it > will down load all necessories to src/build dir

Implementing conditional and control statements in Hive

2011-05-11 Thread Jothi A
Hi experts, I got to know that hive has no stored procedures.but this can be done in Hive with the help of streaming and functions.Is there any way to implement *Declare,Conditional handling statements,Control statements* in hive.As of now i'm looking into Hive UDF's and some stored procedures in M

Re: Question about how to compile Hive project off-line?

2011-05-11 Thread Jov
I do it like this: 1.down load the hive src 2.compile hive in eclipse from my windows,which can connect the net.it will down load all necessories to src/build dir 3.in linux,mkdir src/build;copy the src/build/hadoopcore,src/ivy from windows to linux 4.edit the src/build-common.xml,make the target

Question about how to compile Hive project off-line?

2011-05-11 Thread Alex.Wang
Hello everyone. I am a student who begins to do some research on Hive project. I think the first step is to have a successful build of Hive. I have downloaded the Hive source code, uploaded to the Linux machine, unzipped and entered the source directory. Then I run the "ant" command and got a error