external table on flume log files in S3

2012-04-24 Thread Søren
Hi Hive community We are collecting huge amounts of data into Amazon S3 using Flume. In Elastic Mapreduce, we have so far managed to create an external Hive table on JSON formatted gzipped log files in S3 using a customized serde. The log files are collected and stored in one single folder wi

handling null argument in custom udf

2012-12-04 Thread Søren
son serde reading from S3. I'm not processing nulls in my serde in any special way because Hive seems to handle null in the right way when not passed to my own UDF. Are there anyone out there with ideas or experiences on this issue? thanks in advance Søren

Re: handling null argument in custom udf

2012-12-04 Thread Søren
/2012 15:43, Edward Capriolo wrote: There is no null argument. You should handle the null case in your code. If (arga == null) Or optionally you could use a generic udf but a regular one should handle what you are doing. On Tuesday, December 4, 2012, Søren <mailto:s...@syntonetic.com>&

Re: handling null argument in custom udf

2012-12-06 Thread Søren
Right. Thanks for all the help. It turned out that it did help to check for null in the code. No mystery. I did try that earlier but the attempt got lost somehow. Thanks for the advise on using GenericUDF. cheers Søren On 05/12/2012 11:10, Vivek Mishra wrote: The way UDF works is, you need to