Hi Hive community
We are collecting huge amounts of data into Amazon S3 using Flume.
In Elastic Mapreduce, we have so far managed to create an external Hive
table on JSON formatted gzipped log files in S3 using a customized
serde. The log files are collected and stored in one single folder wi
son serde reading
from S3. I'm not processing nulls in my serde in any special way because
Hive seems to handle null in the right way when not passed to my own UDF.
Are there anyone out there with ideas or experiences on this issue?
thanks in advance
Søren
/2012 15:43, Edward Capriolo wrote:
There is no null argument. You should handle the null case in your code.
If (arga == null)
Or optionally you could use a generic udf but a regular one should
handle what you are doing.
On Tuesday, December 4, 2012, Søren <mailto:s...@syntonetic.com>&
Right. Thanks for all the help.
It turned out that it did help to check for null in the code. No mystery.
I did try that earlier but the attempt got lost somehow.
Thanks for the advise on using GenericUDF.
cheers
Søren
On 05/12/2012 11:10, Vivek Mishra wrote:
The way UDF works is, you need to