Hive/HBase integration issue.

2010-11-17 Thread Vivek Mishra
Hi, I am currently facing an issue with Hive/HBase integration: Exception in thread "main" java.lang.NoSuchMethodError: org.apache.hadoop.util.Shell.getGROUPS_COMMAND()[Ljava/lang/String; StackTrace: Exception in thread "main" java.lang.NoSuchMethodError: org.apache.hadoop.util.Shell.getGROUP
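
A NoSuchMethodError at runtime generally means the class was loaded from a different jar version than the one the caller was compiled against. As a rough diagnostic sketch (not taken from the thread), the helper below uses reflection to report where a class was loaded from and whether it declares a given method. The class name ClasspathProbe and the method probe are hypothetical names chosen for illustration; only the Shell class and method name come from the error message above.

```java
import java.lang.reflect.Method;
import java.security.CodeSource;

public class ClasspathProbe {
    /** Reports where className was loaded from and whether it declares methodName. */
    public static String probe(String className, String methodName) {
        try {
            // Load without running static initializers; we only want metadata.
            Class<?> cls = Class.forName(className, false,
                    ClasspathProbe.class.getClassLoader());
            CodeSource src = cls.getProtectionDomain().getCodeSource();
            String origin = (src == null || src.getLocation() == null)
                    ? "bootstrap/unknown"
                    : src.getLocation().toString();
            boolean found = false;
            for (Method m : cls.getDeclaredMethods()) {
                if (m.getName().equals(methodName)) {
                    found = true;
                    break;
                }
            }
            return className + " from " + origin + ": " + methodName
                    + (found ? " present" : " MISSING");
        } catch (ClassNotFoundException | LinkageError e) {
            return className + ": not on classpath";
        }
    }

    public static void main(String[] args) {
        // Names taken from the error message in the post above.
        System.out.println(probe("org.apache.hadoop.util.Shell", "getGROUPS_COMMAND"));
    }
}
```

Run with the same classpath as the failing Hive job: if the reported jar is not the Hadoop version that Hive and HBase were built against, that mismatch is a likely cause.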

Hive produces very small files despite hive.merge...=true settings

2010-11-17 Thread Leo Alekseyev
I have jobs that sample (or generate) a small amount of data from a large table. At the end, I get, e.g., 3000 or more files of about 1 KB each. This becomes a nuisance. How can I make Hive do another pass to merge the output? I have the following settings: hive.merge.mapfiles=true hive.merge.ma

Re: How to generate global unique ID?

2010-11-17 Thread Eyal
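Write a UDF that generates a UUID in Java, something like this:

    import java.util.UUID;
    import org.apache.hadoop.hive.ql.exec.UDF;
    import org.apache.hadoop.io.Text;

    // The @description annotation also lives in org.apache.hadoop.hive.ql.exec;
    // its exact spelling may differ between Hive releases.
    @description(name = "uuid", value = "_FUNC_() - Generate a unique uuid")
    @SuppressWarnings({"UnusedDeclaration"})
    public class Uuid extends UDF {
        // Returns a fresh random UUID as text on every call.
        public Text evaluate() {
            return new Text(UUID.randomUUID().toString());
        }
    }

On T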
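
The UDF above relies entirely on java.util.UUID.randomUUID(), which returns a random (version 4) UUID. As a minimal sketch that runs without Hive or Hadoop on the classpath, the following shows the shape of the generated IDs; the class name UuidDemo and the method newId are hypothetical names for illustration.

```java
import java.util.HashSet;
import java.util.Set;
import java.util.UUID;

public class UuidDemo {
    /** Same call the UDF makes, returned as a plain String. */
    public static String newId() {
        return UUID.randomUUID().toString();
    }

    public static void main(String[] args) {
        // Generate a batch and count distinct values; collisions are
        // possible in principle but vanishingly unlikely.
        Set<String> seen = new HashSet<>();
        for (int i = 0; i < 1000; i++) {
            seen.add(newId());
        }
        System.out.println(seen.size() + " distinct IDs, for example " + newId());
    }
}
```

Each ID is a 36-character string, generated independently on whichever task calls it, so no coordination between mappers or reducers is needed. Uniqueness is probabilistic rather than guaranteed.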