Hi,
Currently I am facing an issue with Hive/HBase integration.
Exception in thread "main" java.lang.NoSuchMethodError:
org.apache.hadoop.util.Shell.getGROUPS_COMMAND()[Ljava/lang/String;
StackTrace:
Exception in thread "main" java.lang.NoSuchMethodError:
org.apache.hadoop.util.Shell.getGROUP
I have jobs that sample (or generate) a small amount of data from a
large table. At the end, I get about 3000 or more files of roughly
1 KB each, which becomes a nuisance. How can I make Hive do another
pass to merge the output? I have the following settings:
hive.merge.mapfiles=true
hive.merge.ma
Write a UDF in Java that generates a UUID,
something like this:
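import java.util.UUID;
import org.apache.hadoop.hive.ql.exec.Description;
import org.apache.hadoop.hive.ql.exec.UDF;
import org.apache.hadoop.io.Text;

@Description(name = "uuid", value = "_FUNC_() - Generate a unique uuid")
@SuppressWarnings({"UnusedDeclaration"})
public class Uuid extends UDF {
    // Returns a fresh random UUID as a Hive-compatible Text value.
    public Text evaluate() {
        return new Text(UUID.randomUUID().toString());
    }
}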
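To check what such a UDF would hand back without a Hive classpath, here is a minimal sketch that uses only java.util.UUID. The class name UuidSketch and the method newUuidString are made up for illustration and are not part of the original post; the method body is the same expression that evaluate() wraps in a Text.

```java
import java.util.UUID;

// Hypothetical helper class, not part of Hive or of the original post.
public class UuidSketch {

    // Same expression the UDF's evaluate() wraps in a Text object.
    public static String newUuidString() {
        return UUID.randomUUID().toString();
    }

    public static void main(String[] args) {
        String first = newUuidString();
        String second = newUuidString();

        // Canonical form: 32 hex digits plus 4 hyphens, 36 characters in total.
        System.out.println(first + " (length " + first.length() + ")");

        // randomUUID() produces version 4 (randomly generated) UUIDs.
        System.out.println("version: " + UUID.fromString(first).version());

        // Two calls are expected to give different values.
        System.out.println("distinct: " + !first.equals(second));
    }
}
```

Since every call returns a different value, it may be worth marking the UDF as non-deterministic (Hive has a @UDFType(deterministic = false) annotation for this), although the original post does not do so.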
On T