Hi,
I'm running a Hive query on my EMR cluster, and running into this issue
which is causing the task to fail:
java.lang.RuntimeException:
org.apache.hadoop.hive.ql.metadata.HiveException: Hive Runtime Error while
processing row [Error getting row data with exception
java.lang.ArrayIndexOutOfBound
Hi,
I'm trying to use the "cluster by" statement in Hive to write a query
like this:
FROM (SELECT * FROM attribute_table
CLUSTER BY id, name, value, amount) map_output
INSERT OVERWRITE TABLE attributed_table
SELECT TRANSFORM (map_output.id,...)
USING 'python2.7 data_attribution.py'
AS id, name,