Hi,
I've built a datastore using Hive 7.1 backed by S3 using persistent metadata.
Now that hive 8.1 is available, I'd like to migrate to the new version.
However, I'm having trouble reading tables with the persistent schema. Looking
in the logs, I'm getting stack traces like the following:
20
Hi,
What's the lifecycle of a hive udf. If I call
select MyUDF(field1,field2) from table;
Then MyUDF is instantiated once per mapper, and within each mapper
execute(field1, field2) is called for each reducer? I hope this is the case,
but I can't find anything about this in the documentation
Hi,
I'm capturing data of the form A (1:n) B, which is a fairly standard
item-subitem pattern. In a standard DB, I'd have A and B tables with a foreign
key from B to A.
But since Hive is different -- there's no natural primary key in my data and
joins seem much more expensive -- I'm consideri
Hi,
This is probably a newbie question, but is there any way to get hive to log
which files it goes through as it performs a query?
I'm setting up a partitioned store (EMR, persistent metadata, storage all on
S3) which *appears* to work, but I'd like understand its behavior better,
especially
> e: mgro...@oanda.com
>
> "Best Trading Platform" - World Finance's Forex Awards 2009.
> "The One to Watch" - Treasury Today's Adam Smith Awards 2009.
>
>
> - Original Message -
> From: "Ranjan Bagchi"
> To: user@hive.ap
es:
columns _col0
columns.types bigint
serialization.format 1
TotalFiles: 1
GatherStats: false
MultiFileSpray: false
Stage: Stage-0
Fetch Operator
limit: -1
Time taken: 0.156 seconds
On Dec 15, 2011, a
Hi,
I'm experiencing the following:
I've a file on s3 -- s3n://my.bucket/hive/ranjan_test. It's got fields
(separated by \001) and records (separated by \n).
I want it to be accessible on hive, the ddl is:
CREATE EXTERNAL TABLE IF NOT EXISTS ranjan_test (
ip_address string,
num_counted int
)