.com
From: Igor Tatarinov mailto:i...@decide.com>>
Reply-To: "user@hive.apache.org<mailto:user@hive.apache.org>"
mailto:user@hive.apache.org>>
Date: Thursday, January 17, 2013 1:29 PM
To: "user@hive.apache.org<mailto:user@hive.apache.org>"
mailto:user
Here is how Twitter does it with Pig:
http://www.umiacs.umd.edu/~jimmylin/publications/Lin_Kolcz_SIGMOD2012.pdf
We use a similar approach and I think that Pig, being somewhat lower-level
with better support of nested objects, is a better tool than Hive. It
should be possible to do something simila
How to run machine learning algorithms (whatever ML algorithms) directly in
Hive? assume the input and output already stored as Hive tables.
ps: I know mahout is available there, but would prefer run machine learning
algorithms directly in Hive
many thanks,