You might want to look at oozie http://yahoo.github.com/oozie/ . The
trunk version doesn't support hive actions (yet I think). But Cloudera
packages a version that has hive support.
> I need to join data from these 3 tables to generate daily statistics but
> obviously, I do not want to reprocess e
Hi,
Are there any good scheduling tools out there suitable for the dependencies
you may get in Hive?
Specific example I have right now:
- 2 tables with event logs from different sources
- 1 table with some additional data from a different source, but this data
is daily summary
None of this data