With respect to your steps: 1. It's a MySQL database. 2. Yes. 3. I'm not sure whether I need it. Do you think I need it? If so, why? 4. Sqoop will get data from MySQL into Hadoop. 5. Correct. 6. I want to use Hive on Spark for real-time data processing on Hadoop.
Daily/periodic changes from the RDBMS to Hive will be handled through Oozie and Sqoop. As per my research, I can write a periodic Sqoop/Pig job to be executed by Oozie. I hope that will work. All I want to do is run Hive on Spark on Ubuntu. Could you please tell me the configuration steps?
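
To make the periodic import idea concrete, here is a minimal sketch of how the Sqoop command for one incremental run might be assembled. It is only an illustration under assumptions: the helper name `build_sqoop_import`, the JDBC URL, user, table, paths, and check column are all hypothetical, and append-mode incremental import is just one possible choice. Only the Sqoop options themselves (`--connect`, `--username`, `--password-file`, `--table`, `--target-dir`, `--incremental`, `--check-column`, `--last-value`) are standard Sqoop 1 import options.

```python
import shlex


def build_sqoop_import(jdbc_url, username, password_file, table,
                       target_dir, check_column, last_value):
    """Return the argv list for an append-mode incremental Sqoop import.

    Hypothetical helper: all argument values are supplied by the caller;
    nothing here is taken from a real cluster configuration.
    """
    return [
        "sqoop", "import",
        "--connect", jdbc_url,             # e.g. a MySQL JDBC URL
        "--username", username,
        "--password-file", password_file,  # avoids an interactive prompt
        "--table", table,
        "--target-dir", target_dir,        # HDFS directory for imported rows
        "--incremental", "append",         # only rows beyond --last-value
        "--check-column", check_column,
        "--last-value", str(last_value),
    ]


if __name__ == "__main__":
    # Placeholder values, for illustration only.
    argv = build_sqoop_import(
        jdbc_url="jdbc:mysql://dbhost/shop",
        username="etl",
        password_file="/user/etl/.mysql-password",
        table="orders",
        target_dir="/data/orders",
        check_column="id",
        last_value=0,
    )
    # Print a shell-safe command line that a scheduler could run.
    print(shlex.join(argv))
```

A scheduler such as Oozie would then run the resulting command on each cycle, with `last_value` carried over from the previous run.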