Hello all, I'm using Hive to do mashups of data that is currently on HDFS and data is that is on a MySQL database... I'm considering this scenario instead of bringing the data from MySQL to HDFS because it is a matter verifying the relating data from the query to the data in the external data storage (mysql). This external data is has a fixed size measured in Megabytes and provide indexes for calculations...
What's the best approach? Should I import the external MySQL tables into Hive (probably using sqoop) and provide the needed joins/passes or should I just use a Map and access the needed data using JDBC from a custom Map? Thanks Marcello