Hive as I use it is particularly useful for getting data out of relational tables and more importantly query that data using HiveQL (a variation of transact sql)
. If your data is in binary format and assuming that you manage to store it in HDFS, how are you intending to access the data. At the consumer level what tools are you going to use? Do you a propriety tool with the correct drivers to access the data? HTH Mich Talebzadeh http://talebzadehmich.wordpress.com Publications due shortly: Creating in-memory Data Grid for Trading Systems with Oracle TimesTen and Coherence Cache NOTE: The information in this email is proprietary and confidential. This message is for the designated recipient only, if you are not the intended recipient, you should destroy it immediately. Any information in this message shall not be understood as given or endorsed by Peridale Ltd, its subsidiaries or their employees, unless expressly so stated. It is the responsibility of the recipient to ensure that this email is virus free, therefore neither Peridale Ltd, its subsidiaries nor their employees accept any responsibility. From: karthik maddala [mailto:karthikmaddal...@gmail.com] Sent: 13 March 2015 15:56 To: user@hive.apache.org Subject: Which SerDe for Custom Binary Data. I want to set up a DW based on Hive. However, my data does not come as handy csv files but as binary files in a proprietary format. The binary file consists of serialized data using C language. Could you please suggest which input format to be used and how to write a custom SerDe for the above mentioned data. Thanks, Karthik Maddala