Hi, I have a large set of text files. I have created a Hive table pointing to each of these text files. I am looking to compress the files to save storage.
1) How should I compress the file to use LZO compression. 2) How to know whether LZO compression utility (command ?) is installed on the Hadoop cluster? 3) Should the Hive table definition be modified as a Sequence File if I compress the text file? Please advise. Thanks, Raj