RE: Difference between ORC and RC files

2015-12-21 Thread Mich Talebzadeh
From: Ashok Kumar [mailto:ashok34...@yahoo.com]
Sent: 21 December 2015 19:18
To: user@hive.apache.org
Subject: Re: Difference between ORC and RC files

Re: Difference between ORC and RC files

2015-12-21 Thread Ashok Kumar
Many thanks, Sir. Very useful. Kindly elaborate on why RC files do not have these capabilities. As I understand it, they are Row Columnar files. Am I correct to assume that an ORC file is basically an RC file with more optimisation? Are RC and ORC files designed for columnar format similar to the way a colum

Re: Difference between ORC and RC files

2015-12-21 Thread Alan Gates
ORC offers a number of features not available in RC files:
* Better encoding of data. Integer values are run length encoded. Strings and dates are stored in a dictionary (and the resulting pointers then run length encoded).
* Internal indexes and statistics on the data. This allows for more
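
To make the encoding point above concrete, here is a rough Python sketch of the two ideas mentioned: run-length encoding of integers, and dictionary encoding of strings with the resulting ids then run-length encoded. This is only an illustration of the general technique, not ORC's actual on-disk format or writer code. The function names and the sample column are hypothetical.

```python
from itertools import groupby


def run_length_encode(values):
    """Collapse consecutive repeats into (value, count) pairs.

    Hypothetical helper for illustration; not ORC's real encoder.
    """
    return [(value, len(list(group))) for value, group in groupby(values)]


def dictionary_encode(strings):
    """Replace each string with an integer id into a dictionary of distinct values.

    Ids are assigned in first-seen order. Hypothetical helper for illustration.
    """
    dictionary = {}
    ids = []
    for s in strings:
        # A new string gets the next free id; a repeated string reuses its id.
        ids.append(dictionary.setdefault(s, len(dictionary)))
    return list(dictionary), ids


# Made-up sample column, assumed to be sorted or clustered by country.
country_column = ["UK", "UK", "UK", "US", "US", "UK"]

distinct_values, pointer_ids = dictionary_encode(country_column)
encoded_pointers = run_length_encode(pointer_ids)

print(distinct_values)   # ['UK', 'US']
print(pointer_ids)       # [0, 0, 0, 1, 1, 0]
print(encoded_pointers)  # [(0, 3), (1, 2), (0, 1)]
```

In this toy case six strings shrink to a two-entry dictionary plus three (id, count) pairs. The benefit depends on how repetitive and how clustered the column values are.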