> I added a compact index to this table as below on 5 columns No, those are not what I recommend in this scenario.
You made a statement that the table was sorted and it wasn't. >>Table is sorted in the order of prod_id, cust_id,time_id, channel_id and >> promo_id. It has 22 million rows. >> Not it is not. ... >> Add at least >> >> SORTED BY (PROD_ID) >> >> if what you care about is scanning performance with the ORC indexes. How does Hive perform after you add that to the DDL and reload data? Cheers, Gopal