> I added a compact index to this table as below on 5 columns

No, those are not what I recommend in this scenario.

You stated that the table was sorted, but it wasn't.

>>Table is sorted in the order of prod_id, cust_id,time_id, channel_id and
>> promo_id. It has 22 million rows.

>> No, it is not.
...
>> Add at least
>>
>> SORTED BY (PROD_ID)
>>
>> if what you care about is scanning performance with the ORC indexes.

How does Hive perform after you add that to the DDL and reload data?
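
For reference, here is a minimal sketch of what that DDL change and reload could look like. The table names (sales_sorted, sales), the column types, the bucket count of 16 and the prod_id value in the test query are all assumptions for illustration, not taken from your schema. As far as I know, Hive only accepts SORTED BY as part of a CLUSTERED BY ... INTO n BUCKETS clause, so the sketch buckets on the same column.

```sql
-- Hypothetical target table: names, types and bucket count are assumptions.
CREATE TABLE sales_sorted (
  prod_id    BIGINT,
  cust_id    BIGINT,
  time_id    TIMESTAMP,
  channel_id BIGINT,
  promo_id   BIGINT
)
CLUSTERED BY (prod_id) SORTED BY (prod_id) INTO 16 BUCKETS
STORED AS ORC;

-- Needed on Hive 1.x so the insert honours the bucketing and sort order.
-- On Hive 2.0+ these properties were removed (always enforced); drop both lines.
SET hive.enforce.bucketing=true;
SET hive.enforce.sorting=true;

-- Reload the data from the existing (unsorted) table.
INSERT OVERWRITE TABLE sales_sorted
SELECT prod_id, cust_id, time_id, channel_id, promo_id
FROM sales;

-- Let the ORC reader use its min/max row-group indexes for the filter.
SET hive.optimize.index.filter=true;
SELECT COUNT(*) FROM sales_sorted WHERE prod_id = 13;
```

With the rows sorted on prod_id, each ORC stripe and row group covers a narrow range of values, so a filter on prod_id should let the reader skip most of the file instead of scanning all 22 million rows.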

Cheers,
Gopal

