Hi,
Would there be a performance difference between query1 and query2?
*query1:*
cc = LOAD '$TBL_CLEARCODE'
    USING org.apache.pig.backend.hadoop.hbase.HBaseStorage(
        'cf_data:cq_description cf_data:cq_category cf_data:cq_dqtimestamp cf_data:cq_checkarray',
        '-loadKey true')
    AS (key, description, category, ActiveStagTmStamp, transformArray);
*query2:*
cc = LOAD '$TBL_CLEARCODE'
    USING org.apache.pig.backend.hadoop.hbase.HBaseStorage(
        'cf_data:cq_description cf_data:cq_category cf_data:cq_dqtimestamp cf_data:cq_checkarray',
        '-loadKey true -maxTimestamp $CORR_DATE')
    AS (key, description, category, ActiveStagTmStamp, transformArray);
The only difference between the two queries is the -maxTimestamp
parameter in query2.
Regards,
Krishna