Hello team,

We are extracting data from Netezza and loading it into Hive tables. While 
loading the data, we use the hash function to mask a few PII columns for 
security reasons.

One such column is acct_num, stored as bigint in Netezza. We store it in a 
string column after converting the hash of acct_num to hex format.

The issue is that the same hashed value is generated for distinct acct_num 
values in most of the records. Are there any known issues with the algorithm 
that the hash function uses in Hive?
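
To illustrate the kind of collision we mean, here is a minimal Python sketch. 
It assumes the hash folds a 64-bit value into 32 bits by XOR-ing the high and 
low halves, similar to Java's Long.hashCode. We have not confirmed that this 
is what Hive does internally, and the names fold32 and mask_hex are 
hypothetical helpers, not Hive functions.

```python
MASK_32 = 0xFFFFFFFF


def fold32(value: int) -> int:
    """Fold a non-negative 64-bit integer into 32 bits.

    ASSUMPTION: mirrors the XOR-of-halves scheme of Java's Long.hashCode,
    shown here as an unsigned 32-bit result. Not confirmed to be Hive's
    actual algorithm.
    """
    return (value ^ (value >> 32)) & MASK_32


def mask_hex(acct_num: int) -> str:
    """Hypothetical helper: uppercase hex string of the folded hash."""
    return format(fold32(acct_num), "X")


if __name__ == "__main__":
    a = 1
    b = 1 << 32  # 4294967296, a different bigint value
    # Under the assumed scheme, both distinct inputs produce the same hex string.
    print(a, mask_hex(a))
    print(b, mask_hex(b))
```

Under this assumption, any two bigint values whose high and low halves XOR 
to the same 32 bits would get the same masked string, since a 32-bit result 
cannot be unique across 64-bit inputs.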

Thanks,
Murali
