Hello team, We are extracting data from netezza and loading into hive tables. While loading data, we are using hash function to mask few PII data for security reasons.
One such column is acct_num stored as bigint in netezza, which we are storing in a string column after converting hash of that acct_num to a hex format. Now the issue is we found same value is generated for distinct acct_num in most of the records. So any known issues with the algorithm that hash function uses in hive? Thanks, Murali Sent from my iPhone