sebbASF opened a new issue #125:
URL: https://github.com/apache/whimsy/issues/125


   ICLAs are currently filed using a filename stem based on the full name.
   
   This approach increasingly suffers from collisions; it also has the 
potential to expose PII.
   
   We need to find a different identifier where the likelihood of collisions is 
very small.
   Unfortunately humans don't have a unique immutable identifier, at least not 
one which is likely to be accessible to us.
   So some other ID needs to be found.
   
   Some possibilities to consider:
   - UUID generated at time of filing
   - hash of the ICLA file
   - email address (canonicalised if necessary to generate a valid file name 
stem)
   
   Assuming two people don't share the same email address, all the above should 
be collision-free for distinct people.
   
   Any others?
   
   Note that whilst the above ids will uniquely identify an ICLA, additional 
ICLAs from the same person will generally have different ids (email may or may 
not be the same). This is also true of the full name used in ICLAs: apart from 
ICLAs which are sent to record a change of name, we sometimes get ICLAs with a 
different spelling of a name, or with changes to the given names.
   
   The current approach for replacement ICLAs is to create a directory and 
store all the ICLAs in the same directory.
   If email address is used, something similar would be needed.
   For the other IDs, the list of files in a directory would need to be 
replaced with a list of IDs in the index.


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: dev-unsubscr...@whimsical.apache.org

For queries about this service, please contact Infrastructure at:
us...@infra.apache.org


Reply via email to