sebbASF opened a new issue #125: URL: https://github.com/apache/whimsy/issues/125
ICLAs are currently filed using a filename stem based on the full name. This approach increasingly suffers from collisions; it also has the potential to expose PII. We need to find a different identifier where the likelihood of collisions is very small. Unfortunately humans don't have a unique immutable identifier, at least not one which is likely to be accessible to us. So some other ID needs to be found. Some possibilities to consider: - UUID generated at time of filing - hash of the ICLA file - email address (canonicalised if necessary to generate a valid file name stem) Assuming two people don't share the same email address, all the above should be collision-free for distinct people. Any others? Note that whilst the above ids will uniquely identify an ICLA, additional ICLAs from the same person will generally have different ids (email may or may not be the same). This is also true of the full name used in ICLAs: apart from ICLAs which are sent to record a change of name, we sometimes get ICLAs with a different spelling of a name, or with changes to the given names. The current approach for replacement ICLAs is to create a directory and store all the ICLAs in the same directory. If email address is used, something similar would be needed. For the other IDs, the list of files in a directory would need to be replaced with a list of IDs in the index. -- This is an automated message from the Apache Git Service. To respond to the message, please log on to GitHub and use the URL above to go to the specific comment. To unsubscribe, e-mail: dev-unsubscr...@whimsical.apache.org For queries about this service, please contact Infrastructure at: us...@infra.apache.org