ppkarwasz opened a new pull request, #427:
URL: https://github.com/apache/commons-codec/pull/427

   This change adds two methods to `DigestUtils` that compute generalized Git 
object identifiers using an arbitrary `MessageDigest`, rather than being 
restricted to SHA-1:
   
   - `gitBlob(digest, input)`: computes a generalized [Git blob object 
identifier](https://git-scm.com/book/en/v2/Git-Internals-Git-Objects) for a 
given file or byte content.
   - `gitTree(digest, file)`: computes a generalized [Git tree object 
identifier](https://git-scm.com/book/en/v2/Git-Internals-Git-Objects) for a 
given directory.
   
   ### Motivation
   
   The standard Git object identifiers use SHA-1, which is [in the process of 
being replaced by SHA-256](https://git-scm.com/docs/hash-function-transition) 
in Git itself. These methods generalize the identifier computation to support 
any `MessageDigest`, enabling both forward compatibility and use with external 
standards.
   
   In particular, the `swh:1:cnt:` (content) and `swh:1:dir:` (directory) 
identifier types defined by [SWHID (ISO/IEC 
18670)](https://www.swhid.org/specification/v1.2/5.Core_identifiers/) are 
currently compatible with Git blob and tree identifiers respectively (using 
SHA-1), and can be used to generate canonical, persistent identifiers for 
unpacked source and binary distributions.
   
   Before you push a pull request, review this list:
   
   - [x] Read the [contribution guidelines](CONTRIBUTING.md) for this project.
   - [x] Read the [ASF Generative Tooling 
Guidance](https://www.apache.org/legal/generative-tooling.html) if you use 
Artificial Intelligence (AI).
   - [x] I used AI to create any part of, or all of, this pull request. Which 
AI tool was used to create this pull request, and to what extent did it 
contribute? Claude Code was used for tests and to review the main code
   - [x] Run a successful build using the default 
[Maven](https://maven.apache.org/) goal with `mvn`; that's `mvn` on the command 
line by itself.
   - [x] Write unit tests that match behavioral changes, where the tests fail 
if the changes to the runtime are not applied. This may not always be possible, 
but it is a best practice.
   - [x] Write a pull request description that is detailed enough to understand 
what the pull request does, how, and why.
   - [x] Each commit in the pull request should have a meaningful subject line 
and body. Note that a maintainer may squash commits during the merge process.
   


-- 
This is an automated message from the Apache Git Service.
To respond to the message, please log on to GitHub and use the
URL above to go to the specific comment.

To unsubscribe, e-mail: [email protected]

For queries about this service, please contact Infrastructure at:
[email protected]

Reply via email to