This is an automated email from the ASF dual-hosted git repository.
krickert pushed a change to branch OPENNLP-1850-4-docs
in repository https://gitbox.apache.org/repos/asf/opennlp.git
discard c83701ece OPENNLP-1850 Docs review nits: populate ids2Labels example;
rule-based hyphenation
discard cc251b410 OPENNLP-1850 Mirror the Extended_Pictographic emoji caveat
in the tokenizer manual
discard 046012d1a OPENNLP-1850 Tighten normalizer manual wording (review nits)
discard 858ddf926 OPENNLP-1850 Document the supplementary-dash offset shift in
the DL fold options
discard f79420066 OPENNLP-1850 Document the offset-aware substitution folds
(quotes, digits, ellipsis, bullets, umlaut)
discard 6a87f64e8 OPENNLP-1850 Name the OffsetMappingNameFinder capability
interface in the manual
discard 20098608b OPENNLP-1850 Document the offset-aware normalization
pipeline (buildAligned)
discard 95de4326b OPENNLP-1850 Document Unicode normalization, the UAX #29
tokenizer, and DL handling
discard e7f3c5978 OPENNLP-1850 Fully-qualify TokenNameFinder javadoc links in
NameFinderDL
discard 2c76b083d OPENNLP-1850 Fail loud on corrupt document-classification
model output
discard 318352921 OPENNLP-1850 Fail fast on null finder input; fix the GPU
eval test options
discard 43aa7255f OPENNLP-1850 Harden fail-loud paths in the DL components
discard 1cc2a9789 OPENNLP-1850 Add real-model chunk-boundary eval tests; drop
dead label constants
discard e9c0334eb OPENNLP-1850 Resolve overlapping chunk spans and compose the
input alignment
discard 539780738 OPENNLP-1850 Add OffsetMappingNameFinder capability
interface and a findInOriginal end-to-end test
discard b876e5506 OPENNLP-1850 Offset-safe, Unicode-aware input normalization
in the DL components
omit 0bf7f6c03 OPENNLP-1850 Lazy, recoverable loading for WordBreakProperty
and ExtendedPictographic
omit ab038b41e OPENNLP-1850 Address review: fail-loud TermAnalyzer default;
harden WordBreakProperty
omit 2da2949c8 OPENNLP-1850 Fail fast on null public-entry arguments
(review nits)
omit d06489714 OPENNLP-1850 Clarify that Extended_Pictographic symbols are
kept as emoji
omit 724a2544e OPENNLP-1850 Address tokenizer review comments
omit 2ecf63796 OPENNLP-1850 Address Copilot review on the UAX #29 tokenizer
omit 064d36345 OPENNLP-1850 UAX #29 word tokenizer and the layered Term
model
add a450069ef OPENNLP-1850 UAX #29 word tokenizer: WordSegmenter,
WordTokenizer, WordType (2a)
add 57e2b5833 OPENNLP-1850 Layered Term model: Term, TermAnalyzer (2b)
add b6cd17380 OPENNLP-1850 Per-language NormalizationProfile registry (2c)
add e945009ea OPENNLP-1850 Offset-safe, Unicode-aware input normalization
in the DL components
add aff3fd44f OPENNLP-1850 Add OffsetMappingNameFinder capability
interface and a findInOriginal end-to-end test
add 702d392c8 OPENNLP-1850 Resolve overlapping chunk spans and compose the
input alignment
add 284afa576 OPENNLP-1850 Add real-model chunk-boundary eval tests; drop
dead label constants
add f3dc9b992 OPENNLP-1850 Harden fail-loud paths in the DL components
add 7e1c4e55d OPENNLP-1850 Fail fast on null finder input; fix the GPU
eval test options
add 062322e22 OPENNLP-1850 Fail loud on corrupt document-classification
model output
add b6dc2418f OPENNLP-1850 Fully-qualify TokenNameFinder javadoc links in
NameFinderDL
add 2cf733491 OPENNLP-1850 Document Unicode normalization, the UAX #29
tokenizer, and DL handling
add ef6a3c1da OPENNLP-1850 Document the offset-aware normalization
pipeline (buildAligned)
add 00f4a3a35 OPENNLP-1850 Name the OffsetMappingNameFinder capability
interface in the manual
add 556adc35d OPENNLP-1850 Document the offset-aware substitution folds
(quotes, digits, ellipsis, bullets, umlaut)
add 2e91c1d48 OPENNLP-1850 Document the supplementary-dash offset shift in
the DL fold options
add f758d8ef6 OPENNLP-1850 Tighten normalizer manual wording (review nits)
add ce01cc0b1 OPENNLP-1850 Mirror the Extended_Pictographic emoji caveat
in the tokenizer manual
add e0011e207 OPENNLP-1850 Docs review nits: populate ids2Labels example;
rule-based hyphenation
This update added new revisions after undoing existing revisions.
That is to say, some revisions that were in the old version of the
branch are not in the new version. This situation occurs
when a user --force pushes a change and generates a repository
containing something like this:
* -- * -- B -- O -- O -- O (c83701ece)
\
N -- N -- N refs/heads/OPENNLP-1850-4-docs (e0011e207)
You should already have received notification emails for all of the O
revisions, and so the following emails describe only the N revisions
from the common base, B.
Any revisions marked "omit" are not gone; other references still
refer to them. Any revisions marked "discard" are gone forever.
No new revisions were added by this update.
Summary of changes: