This is an automated email from the ASF dual-hosted git repository.
krickert pushed a change to branch OPENNLP-1850-3-dl
in repository https://gitbox.apache.org/repos/asf/opennlp.git
omit 3f77034bb OPENNLP-1850 Fail loud on corrupt document-classification model output
omit 07f18c467 OPENNLP-1850 Fail fast on null finder input; fix the GPU eval test options
omit 47a39bf17 OPENNLP-1850 Harden fail-loud paths in the DL components
omit 5d074ccac OPENNLP-1850 Add real-model chunk-boundary eval tests; drop dead label constants
omit 07b123286 OPENNLP-1850 Resolve overlapping chunk spans and compose the input alignment
omit 4e3e8d0b0 OPENNLP-1850 Add OffsetMappingNameFinder capability interface and a findInOriginal end-to-end test
omit 166bc4d20 OPENNLP-1850 Offset-safe, Unicode-aware input normalization in the DL components
omit e0ea17cbf OPENNLP-1850 Fail fast on null public-entry arguments (review nits)
omit b15005612 OPENNLP-1850 Clarify that Extended_Pictographic symbols are kept as emoji
omit 2860117dc OPENNLP-1850 Address tokenizer review comments
omit bf37d092f OPENNLP-1850 Address Copilot review on the UAX #29 tokenizer
omit fe1e77c7c OPENNLP-1850 UAX #29 word tokenizer and the layered Term model
add 8f1d947dc OPENNLP-1850 Harden andThen insertion mapping docs/tests; label rung index
add 59043dfea OPENNLP-1850 UAX #29 word tokenizer and the layered Term model
add f48f50f1f OPENNLP-1850 Address Copilot review on the UAX #29 tokenizer
add cc89abf52 OPENNLP-1850 Address tokenizer review comments
add f70c1956a OPENNLP-1850 Clarify that Extended_Pictographic symbols are kept as emoji
add a75f272f9 OPENNLP-1850 Fail fast on null public-entry arguments (review nits)
add 7a3c25ac7 OPENNLP-1850 Address review: fail-loud TermAnalyzer default; harden WordBreakProperty
add bfcbeb5a1 OPENNLP-1850 Offset-safe, Unicode-aware input normalization in the DL components
add b933a2d97 OPENNLP-1850 Add OffsetMappingNameFinder capability interface and a findInOriginal end-to-end test
add 7127f0650 OPENNLP-1850 Resolve overlapping chunk spans and compose the input alignment
add 280966c73 OPENNLP-1850 Add real-model chunk-boundary eval tests; drop dead label constants
add 706bd2dd9 OPENNLP-1850 Harden fail-loud paths in the DL components
add 6558e8bc8 OPENNLP-1850 Fail fast on null finder input; fix the GPU eval test options
add 143cdb72d OPENNLP-1850 Fail loud on corrupt document-classification model output
add 1ea12ea28 OPENNLP-1850 Fully-qualify TokenNameFinder javadoc links in NameFinderDL
This update added new revisions after undoing existing revisions.
That is to say, some revisions that were in the old version of the
branch are not in the new version. This situation occurs
when a user --force pushes a change and generates a repository
containing something like this:
 * -- * -- B -- O -- O -- O   (3f77034bb)
            \
             N -- N -- N   refs/heads/OPENNLP-1850-3-dl (1ea12ea28)
You should already have received notification emails for all of the O
revisions, and so the following emails describe only the N revisions
from the common base, B.
Any revisions marked "omit" are not gone; other references still
refer to them. Any revisions marked "discard" are gone forever.
No new revisions were added by this update.
Summary of changes:
.../opennlp/tools/util/normalizer/Alignment.java  |  9 ++++
.../tools/util/normalizer/AlignmentTest.java      | 48 ++++++++++++++++++++++
.../java/opennlp/dl/namefinder/NameFinderDL.java  |  9 ++--
.../tools/tokenize/uax29/WordBreakProperty.java   |  9 ++--
.../tools/util/normalizer/TermAnalyzer.java       |  4 ++
.../tools/util/normalizer/TextNormalizer.java     |  2 +-
.../uax29/WordBoundaryConformanceTest.java        |  2 -
.../tokenize/uax29/WordBreakPropertyTest.java     |  3 +-
.../normalizer/AlignedNormalizerPipelineTest.java |  4 +-
9 files changed, 75 insertions(+), 15 deletions(-)