This is an automated email from the ASF dual-hosted git repository.
krickert pushed a change to branch OPENNLP-1850-3-dl
in repository https://gitbox.apache.org/repos/asf/opennlp.git
omit 4c1289726 OPENNLP-1850 Add real-model chunk-boundary eval tests; drop dead label constants
omit 804245db6 OPENNLP-1850 Resolve overlapping chunk spans and compose the input alignment
omit f3252686d OPENNLP-1850 Add OffsetMappingNameFinder capability interface and a findInOriginal end-to-end test
omit 3c126ce8b OPENNLP-1850 Offset-safe, Unicode-aware input normalization in the DL components
omit 3f060954b OPENNLP-1850 Address tokenizer review comments
omit 72e0d2329 OPENNLP-1850 Address Copilot review on the UAX #29 tokenizer
omit 91016675a OPENNLP-1850 UAX #29 word tokenizer and the layered Term model
add 090593fca OPENNLP-1850 Document the NFC precondition of the German umlaut fold
add 396573a57 OPENNLP-1850 UAX #29 word tokenizer and the layered Term model
add 4fda04577 OPENNLP-1850 Address Copilot review on the UAX #29 tokenizer
add 3c57a7456 OPENNLP-1850 Address tokenizer review comments
add 0ec5a3651 OPENNLP-1850 Clarify that Extended_Pictographic symbols are kept as emoji
add d5319ccaf OPENNLP-1850 Offset-safe, Unicode-aware input normalization in the DL components
add 22256c160 OPENNLP-1850 Add OffsetMappingNameFinder capability interface and a findInOriginal end-to-end test
add a14dcf98d OPENNLP-1850 Resolve overlapping chunk spans and compose the input alignment
add a7488a883 OPENNLP-1850 Add real-model chunk-boundary eval tests; drop dead label constants
add 2006d1d85 OPENNLP-1850 Harden fail-loud paths in the DL components
This update added new revisions after undoing existing revisions.
That is to say, some revisions that were in the old version of the
branch are not in the new version. This situation occurs
when a user --force pushes a change and generates a repository
containing something like this:
 * -- * -- B -- O -- O -- O   (4c1289726)
            \
             N -- N -- N   refs/heads/OPENNLP-1850-3-dl (2006d1d85)
You should already have received notification emails for all of the O
revisions, and so the following emails describe only the N revisions
from the common base, B.
Any revisions marked "omit" are not gone; other references still
refer to them. Any revisions marked "discard" are gone forever.
No new revisions were added by this update.
Summary of changes:
.../java/opennlp/dl/doccat/DocumentCategorizerDL.java | 11 +++++++++--
.../main/java/opennlp/dl/namefinder/NameFinderDL.java | 15 +++++++++++----
.../opennlp/dl/doccat/DocumentCategorizerDLTest.java | 19 +++++++++++++++++++
.../java/opennlp/dl/namefinder/NameFinderDLTest.java | 8 ++++++++
.../opennlp/tools/tokenize/uax29/WordTokenizer.java | 4 +++-
.../GermanUmlautCharSequenceNormalizer.java | 4 ++++
.../tools/tokenize/uax29/WordTokenizerTest.java | 12 ++++++++++++
.../GermanUmlautCharSequenceNormalizerTest.java | 13 +++++++++++++
8 files changed, 79 insertions(+), 7 deletions(-)