This is an automated email from the ASF dual-hosted git repository.
krickert pushed a change to branch OPENNLP-1850-2-tokenizer
in repository https://gitbox.apache.org/repos/asf/opennlp.git
discard e0ea17cbf OPENNLP-1850 Fail fast on null public-entry arguments
(review nits)
discard b15005612 OPENNLP-1850 Clarify that Extended_Pictographic symbols are
kept as emoji
discard 2860117dc OPENNLP-1850 Address tokenizer review comments
discard bf37d092f OPENNLP-1850 Address Copilot review on the UAX #29 tokenizer
discard fe1e77c7c OPENNLP-1850 UAX #29 word tokenizer and the layered Term
model
add 8f1d947dc OPENNLP-1850 Harden andThen insertion mapping docs/tests;
label rung index
add 59043dfea OPENNLP-1850 UAX #29 word tokenizer and the layered Term
model
add f48f50f1f OPENNLP-1850 Address Copilot review on the UAX #29 tokenizer
add cc89abf52 OPENNLP-1850 Address tokenizer review comments
add f70c1956a OPENNLP-1850 Clarify that Extended_Pictographic symbols are
kept as emoji
add a75f272f9 OPENNLP-1850 Fail fast on null public-entry arguments
(review nits)
add 7a3c25ac7 OPENNLP-1850 Address review: fail-loud TermAnalyzer default;
harden WordBreakProperty
This update added new revisions after undoing existing revisions.
That is to say, some revisions that were in the old version of the
branch are not in the new version. This situation occurs
when a user --force pushes a change and generates a repository
containing something like this:
* -- * -- B -- O -- O -- O (e0ea17cbf)
\
N -- N -- N refs/heads/OPENNLP-1850-2-tokenizer (7a3c25ac7)
You should already have received notification emails for all of the O
revisions, and so the following emails describe only the N revisions
from the common base, B.
Any revisions marked "omit" are not gone; other references still
refer to them. Any revisions marked "discard" are gone forever.
No new revisions were added by this update.
Summary of changes:
.../opennlp/tools/util/normalizer/Alignment.java | 9 ++++
.../tools/util/normalizer/AlignmentTest.java | 48 ++++++++++++++++++++++
.../tools/tokenize/uax29/WordBreakProperty.java | 9 ++--
.../tools/util/normalizer/TermAnalyzer.java | 4 ++
.../tools/util/normalizer/TextNormalizer.java | 2 +-
.../uax29/WordBoundaryConformanceTest.java | 2 -
.../tokenize/uax29/WordBreakPropertyTest.java | 3 +-
.../normalizer/AlignedNormalizerPipelineTest.java | 4 +-
8 files changed, 71 insertions(+), 10 deletions(-)