This is an automated email from the ASF dual-hosted git repository.
krickert pushed a change to branch OPENNLP-1850-2c-profiles
in repository https://gitbox.apache.org/repos/asf/opennlp.git
discard 70ec7a3df OPENNLP-1850 Resolve Norwegian nb/nn to the Norwegian
profile (profiles)
discard f600dfd55 OPENNLP-1850 Per-language NormalizationProfile registry (2c)
discard 55dbeb4b2 OPENNLP-1850 Layered Term model: Term, TermAnalyzer (2b)
discard 3fae8aad6 OPENNLP-1850 Fail loud on a Word_Break line missing its ';'
(tokenizer)
discard 47480171c OPENNLP-1850 UAX #29 word tokenizer: WordSegmenter,
WordTokenizer, WordType (2a)
discard b24c9ee3d OPENNLP-1850 Offset/alignment layer: Alignment, AlignedText,
buildAligned, *Aligned (1b)
add 124d8526e OPENNLP-1850 Review nits: align Confusables to
IllegalArgumentException; pom newline (engine)
add 1d8f582c0 OPENNLP-1850 Offset/alignment layer: Alignment, AlignedText,
buildAligned, *Aligned (1b)
add 702acc52f OPENNLP-1850 Review nits: soften DL forward-link; fix
LineBreakPreserving opener (alignment)
add c19c4fc11 OPENNLP-1850 UAX #29 word tokenizer: WordSegmenter,
WordTokenizer, WordType (2a)
add 57b77648e OPENNLP-1850 Fail loud on a Word_Break line missing its ';'
(tokenizer)
add f2d1d8cca OPENNLP-1850 Review nits: ExtendedPictographic fail-loud
parity + doc; WordType heuristic note (tokenizer)
add 58cff0120 OPENNLP-1850 Layered Term model: Term, TermAnalyzer (2b)
add a23a51358 OPENNLP-1850 Review nits: rename dashes()->dash(); LEMMA
doc+test; soften forward-link (Term)
add 8d32dbac9 OPENNLP-1850 Per-language NormalizationProfile registry (2c)
add 859146c9c OPENNLP-1850 Resolve Norwegian nb/nn to the Norwegian
profile (profiles)
add 13e46418b OPENNLP-1850 Review nits: add Turkish profile; derive
coverage from the enum (profiles)
This update added new revisions after undoing existing revisions.
That is to say, some revisions that were in the old version of the
branch are not in the new version. This situation occurs
when a user --force pushes a change and generates a repository
containing something like this:
* -- * -- B -- O -- O -- O (70ec7a3df)
\
N -- N -- N refs/heads/OPENNLP-1850-2c-profiles (13e46418b)
You should already have received notification emails for all of the O
revisions, and so the following emails describe only the N revisions
from the common base, B.
Any revisions marked "omit" are not gone; other references still
refer to them. Any revisions marked "discard" are gone forever.
No new revisions were added by this update.
Summary of changes:
opennlp-api/pom.xml | 2 +-
.../util/normalizer/OffsetAwareNormalizer.java | 2 +-
.../tools/tokenize/uax29/ExtendedPictographic.java | 26 +++++++++++++++-------
.../opennlp/tools/tokenize/uax29/WordType.java | 4 +++-
.../opennlp/tools/util/normalizer/Confusables.java | 4 ++--
...PreservingWhitespaceCharSequenceNormalizer.java | 2 +-
.../util/normalizer/NormalizationProfiles.java | 9 +++++++-
.../tools/util/normalizer/TermAnalyzer.java | 11 ++++-----
.../tokenize/uax29/ExtendedPictographicTest.java | 16 +++++++++++++
.../tools/util/normalizer/ConfusablesLoadTest.java | 2 +-
.../util/normalizer/NormalizationProfilesTest.java | 25 +++++++++++++++++----
.../tools/util/normalizer/TermAnalyzerTest.java | 23 +++++++++++++++++++
12 files changed, 101 insertions(+), 25 deletions(-)