This is an automated email from the ASF dual-hosted git repository.
krickert pushed a change to branch OPENNLP-1850-4-docs
in repository https://gitbox.apache.org/repos/asf/opennlp.git
discard 3037db7b2 OPENNLP-1850 Document Unicode normalization and the UAX #29
tokenizer
discard 1c171106c OPENNLP-1850 Offset-safe input normalization in the DL
components
discard dab560578 OPENNLP-1850 UAX #29 word tokenizer and the layered Term
model
add 7b5dfff77 OPENNLP-1850 Address Copilot review on the normalization
foundation
add b493f8959 OPENNLP-1850 UAX #29 word tokenizer and the layered Term
model
add 67c922aea OPENNLP-1850 Address Copilot review on the UAX #29 tokenizer
add 39616baad OPENNLP-1850 Offset-safe input normalization in the DL
components
add 8534bb36f OPENNLP-1850 Make DL entity spans offset-safe under input
normalization
add 0ba03e45d OPENNLP-1850 Document Unicode normalization and the UAX #29
tokenizer
add 9a71f283d OPENNLP-1850 Fix non-compiling DL snippets in the manual
This update added new revisions after undoing existing revisions.
That is to say, some revisions that were in the old version of the
branch are not in the new version. This situation occurs
when a user --force pushes a change and generates a repository
containing something like this:
* -- * -- B -- O -- O -- O (3037db7b2)
\
N -- N -- N refs/heads/OPENNLP-1850-4-docs (9a71f283d)
You should already have received notification emails for all of the O
revisions, and so the following emails describe only the N revisions
from the common base, B.
Any revisions marked "omit" are not gone; other references still
refer to them. Any revisions marked "discard" are gone forever.
No new revisions were added by this update.
Summary of changes:
LICENSE | 2 +-
.../opennlp/tools/util/normalizer/OffsetMap.java | 19 +--
.../tools/util/normalizer/OffsetMapTest.java | 89 ++++++++++++++
.../src/main/java/opennlp/dl/AbstractDL.java | 49 +++++++-
.../java/opennlp/dl/namefinder/NameFinderDL.java | 54 ++++++++-
.../java/opennlp/dl/AbstractDLChunkingTest.java | 26 ++++
.../normalizer/CaseFoldCharSequenceNormalizer.java | 7 +-
.../opennlp/tools/util/normalizer/CharClass.java | 5 +
.../opennlp/tools/util/normalizer/Confusables.java | 18 ++-
.../util/normalizer/NormalizationProfiles.java | 2 +
.../uax29/WordBoundaryConformanceTest.java | 6 +-
.../CaseFoldCharSequenceNormalizerTest.java | 4 +
.../tools/util/normalizer/CharClassTest.java | 132 +++++++++++++++++++++
.../util/normalizer/ConfusableSkeletonTest.java | 57 +++++++++
.../util/normalizer/NormalizationProfilesTest.java | 6 +
opennlp-docs/src/docbkx/doccat.xml | 5 +-
opennlp-docs/src/docbkx/namefinder.xml | 25 +++-
17 files changed, 476 insertions(+), 30 deletions(-)
create mode 100644
opennlp-api/src/test/java/opennlp/tools/util/normalizer/OffsetMapTest.java
create mode 100644
opennlp-core/opennlp-runtime/src/test/java/opennlp/tools/util/normalizer/ConfusableSkeletonTest.java