This is an automated email from the ASF dual-hosted git repository.
krickert pushed a change to branch OPENNLP-1850-4-docs
in repository https://gitbox.apache.org/repos/asf/opennlp.git
discard 0022bc1de OPENNLP-1850 Document the offset-aware substitution folds
(quotes, digits, ellipsis, bullets, umlaut)
discard ec325350c OPENNLP-1850 Name the OffsetMappingNameFinder capability
interface in the manual
discard 586a88304 OPENNLP-1850 Document the offset-aware normalization
pipeline (buildAligned)
discard e2f96b546 OPENNLP-1850 Document Unicode normalization, the UAX #29
tokenizer, and DL handling
discard 038e23d28 OPENNLP-1850 Add OffsetMappingNameFinder capability
interface and a findInOriginal end-to-end test
discard 3b2cdd836 OPENNLP-1850 Offset-safe, Unicode-aware input normalization
in the DL components
discard 40cd7299d OPENNLP-1850 Address tokenizer review comments
discard e597e8b82 OPENNLP-1850 Address Copilot review on the UAX #29 tokenizer
discard 3c2824c96 OPENNLP-1850 UAX #29 word tokenizer and the layered Term
model
add bb20b101a OPENNLP-1850 Harden the offset-aware folds from review
feedback
add 3d09dfc46 OPENNLP-1850 UAX #29 word tokenizer and the layered Term
model
add f425549b6 OPENNLP-1850 Address Copilot review on the UAX #29 tokenizer
add 8c2451a52 OPENNLP-1850 Address tokenizer review comments
add 5e7ff554a OPENNLP-1850 Offset-safe, Unicode-aware input normalization
in the DL components
add ad9331ffe OPENNLP-1850 Add OffsetMappingNameFinder capability
interface and a findInOriginal end-to-end test
add bc401d3d3 OPENNLP-1850 Resolve overlapping chunk spans and compose the
input alignment
add 777c501b1 OPENNLP-1850 Document Unicode normalization, the UAX #29
tokenizer, and DL handling
add a44d98f95 OPENNLP-1850 Document the offset-aware normalization
pipeline (buildAligned)
add c832cb2f0 OPENNLP-1850 Name the OffsetMappingNameFinder capability
interface in the manual
add 15a0cb401 OPENNLP-1850 Document the offset-aware substitution folds
(quotes, digits, ellipsis, bullets, umlaut)
add 2fd954339 OPENNLP-1850 Document the supplementary-dash offset shift in
the DL fold options
This update added new revisions after undoing existing revisions.
That is to say, some revisions that were in the old version of the
branch are not in the new version. This situation occurs
when a user --force pushes a change and generates a repository
containing something like this:
* -- * -- B -- O -- O -- O (0022bc1de)
\
N -- N -- N refs/heads/OPENNLP-1850-4-docs (2fd954339)
You should already have received notification emails for all of the O
revisions, and so the following emails describe only the N revisions
from the common base, B.
Any revisions marked "omit" are not gone; other references still
refer to them. Any revisions marked "discard" are gone forever.
No new revisions were added by this update.
Summary of changes:
.../tools/util/normalizer/AlignmentTest.java | 17 ++++
.../tools/util/normalizer/CharClassTest.java | 37 ++++++++
.../src/main/java/opennlp/dl/AbstractDL.java | 80 +++++++++++++----
.../src/main/java/opennlp/dl/InferenceOptions.java | 8 ++
.../java/opennlp/dl/namefinder/NameFinderDL.java | 99 ++++++++++++++++------
.../java/opennlp/dl/AbstractDLChunkingTest.java | 50 ++++++++++-
.../opennlp/dl/namefinder/NameFinderDLTest.java | 56 ++++++++++++
.../AlignedAggregateCharSequenceNormalizer.java | 8 +-
.../opennlp/tools/util/normalizer/Confusables.java | 6 ++
.../GermanUmlautCharSequenceNormalizer.java | 5 +-
.../tools/util/normalizer/TextNormalizer.java | 17 ++--
.../normalizer/AlignedNormalizerPipelineTest.java | 21 +++++
.../GermanUmlautCharSequenceNormalizerTest.java | 17 ++++
opennlp-docs/src/docbkx/normalizer.xml | 5 +-
14 files changed, 370 insertions(+), 56 deletions(-)