This is an automated email from the ASF dual-hosted git repository.
krickert pushed a change to branch OPENNLP-1850-4-docs
in repository https://gitbox.apache.org/repos/asf/opennlp.git
omit 8475b41ba OPENNLP-1850 Document the supplementary-dash offset shift in
the DL fold options
omit 2798db144 OPENNLP-1850 Document the offset-aware substitution folds
(quotes, digits, ellipsis, bullets, umlaut)
omit fce6da402 OPENNLP-1850 Name the OffsetMappingNameFinder capability
interface in the manual
omit b7cd3e669 OPENNLP-1850 Document the offset-aware normalization
pipeline (buildAligned)
omit 225c6db41 OPENNLP-1850 Document Unicode normalization, the UAX #29
tokenizer, and DL handling
omit fdd329f7d OPENNLP-1850 Harden fail-loud paths in the DL components
omit c16f3c227 OPENNLP-1850 Add real-model chunk-boundary eval tests; drop
dead label constants
omit fd1b4addb OPENNLP-1850 Resolve overlapping chunk spans and compose the
input alignment
omit 8fdb00bf1 OPENNLP-1850 Add OffsetMappingNameFinder capability
interface and a findInOriginal end-to-end test
omit 64630e992 OPENNLP-1850 Offset-safe, Unicode-aware input normalization
in the DL components
add e0ea17cbf OPENNLP-1850 Fail fast on null public-entry arguments
(review nits)
add 166bc4d20 OPENNLP-1850 Offset-safe, Unicode-aware input normalization
in the DL components
add 4e3e8d0b0 OPENNLP-1850 Add OffsetMappingNameFinder capability
interface and a findInOriginal end-to-end test
add 07b123286 OPENNLP-1850 Resolve overlapping chunk spans and compose the
input alignment
add 5d074ccac OPENNLP-1850 Add real-model chunk-boundary eval tests; drop
dead label constants
add 47a39bf17 OPENNLP-1850 Harden fail-loud paths in the DL components
add 30358edb4 OPENNLP-1850 Document Unicode normalization, the UAX #29
tokenizer, and DL handling
add 7e4c59878 OPENNLP-1850 Document the offset-aware normalization
pipeline (buildAligned)
add 2876e47d9 OPENNLP-1850 Name the OffsetMappingNameFinder capability
interface in the manual
add 59c23b361 OPENNLP-1850 Document the offset-aware substitution folds
(quotes, digits, ellipsis, bullets, umlaut)
add 056078912 OPENNLP-1850 Document the supplementary-dash offset shift in
the DL fold options
add 76ec3b782 OPENNLP-1850 Tighten normalizer manual wording (review nits)
This update added new revisions after undoing existing revisions.
That is to say, some revisions that were in the old version of the
branch are not in the new version. This situation occurs
when a user --force pushes a change and generates a repository
containing something like this:
* -- * -- B -- O -- O -- O (8475b41ba)
\
N -- N -- N refs/heads/OPENNLP-1850-4-docs (76ec3b782)
You should already have received notification emails for all of the O
revisions, and so the following emails describe only the N revisions
from the common base, B.
Any revisions marked "omit" are not gone; other references still
refer to them. Any revisions marked "discard" are gone forever.
No new revisions were added by this update.
Summary of changes:
.../java/opennlp/tools/tokenize/uax29/ExtendedPictographic.java | 3 ++-
.../java/opennlp/tools/util/normalizer/NormalizationProfiles.java | 2 ++
.../src/main/java/opennlp/tools/util/normalizer/TermAnalyzer.java | 1 +
opennlp-docs/src/docbkx/normalizer.xml | 7 ++++---
opennlp-docs/src/docbkx/tokenizer.xml | 2 +-
5 files changed, 10 insertions(+), 5 deletions(-)