This is an automated email from the ASF dual-hosted git repository.

krickert pushed a change to branch OPENNLP-1850-3-dl
in repository https://gitbox.apache.org/repos/asf/opennlp.git


 discard 40698dcd2 OPENNLP-1850 Add OffsetMappingNameFinder capability interface and a findInOriginal end-to-end test
 discard c51f37d12 OPENNLP-1850 Offset-safe, Unicode-aware input normalization in the DL components
 discard 36de08fd2 OPENNLP-1850 Address tokenizer review comments
 discard 01e8387eb OPENNLP-1850 Address Copilot review on the UAX #29 tokenizer
 discard 5e9fd817e OPENNLP-1850 UAX #29 word tokenizer and the layered Term model
     add cec6989a3 OPENNLP-1850 Offset-aware normalization pipeline (buildAligned)
     add 3226944f0 OPENNLP-1850 UAX #29 word tokenizer and the layered Term model
     add 707eadd4f OPENNLP-1850 Address Copilot review on the UAX #29 tokenizer
     add d76a28b55 OPENNLP-1850 Address tokenizer review comments
     add 6814f206b OPENNLP-1850 Offset-safe, Unicode-aware input normalization in the DL components
     add 001ac01e8 OPENNLP-1850 Add OffsetMappingNameFinder capability interface and a findInOriginal end-to-end test

This update added new revisions after undoing existing revisions.
That is to say, some revisions that were in the old version of the
branch are not in the new version.  This situation occurs
when a user --force pushes a change and generates a repository
containing something like this:

 * -- * -- B -- O -- O -- O   (40698dcd2)
            \
             N -- N -- N   refs/heads/OPENNLP-1850-3-dl (001ac01e8)

You should already have received notification emails for all of the O
revisions, and so the following emails describe only the N revisions
from the common base, B.

Any revisions marked "omit" are not gone; other references still
refer to them.  Any revisions marked "discard" are gone forever.

No new revisions were added by this update.

Summary of changes:
 .../util/normalizer/OffsetAwareNormalizer.java     |  49 +++++
 .../AlignedAggregateCharSequenceNormalizer.java    |  62 ++++++
 .../normalizer/DashCharSequenceNormalizer.java     |   7 +-
 .../InvisibleCharSequenceNormalizer.java           |   7 +-
 ...PreservingWhitespaceCharSequenceNormalizer.java |  72 +++++++
 .../tools/util/normalizer/TextNormalizer.java      |  36 ++++
 .../WhitespaceCharSequenceNormalizer.java          |  10 +-
 .../normalizer/AlignedNormalizerPipelineTest.java  | 239 +++++++++++++++++++++
 8 files changed, 479 insertions(+), 3 deletions(-)
 create mode 100644 opennlp-api/src/main/java/opennlp/tools/util/normalizer/OffsetAwareNormalizer.java
 create mode 100644 opennlp-core/opennlp-runtime/src/main/java/opennlp/tools/util/normalizer/AlignedAggregateCharSequenceNormalizer.java
 create mode 100644 opennlp-core/opennlp-runtime/src/main/java/opennlp/tools/util/normalizer/LineBreakPreservingWhitespaceCharSequenceNormalizer.java
 create mode 100644 opennlp-core/opennlp-runtime/src/test/java/opennlp/tools/util/normalizer/AlignedNormalizerPipelineTest.java
