This is an automated email from the ASF dual-hosted git repository.
krickert pushed a change to branch OPENNLP-1850-1-foundation
in repository https://gitbox.apache.org/repos/asf/opennlp.git
from cec6989a3 OPENNLP-1850 Offset-aware normalization pipeline (buildAligned)
add fb5edf31f OPENNLP-1850 Make the per-code-point substitution folds offset-aware
No new revisions were added by this update.
Summary of changes:
 .../normalizer/BulletCharSequenceNormalizer.java   |  7 +-
 .../normalizer/DigitCharSequenceNormalizer.java    | 25 ++++++-
 .../normalizer/EllipsisCharSequenceNormalizer.java | 43 +++++++++--
 .../GermanUmlautCharSequenceNormalizer.java        | 65 +++++++++-------
 .../normalizer/QuoteCharSequenceNormalizer.java    | 10 ++-
 .../normalizer/AlignedNormalizerPipelineTest.java  | 86 +++++++++++++++++++++-
 6 files changed, 197 insertions(+), 39 deletions(-)