This is an automated email from the ASF dual-hosted git repository.

tballison pushed a change to branch junk-detector-v6
in repository https://gitbox.apache.org/repos/asf/tika.git


    from 0e08c2d80a checkpoint
     new 9bae0246a8 checkpoint v7
     new f5c61f31af checkpoint v7

The 2 revisions listed above as "new" are entirely new to this
repository and will be described in separate emails.  The revisions
listed as "add" were already present in the repository and have only
been added to this reference.


Summary of changes:
 .../apache/tika/quality/TextQualityComparison.java |   9 +-
 .../apache/tika/quality/TextQualityDetector.java   |   2 +-
 .../apache/tika/ml/junkdetect/JunkDetector.java    |  16 +-
 .../ml/junkdetect/JunkFilterEncodingDetector.java  | 162 +--------
 .../tika/ml/junkdetect/tools/DebugScriptRuns.java  | 282 +++++++++++++++
 .../tika/ml/junkdetect/tools/EvalJunkDetector.java |   2 +-
 .../junkdetect/tools/PrototypeCodepointHash.java   | 401 +++++++++++++++++++++
 .../tika/ml/junkdetect/JunkDetectorSmokeTest.java  |  47 ++-
 8 files changed, 744 insertions(+), 177 deletions(-)
 create mode 100644 
tika-ml/tika-ml-junkdetect/src/main/java/org/apache/tika/ml/junkdetect/tools/DebugScriptRuns.java

Reply via email to