This is an automated email from the ASF dual-hosted git repository.
tballison pushed a change to branch TIKA-4745-more-junk-charset
in repository https://gitbox.apache.org/repos/asf/tika.git
from d5ef09b8a8 TIKA-4745 -- efficiency improvements
add 6730ade8bd TIKA-4745 -- further efficiency improvements
No new revisions were added by this update.
Summary of changes:
.../apache/tika/ml/junkdetect/BigramTables.java | 44 +++++++++++++++++----
.../org/apache/tika/ml/junkdetect/junkdetect.bin | Bin 1395108 -> 727979
bytes
2 files changed, 37 insertions(+), 7 deletions(-)