On Wed, Dec 11, 2019 at 1:58 PM Giovanni Bechis <giova...@paclan.it> wrote: > > On 12/11/19 3:17 PM, Bill Cole wrote: > > On 11 Dec 2019, at 2:39, Giovanni Bechis wrote: > > > >> On 12/11/19 6:21 AM, KADAM, SIDDHESH wrote: > >>> Hi PFA... > >>> > >>> On 12/11/2019 12:36 AM, Giovanni Bechis wrote: > >>>> On 12/10/19 7:49 PM, Michael Storz wrote: > >>>> [...] > >>>>> My copy hit > >>>>> > >>>>> BODY_SINGLE_WORD=1.347, HTML_IMAGE_ONLY_04=1.172, MPART_ALT_DIFF=0.79 > >>>>> > >>>>> not enough to mark it as spammy. > >>> > >> FuzzyOcr + bayes is killing this kind of emails for me: > > > > FuzzyOcr is unmaintained and doesn't even have an authoritative repository > > as far as I can tell. It is computationally very expensive, to the degree > > that it isn't safe to just add it to an existing mail system which does not > > have a lot of idle CPU and memory capacity. > > > it's true that it's unmaintained but I have it running on Perl 5.28 with some > patches and it's still useful every now and then (if you have some spare cpu > cycles and you know what you are doing). > A new ocr plugin could be definetely a better choice. > Giovanni
I asked the project owner if I could put fuzzyocr on github. He said go for it, so it is now at https://github.com/raubvogel/FuzzyOcr.