On Wed, Dec 11, 2019 at 1:58 PM Giovanni Bechis <giova...@paclan.it> wrote:
>
> On 12/11/19 3:17 PM, Bill Cole wrote:
> > On 11 Dec 2019, at 2:39, Giovanni Bechis wrote:
> >
> >> On 12/11/19 6:21 AM, KADAM, SIDDHESH wrote:
> >>> Hi PFA...
> >>>
> >>> On 12/11/2019 12:36 AM, Giovanni Bechis wrote:
> >>>> On 12/10/19 7:49 PM, Michael Storz wrote:
> >>>> [...]
> >>>>> My copy hit
> >>>>>
> >>>>> BODY_SINGLE_WORD=1.347, HTML_IMAGE_ONLY_04=1.172, MPART_ALT_DIFF=0.79
> >>>>>
> >>>>> not enough to mark it as spammy.
> >>>
> >> FuzzyOcr + Bayes is killing this kind of email for me:
> >
> > FuzzyOcr is unmaintained and doesn't even have an authoritative repository 
> > as far as I can tell. It is computationally very expensive, to the degree 
> > that it isn't safe to just add it to an existing mail system which does not 
> > have a lot of idle CPU and memory capacity.
> >
> It's true that it's unmaintained, but I have it running on Perl 5.28 with some
> patches and it's still useful every now and then (if you have some spare CPU
> cycles and you know what you are doing).
> A new OCR plugin would definitely be a better choice.
>   Giovanni

I asked the project owner if I could put FuzzyOcr on GitHub. He said
go for it, so it is now at https://github.com/raubvogel/FuzzyOcr.
