Re: How SA reacts to a bunch of garbage characters

2016-06-27 Thread Olivier
Hi, As promised, here is a one-week log of FuzzyOcr: http://pastebin.com/XwwdXkTV The results are not too good. Olivier

Re: How SA reacts to a bunch of garbage characters

2016-06-15 Thread Olivier
Matus, >>To a part that would do regexp rules, but not Bayes? I don't know if it >>is possible. > > someone who knows SA internals will have to answer this one, but I doubt > it's useful, see below. I will take a look at Bayes OCR; it does inject the text OCR'ed from an image into the body of th

Re: How SA reacts to a bunch of garbage characters

2016-06-15 Thread Olivier
RW, > I stopped using OCR a long time ago because I didn't find that image > spam was particularly hard to catch. These days I find that spams with > images are mostly either pictures of Russian girls or spoofed corporate > logos. Then you need something able to detect the amount of flesh on a p

Re: How SA reacts to a bunch of garbage characters

2016-06-14 Thread RW
On Tue, 14 Jun 2016 08:56:50 -0400 Joe Quinn wrote: > On 6/14/2016 8:33 AM, Matus UHLAR - fantomas wrote: > > that is just what I would like to know: if OCR produces results > > good enough for Bayes and other rules. > > > > I don't think there's a difference between Bayes and other rules. > > I

Re: How SA reacts to a bunch of garbage characters

2016-06-14 Thread Joe Quinn
On 6/14/2016 8:33 AM, Matus UHLAR - fantomas wrote: that is just what I would like to know: if OCR produces results good enough for Bayes and other rules. I don't think there's a difference between Bayes and other rules. It's also possible that Bayes would have better results with misread charact

Re: How SA reacts to a bunch of garbage characters

2016-06-14 Thread Matus UHLAR - fantomas
Sure, the OCR results are not very precise. But could we imagine that they are pushed into a part of the message that will not go through Bayes? where do you want to push the OCR'ed text, if not back to SA to check other rules like Bayes? On 14.06.16 13:50, Olivier wrote: To a part that would do

Re: How SA reacts to a bunch of garbage characters

2016-06-13 Thread Olivier
Matus, >>Sure, the OCR results are not very precise. But could we imagine that >>they are pushed into a part of the message that will not go through Bayes? > where do you want to push the OCR'ed text, if not back to SA to check other > rules like Bayes? To a part that would do regexp rules, but not

Re: How SA reacts to a bunch of garbage characters

2016-06-13 Thread Matus UHLAR - fantomas
On 09.06.16 10:43, Olivier wrote: For years I have had the FuzzyOcr plugin running, but it helps little, because it has its own list of words to keep updated. I am wondering whether, instead of using its own list of words, the result could be injected back into the body of the main message. I raised thi

Re: How SA reacts to a bunch of garbage characters

2016-06-12 Thread Olivier
Matus, Thank you for your reply. > On 09.06.16 10:43, Olivier wrote: >>For years I have had the FuzzyOcr plugin running, but it helps little, >>because it has its own list of words to keep updated. >> >>I am wondering whether, instead of using its own list of words, the result >>could be injected back into

Re: How SA reacts to a bunch of garbage characters

2016-06-10 Thread Matus UHLAR - fantomas
On 09.06.16 10:43, Olivier wrote: For years I have had the FuzzyOcr plugin running, but it helps little, because it has its own list of words to keep updated. I am wondering whether, instead of using its own list of words, the result could be injected back into the body of the main message. I raised thi

How SA reacts to a bunch of garbage characters

2016-06-08 Thread Olivier
Hi, For years I have had the FuzzyOcr plugin running, but it helps little, because it has its own list of words to keep updated. I am wondering whether, instead of using its own list of words, the result could be injected back into the body of the main message. Most of the time, what will be injected back