On Fri, 13 May 2016, Reindl Harald wrote:


Am 13.05.2016 um 18:11 schrieb John Hardin:
 On Fri, 13 May 2016, Reindl Harald wrote:

>  the problem is blowing out such rules with such scores at all with a
>  non working auto-QA (non-working in: no correction for days as well as
>  dangerous scoring of new rules from the start)
> > 02-Mai-2016 00:12:34: SpamAssassin: No update available
>  03-Mai-2016 01:55:05: SpamAssassin: No update available
>  04-Mai-2016 00:43:33: SpamAssassin: No update available
>  05-Mai-2016 01:48:15: SpamAssassin: Update processed successfully
>  06-Mai-2016 00:53:17: SpamAssassin: No update available
>  07-Mai-2016 01:21:23: SpamAssassin: No update available
>  08-Mai-2016 01:38:23: SpamAssassin: No update available
>  09-Mai-2016 00:02:56: SpamAssassin: No update available
>  10-Mai-2016 01:10:29: SpamAssassin: No update available
>  11-Mai-2016 00:55:46: SpamAssassin: No update available
>  12-Mai-2016 00:21:17: SpamAssassin: Update processed successfully
>  13-Mai-2016 00:33:31: SpamAssassin: No update available

 Perhaps you could help with that by participating in masscheck. You seem
 to get a lot of FPs on base rules; contributing masscheck results on
 your ham would reduce those

i can't rsync customer mails to a 3rd party

You don't have to. You run the masscheck locally and only upload the rule hit results. I upload my corpora because they are just my email and are thus tiny.

If you select your corpora filenames properly, no information should leak.

if that would be based on some webervice where you just feed local samples and only give the rules which hitted and spam/ham flag out it would be somehow possible

How would a webservice be better? That would still be sending customer emails to a third party for processing.

especially you would not have much from the bayes-samples because they would trigger all sort of wrong rules after strip most headers and and a generic received header (which seems to be needed by the bayes-engine for whatever reason since it otherwise scores samples completly different)

Corpora with headers stripped does present a problem. The masscheck corpora should be complete as received.

--
 John Hardin KA7OHZ                    http://www.impsec.org/~jhardin/
 jhar...@impsec.org    FALaholic #11174     pgpk -a jhar...@impsec.org
 key: 0xB8732E79 -- 2D8C 34F4 6411 F507 136C  AF76 D822 E6E6 B873 2E79
-----------------------------------------------------------------------
  People think they're trading chaos for order [by ceding more and
  more power to the Government], but they're just trading normal
  human evil for the really dangerous organized kind of evil, the
  kind that simply does not give a shit. Only bureaucrats can give
  you true evil.                                     -- Larry Correia
-----------------------------------------------------------------------
 143 days since the first successful real return to launch site (SpaceX)

Reply via email to