On Fri, 13 May 2016, Reindl Harald wrote:
Am 13.05.2016 um 18:11 schrieb John Hardin:
On Fri, 13 May 2016, Reindl Harald wrote:
> the problem is blowing out such rules with such scores at all with a
> non working auto-QA (non-working in: no correction for days as well as
> dangerous scoring of new rules from the start)
>
> 02-Mai-2016 00:12:34: SpamAssassin: No update available
> 03-Mai-2016 01:55:05: SpamAssassin: No update available
> 04-Mai-2016 00:43:33: SpamAssassin: No update available
> 05-Mai-2016 01:48:15: SpamAssassin: Update processed successfully
> 06-Mai-2016 00:53:17: SpamAssassin: No update available
> 07-Mai-2016 01:21:23: SpamAssassin: No update available
> 08-Mai-2016 01:38:23: SpamAssassin: No update available
> 09-Mai-2016 00:02:56: SpamAssassin: No update available
> 10-Mai-2016 01:10:29: SpamAssassin: No update available
> 11-Mai-2016 00:55:46: SpamAssassin: No update available
> 12-Mai-2016 00:21:17: SpamAssassin: Update processed successfully
> 13-Mai-2016 00:33:31: SpamAssassin: No update available
Perhaps you could help with that by participating in masscheck. You seem
to get a lot of FPs on base rules; contributing masscheck results on
your ham would reduce those
i can't rsync customer mails to a 3rd party
You don't have to. You run the masscheck locally and only upload the rule
hit results. I upload my corpora because they are just my email and are
thus tiny.
If you select your corpora filenames properly, no information should leak.
if that would be based on some webervice where you just feed local samples
and only give the rules which hitted and spam/ham flag out it would be
somehow possible
How would a webservice be better? That would still be sending customer
emails to a third party for processing.
especially you would not have much from the bayes-samples because they
would trigger all sort of wrong rules after strip most headers and and a
generic received header (which seems to be needed by the bayes-engine
for whatever reason since it otherwise scores samples completly
different)
Corpora with headers stripped does present a problem. The masscheck
corpora should be complete as received.
--
John Hardin KA7OHZ http://www.impsec.org/~jhardin/
jhar...@impsec.org FALaholic #11174 pgpk -a jhar...@impsec.org
key: 0xB8732E79 -- 2D8C 34F4 6411 F507 136C AF76 D822 E6E6 B873 2E79
-----------------------------------------------------------------------
People think they're trading chaos for order [by ceding more and
more power to the Government], but they're just trading normal
human evil for the really dangerous organized kind of evil, the
kind that simply does not give a shit. Only bureaucrats can give
you true evil. -- Larry Correia
-----------------------------------------------------------------------
143 days since the first successful real return to launch site (SpaceX)