Re: Skipping RBL checks for internal servers

2015-03-22 Thread David F. Skoll
On Sun, 22 Mar 2015 12:44:26 -0400 Alex Regan wrote: [...] > So instead of trying to figure out the proper expiry period, you just > start over completely every two weeks? No, we use a two-week sliding window to construct our Bayes DB. We don't learn for two weeks and then dump everything; ra

Re: Skipping RBL checks for internal servers

2015-03-22 Thread Reindl Harald
Am 22.03.2015 um 17:44 schrieb Alex Regan: Would it be helpful to have something that graphs the data to monitor the effect of learning changes? Does something already exist? i am doing something similar recently by one per night iterate through all ham/spam smaples to get a overview how they

Re: Skipping RBL checks for internal servers

2015-03-22 Thread Alex Regan
Hi, I think it seldom pays to be too clever with Bayes. If (and this is a big if) you have a large enough sample of mail, in our experience it's better just to shovel it all into Bayes than to be selective about what you present to Bayes. The Bayes algorithms are usually pretty good at picking

Re: Skipping RBL checks for internal servers

2015-03-22 Thread Matus UHLAR - fantomas
On Sat, 21 Mar 2015 20:51:49 + RW wrote: The two calculations produce the same result when Ns2/Nh2 = (Ns2-Ns1)/(Nh2-Nh1) i.e. if spam and ham is being added in the same ratio that it occurs in the database. On 21.03.15 22:54, David F. Skoll wrote: Yup, that's correct; I got it wrong