Re: training bayes and newsletters

2024-10-16 Thread Greg Troxel
I think you are missing that a particular newsletter is not intrinsically ham or spam. It is ham if the user has subscribed, and spam if they have not affirmatively subscribed. I have seen the very same content arrive at my mailserver for 2 users. For one it is ham and the other it is spam. Ther
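The point above, that the same newsletter is ham for one recipient and spam for another, can be sketched as a per-recipient labelling step before training. This is only an illustration: the `subscriptions` mapping, the user names, and the `training_label` helper are hypothetical and not part of SpamAssassin.

```python
def training_label(user, newsletter, subscriptions):
    """Return 'ham' if this user affirmatively subscribed to the newsletter, else 'spam'.

    `subscriptions` is a hypothetical mapping of user -> set of newsletters
    the user has opted in to; it stands in for whatever record a site keeps.
    """
    return "ham" if newsletter in subscriptions.get(user, set()) else "spam"


# Hypothetical data: identical content, two recipients, two different labels.
subscriptions = {"alice": {"example-weekly"}}
labels = {
    user: training_label(user, "example-weekly", subscriptions)
    for user in ("alice", "bob")
}
```

Under this assumption the content alone never decides the label, which is why a shared, site-wide Bayes database has trouble with newsletters.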

Re: training bayes and newsletters

2024-10-16 Thread Bill Cole
On 2024-10-15 at 22:05:07 UTC-0400 (Tue, 15 Oct 2024 22:05:07 -0400) Alex is rumored to have said: I can imagine the newsletter template is somewhat common, but does bayes have any ability to distinguish a junk newsletter from a legitimate newsletter? Not if it has never seen either of them.

RE: training bayes and newsletters

2024-10-16 Thread Marc
> I can imagine the newsletter template is somewhat common, but does bayes > have any ability to distinguish a junk newsletter from a legitimate > newsletter? How can bayes, if you also can't? My advice would be to mark e.g. everything from mailchimp and then whitelist what you indeed agreed to re

Re: training bayes and newsletters

2024-10-15 Thread Axb
On 10/16/24 04:05, Alex wrote: Would I benefit from training known trustworthy newsletters such as ham? Yes, you would.

training bayes and newsletters

2024-10-15 Thread Alex
Hi, I've just retrained my bayes database (stored in SQL) with 10k hams and about 6k spams. I tried to make sure there were no newsletters in either corpus, but some emails present as newsletters but really are spam. However, many legitimate newsletters are hitting BAYES_99 even though I haven't t

Re: SPAM-DETECTOR Re: Tips on training bayes?

2024-09-19 Thread natan
On 18.09.2024 at 16:29, Matus UHLAR - fantomas wrote: On 18.09.24 16:19, natan wrote: I was very disappointed with spamassassin 4.x because it started to grow /var/lib/amavis/tmp/ amavis should clean this itself. which amavis version do you have installed? did you tune it anyhow? amavisd-

Re: Tips on training bayes?

2024-09-19 Thread Bill Cole
On 2024-09-17 at 16:29:52 UTC-0400 (Tue, 17 Sep 2024 16:29:52 -0400) Alex is rumored to have said: It is up to the user, ie you, what is and what is not spam. Well, yes, and no. Of course it's my own system and I can define these terms however I wish. I'm also familiar with the need to i

Re: Tips on training bayes?

2024-09-18 Thread Greg Troxel
Alex writes: > It's only these few types of messages that are very subjective and > experience from the broader open source community would be appreciated. > > If it has a legitimate unsubscribe link, does that make it ham? > > What criteria do you use to determine "spamminess/haminess of EVERY >

Re: Tips on training bayes?

2024-09-18 Thread Benny Pedersen
Jared Hall via users wrote on 2024-09-18 20:08: On Deb-based distros, you can add this in /etc/amavis/conf.d/50-user under the $max_servers parameter. also remember it's safe to use tmpfs for tmp dir in amavisd no joke

Re: Tips on training bayes?

2024-09-18 Thread Jared Hall via users
On 9/18/2024 10:19 AM, natan wrote: Hi I was very disappointed with spamassassin 4.x because it started to grow /var/lib/amavis/tmp/ With SA 3.4.X - on average 100MB and it deletes on the fly With SA 4.X - on average 2-6GB and I had to do a quick fix: 59 23 * * * root find /var/lib/amavis/tmp/

Re: Tips on training bayes?

2024-09-18 Thread Benny Pedersen
natan wrote on 2024-09-18 16:36: On 18.09.2024 at 16:30, Reindl Harald (privat) wrote: who reply here ? :) don't blame SA when a blind man can see that your problem is on the Amavis side - why do one need Amavis to begin with when there is SA and spamass-milter yes yes everyone know

Re: Tips on training bayes?

2024-09-18 Thread natan
On 18.09.2024 at 16:30, Reindl Harald (privat) wrote: On 18.09.24 at 16:19, natan wrote: Hi I was very disappointed with spamassassin 4.x because it started to grow /var/lib/amavis/tmp/ With SA 3.4.X - on average 100MB and it deletes on the fly With SA 4.X - on average 2-6GB and I had t

Re: Tips on training bayes?

2024-09-18 Thread Matus UHLAR - fantomas
On 18.09.24 16:19, natan wrote: I was very disappointed with spamassassin 4.x because it started to grow /var/lib/amavis/tmp/ amavis should clean this itself. which amavis version do you have installed? did you tune it anyhow? Did you enable and configure extracttext plugin? Because that one m

Re: Tips on training bayes?

2024-09-18 Thread natan
Hi I was very disappointed with spamassassin 4.x because it started to grow /var/lib/amavis/tmp/ With SA 3.4.X - on average 100MB and it deletes on the fly With SA 4.X - on average 2-6GB and I had to do a quick fix: 59 23 * * * root find /var/lib/amavis/tmp/ -mtime +0 -delete; On 18.09.202
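The nightly `find /var/lib/amavis/tmp/ -mtime +0 -delete` workaround quoted above removes entries last modified at least 24 hours ago. A minimal Python sketch of the same idea follows; the `remove_stale_files` name is an assumption for illustration, and unlike `find ... -delete` it only removes regular files and leaves directories in place. Other replies in the thread note that amavis should normally clean this directory itself, so this is a stopgap, not a recommended setup.

```python
import os
import time
from pathlib import Path


def remove_stale_files(root, max_age_seconds=24 * 60 * 60, now=None):
    """Delete regular files under `root` whose mtime is at least `max_age_seconds` old.

    Returns the list of removed paths. Directories are left untouched.
    """
    now = time.time() if now is None else now
    removed = []
    # Materialise the listing first so deletions do not disturb the walk.
    for path in list(Path(root).rglob("*")):
        if path.is_file() and now - path.stat().st_mtime >= max_age_seconds:
            path.unlink()
            removed.append(path)
    return removed
```

It would be called with the directory from the post, for example `remove_stale_files("/var/lib/amavis/tmp/")`, from whatever scheduler replaces the cron line.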

Re: Tips on training bayes?

2024-09-18 Thread Matus UHLAR - fantomas
On 18.09.24 13:42, Grega via users wrote: Right now in SA 4.0.1 bayes at least for me is really challenging to train and set up. I had good trained DB from past V3 install, and it behaved really odd. I trained it on new set of mails 3000 spam and 3000 ham (HAND PICKED mail it was PAIN) and I

Re: Tips on training bayes?

2024-09-18 Thread Grega via users
on training bayes? It is up to the user, ie you, what is and what is not spam. Well, yes, and no. Of course it's my own system and I can define these terms however I wish. I'm also familiar with the need to investigate every message - perhaps I should have made that clear initially.

Re: Tips on training bayes?

2024-09-17 Thread Alex
> > > It is up to the user, ie you, what is and what is not spam. > Well, yes, and no. Of course it's my own system and I can define these terms however I wish. I'm also familiar with the need to investigate every message - perhaps I should have made that clear initially. It's only these few typ

Re: Tips on training bayes?

2024-09-17 Thread Benny Pedersen
Jared Hall via users wrote on 2024-09-17 08:15: On 9/16/2024 8:48 PM, Alex wrote: Hi, Now that I'm using SA4, and my bayes database is quite old, I'd like to retrain it with new ham and spam. I hoped someone had some pointers on some of the gray area and what you consider to be spam and ham.

Re: Tips on training bayes?

2024-09-16 Thread Jared Hall via users
On 9/16/2024 8:48 PM, Alex wrote: Hi, Now that I'm using SA4, and my bayes database is quite old, I'd like to retrain it with new ham and spam. I hoped someone had some pointers on some of the gray area and what you consider to be spam and ham. Are reliable newsletters, like those from, sa

Tips on training bayes?

2024-09-16 Thread Alex
Hi, Now that I'm using SA4, and my bayes database is quite old, I'd like to retrain it with new ham and spam. I hoped someone had some pointers on some of the gray area and what you consider to be spam and ham. Are reliable newsletters, like those from, say, a trusted news source where the user op

Re: training bayes database

2018-05-16 Thread Alex Woick
David B Funk wrote on 10.05.2018 at 20:23: On Thu, 10 May 2018, John Hardin wrote: On Thu, 10 May 2018, Matthew Broadhead wrote: On 09/05/18 20:43, David Jones wrote: On 05/09/2018 01:29 PM, Matthew Broadhead wrote: On 09/05/18 16:37, Reindl Harald wrote: quoting URIBL_BLOCKED is a joke

Re: training bayes database

2018-05-10 Thread David B Funk
On Thu, 10 May 2018, John Hardin wrote: On Thu, 10 May 2018, Matthew Broadhead wrote: On 09/05/18 20:43, David Jones wrote: On 05/09/2018 01:29 PM, Matthew Broadhead wrote: On 09/05/18 16:37, Reindl Harald wrote: quoting URIBL_BLOCKED is a joke - setup a *recursion* *non-forwarding* namese

Re: training bayes database

2018-05-10 Thread John Hardin
On Thu, 10 May 2018, Matthew Broadhead wrote: On 09/05/18 20:43, David Jones wrote: On 05/09/2018 01:29 PM, Matthew Broadhead wrote: On 09/05/18 16:37, Reindl Harald wrote: On 09.05.2018 at 16:28, Matthew Broadhead wrote: it looks like it is working. so maybe it is just not flagging or mo

Re: training bayes database

2018-05-10 Thread Reio Remma
On 10.05.18 15:23, David Jones wrote: On 05/10/2018 07:12 AM, Reio Remma wrote: On 10.05.18 15:08, David Jones wrote: On 05/10/2018 07:02 AM, Reio Remma wrote: On a slightly related note. We're running a PFSense firewall with DNS Forwarder (dnsmasq) in front of our mail server. From what I've

Re: training bayes database

2018-05-10 Thread David Jones
On 05/10/2018 07:12 AM, Reio Remma wrote: On 10.05.18 15:08, David Jones wrote: On 05/10/2018 07:02 AM, Reio Remma wrote: On 10.05.18 14:58, Matus UHLAR - fantomas wrote: On 09.05.2018 at 16:28, Matthew Broadhead wrote: i guess my dns is set to use my isp's dns server. do i need to set up d

Re: training bayes database

2018-05-10 Thread Matus UHLAR - fantomas
On 09.05.2018 at 16:28, Matthew Broadhead wrote: i guess my dns is set to use my isp's dns server. do i need to set up dns relay on my machine so it comes from my ip? there is no way we send more than 500k emails from our domain so i should qualify for the free lookup? On 09/05/18 20:43,

Re: training bayes database

2018-05-10 Thread Reio Remma
On 10.05.18 15:08, David Jones wrote: On 05/10/2018 07:02 AM, Reio Remma wrote: On 10.05.18 14:58, Matus UHLAR - fantomas wrote: On 09.05.2018 at 16:28, Matthew Broadhead wrote: i guess my dns is set to use my isp's dns server. do i need to set up dns relay on my machine so it comes from my

Re: training bayes database

2018-05-10 Thread David Jones
On 05/10/2018 07:02 AM, Reio Remma wrote: On 10.05.18 14:58, Matus UHLAR - fantomas wrote: On 09.05.2018 at 16:28, Matthew Broadhead wrote: i guess my dns is set to use my isp's dns server. do i need to set up dns relay on my machine so it comes from my ip? there is no way we send more than

Re: training bayes database

2018-05-10 Thread Reio Remma
On 10.05.18 14:58, Matus UHLAR - fantomas wrote: On 09.05.2018 at 16:28, Matthew Broadhead wrote: i guess my dns is set to use my isp's dns server. do i need to set up dns relay on my machine so it comes from my ip? there is no way we send more than 500k emails from our domain so i should q

Re: training bayes database

2018-05-10 Thread Matus UHLAR - fantomas
On 09.05.2018 at 16:28, Matthew Broadhead wrote: i guess my dns is set to use my isp's dns server. do i need to set up dns relay on my machine so it comes from my ip? there is no way we send more than 500k emails from our domain so i should qualify for the free lookup? On 09/05/18 20:43,

Re: training bayes database

2018-05-10 Thread Matthew Broadhead
On 09/05/18 20:43, David Jones wrote: On 05/09/2018 01:29 PM, Matthew Broadhead wrote: On 09/05/18 16:37, Reindl Harald wrote: On 09.05.2018 at 16:28, Matthew Broadhead wrote: it looks like it is working. so maybe it is just not flagging or moving the spam? in a different post you showed t

Re: training bayes database

2018-05-09 Thread David Jones
On 05/09/2018 01:29 PM, Matthew Broadhead wrote: On 09/05/18 16:37, Reindl Harald wrote: On 09.05.2018 at 16:28, Matthew Broadhead wrote: it looks like it is working. so maybe it is just not flagging or moving the spam? in a different post you showed this status header which *clearly* shows

Re: training bayes database

2018-05-09 Thread Matthew Broadhead
On 09/05/18 16:37, Reindl Harald wrote: On 09.05.2018 at 16:28, Matthew Broadhead wrote: it looks like it is working. so maybe it is just not flagging or moving the spam? in a different post you showed this status header which *clearly* shows bayes is working - bayes alone doesn't flag, the tot

Re: training bayes database

2018-05-09 Thread Matthew Broadhead
On 09/05/18 16:37, Reindl Harald wrote: On 09.05.2018 at 16:28, Matthew Broadhead wrote: it looks like it is working. so maybe it is just not flagging or moving the spam? in a different post you showed this status header which *clearly* shows bayes is working - bayes alone doesn't flag, the tot

Re: training bayes database

2018-05-09 Thread John Hardin
On Wed, 9 May 2018, Reio Remma wrote: On 9 May 2018, at 18:33, John Hardin wrote: Also: On Wed, 9 May 2018, Matthew Broadhead wrote: your message has X-Spam-Status: No, score=-18.15 tagged_above=-999 required=6.2 Setting the threshold higher will result in more spam getting through. The

Re: training bayes database

2018-05-09 Thread Reio Remma
> On 9 May 2018, at 18:33, John Hardin wrote: > > Also: > >> On Wed, 9 May 2018, Matthew Broadhead wrote: >> >> your message has >> >> X-Spam-Status: No, score=-18.15 tagged_above=-999 required=6.2 > > Setting the threshold higher will result in more spam getting through. The > scores calc

Re: training bayes database

2018-05-09 Thread John Hardin
Also: On Wed, 9 May 2018, Matthew Broadhead wrote: your message has X-Spam-Status: No, score=-18.15 tagged_above=-999 required=6.2 Setting the threshold higher will result in more spam getting through. The scores calculated by the masscheck processes are based on the assumption that the th
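To make the threshold point concrete, the quoted `X-Spam-Status` value can be split into its score and required threshold. The sketch below is an illustration only: `parse_spam_status` and `would_flag` are hypothetical helper names, and the regular expression covers just the header shape quoted above, not every variant a filter may emit.

```python
import re

# Matches e.g. "No, score=-18.15 tagged_above=-999 required=6.2"
STATUS_RE = re.compile(
    r"^(Yes|No),\s*score=(-?\d+(?:\.\d+)?)\b.*?\brequired=(-?\d+(?:\.\d+)?)"
)


def parse_spam_status(value):
    """Extract the verdict, score and required threshold from an X-Spam-Status value."""
    match = STATUS_RE.search(value)
    if match is None:
        raise ValueError(f"unrecognised X-Spam-Status value: {value!r}")
    return {
        "flagged": match.group(1) == "Yes",
        "score": float(match.group(2)),
        "required": float(match.group(3)),
    }


def would_flag(score, required):
    """A message is tagged as spam once its total score reaches the threshold."""
    return score >= required


status = parse_spam_status("No, score=-18.15 tagged_above=-999 required=6.2")
```

With a threshold of 6.2 instead of the usual 5, any message scoring between 5 and 6.2 passes untagged, which is the effect described in the reply.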

Re: training bayes database

2018-05-09 Thread John Hardin
On Wed, 9 May 2018, Matthew Broadhead wrote: [root@ns1 ~]# sudo -H -u amavis bash -c '/usr/bin/sa-learn --dump magic' 0.000  0  3  0  non-token data: bayes db version 0.000  0  32225  0  non-token data: nspam 0.000  0 440420  0  non

Re: training bayes database

2018-05-09 Thread Matthew Broadhead
On 09/05/18 16:03, Reio Remma wrote: On 09.05.18 16:59, Matthew Broadhead wrote: setting log_level and sa_debug in /etc/amavisd/amavisd.conf didn't seem to make any difference. should i be doing it in /etc/mail/spamassassin/local.cf? See if $sa_debug=1 works (for full debug)? (and restart ama

Re: training bayes database

2018-05-09 Thread Reio Remma
On 09.05.18 16:59, Matthew Broadhead wrote: setting log_level and sa_debug in /etc/amavisd/amavisd.conf didn't seem to make any difference. should i be doing it in /etc/mail/spamassassin/local.cf? See if $sa_debug=1 works (for full debug)? (and restart amavisd). Reio ok now i am getting a lot

Re: training bayes database

2018-05-09 Thread Matthew Broadhead
On 09/05/18 15:48, Reio Remma wrote: On 09.05.18 16:33, Matthew Broadhead wrote: On 08/05/18 21:53, Reio Remma wrote: On 08.05.2018 22:08, John Hardin wrote: On Tue, 8 May 2018, Matthew Broadhead wrote: system setup centos-release-7-4.1708.el7.centos.x86_64, spamassassin-3.4.0-2.el7.x86_64,

Re: training bayes database

2018-05-09 Thread Reio Remma
On 09.05.18 16:33, Matthew Broadhead wrote: On 08/05/18 21:53, Reio Remma wrote: On 08.05.2018 22:08, John Hardin wrote: On Tue, 8 May 2018, Matthew Broadhead wrote: system setup centos-release-7-4.1708.el7.centos.x86_64, spamassassin-3.4.0-2.el7.x86_64, amavisd-new-2.11.0-3.el7.noarch /etc

Re: training bayes database

2018-05-09 Thread Matthew Broadhead
On 08/05/18 21:53, Reio Remma wrote: On 08.05.2018 22:08, John Hardin wrote: On Tue, 8 May 2018, Matthew Broadhead wrote: system setup centos-release-7-4.1708.el7.centos.x86_64, spamassassin-3.4.0-2.el7.x86_64, amavisd-new-2.11.0-3.el7.noarch /etc/mail/spamassassin/local.cf: required_hits 5

Re: training bayes database

2018-05-09 Thread Matthew Broadhead
On 09/05/18 09:09, Reio Remma wrote: On 09.05.18 9:57, Matthew Broadhead wrote: BAYES_00=-1.9 I've personally set *bayes_sql_override_username = amavis* in my local.cf If at all possible, run amavisd with SA bayes debug to see if/how it's using the database. Good luck, Reio Thanks Reio

Re: training bayes database

2018-05-09 Thread Reio Remma
On 09.05.18 9:57, Matthew Broadhead wrote: BAYES_00=-1.9 I've personally set *bayes_sql_override_username = amavis* in my local.cf If at all possible, run amavisd with SA bayes debug to see if/how it's using the database. Good luck, Reio

Re: training bayes database

2018-05-08 Thread Matthew Broadhead
(1) [root@ns1 ~]# sudo -H -u amavis bash -c '/usr/bin/sa-learn --dump magic' 0.000  0  3  0  non-token data: bayes db version 0.000  0  32225  0  non-token data: nspam 0.000  0 440420  0  non-token data: nham 0.000  0 159
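The `sa-learn --dump magic` output quoted above carries the learned message counts in its `nspam` and `nham` lines. A small sketch for pulling those counters out of captured output is shown here; `parse_dump_magic` is a hypothetical helper, and the sample text reuses only the three lines visible in the post, with the column layout assumed from that excerpt.

```python
import re

# Columns as they appear in the quoted output: probability, a count, the
# value of interest, an atime field, then "non-token data: <name>".
MAGIC_LINE = re.compile(
    r"^\s*\S+\s+\S+\s+(\d+)\s+\S+\s+non-token data:\s*(.+?)\s*$"
)


def parse_dump_magic(text):
    """Return a dict mapping each 'non-token data' name to its integer value."""
    stats = {}
    for line in text.splitlines():
        match = MAGIC_LINE.match(line)
        if match:
            stats[match.group(2)] = int(match.group(1))
    return stats


SAMPLE = """\
0.000          0          3          0  non-token data: bayes db version
0.000          0      32225          0  non-token data: nspam
0.000          0     440420          0  non-token data: nham
"""

stats = parse_dump_magic(SAMPLE)
ham_per_spam = stats["nham"] / stats["nspam"]
```

The ham-to-spam ratio computed this way gives a quick view of how lopsided the trained corpus is.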

Re: training bayes database

2018-05-08 Thread John Hardin
On Tue, 8 May 2018, Reio Remma wrote: On 08.05.2018 22:08, John Hardin wrote: On Tue, 8 May 2018, Matthew Broadhead wrote: system setup centos-release-7-4.1708.el7.centos.x86_64, spamassassin-3.4.0-2.el7.x86_64, amavisd-new-2.11.0-3.el7.noarch /etc/mail/spamassassin/local.cf: required_hits

Re: training bayes database

2018-05-08 Thread Reio Remma
On 08.05.2018 22:08, John Hardin wrote: On Tue, 8 May 2018, Matthew Broadhead wrote: system setup centos-release-7-4.1708.el7.centos.x86_64, spamassassin-3.4.0-2.el7.x86_64, amavisd-new-2.11.0-3.el7.noarch /etc/mail/spamassassin/local.cf: required_hits 5 report_safe 0 rewrite_header Subject [

Re: training bayes database

2018-05-08 Thread John Hardin
On Tue, 8 May 2018, Matthew Broadhead wrote: system setup centos-release-7-4.1708.el7.centos.x86_64, spamassassin-3.4.0-2.el7.x86_64, amavisd-new-2.11.0-3.el7.noarch /etc/mail/spamassassin/local.cf: required_hits 5 report_safe 0 rewrite_header Subject [SPAM] use_bayes  1 bayes_auto_le

training bayes database

2018-05-08 Thread Matthew Broadhead
system setup centos-release-7-4.1708.el7.centos.x86_64, spamassassin-3.4.0-2.el7.x86_64, amavisd-new-2.11.0-3.el7.noarch /etc/mail/spamassassin/local.cf: required_hits 5 report_safe 0 rewrite_header Subject [SPAM] use_bayes  1 bayes_auto_learn   1 bayes_auto_expire  1 # Store bayesian

Re: Training Bayes with BAYES_999 Mail

2015-10-02 Thread Reindl Harald
On 02.10.2015 at 19:15, Andrew Davidson wrote: I'm not an expert on the mechanics of Bayes so I'm wondering how valuable it is to continue training with collected spam that is properly tagged with BAYES_999. Does that help to reinforce the logic or is it overly focusing the database on emai

Re: Training Bayes with BAYES_999 Mail

2015-10-02 Thread Matus UHLAR - fantomas
On 02.10.15 13:15, Andrew Davidson wrote: I'm not an expert on the mechanics of Bayes so I'm wondering how valuable it is to continue training with collected spam that is properly tagged with BAYES_999. Does that help to reinforce the logic or is it overly focusing the database on emails it can

Training Bayes with BAYES_999 Mail

2015-10-02 Thread Andrew Davidson
I'm not an expert on the mechanics of Bayes so I'm wondering how valuable it is to continue training with collected spam that is properly tagged with BAYES_999. Does that help to reinforce the logic or is it overly focusing the database on emails it can already detect? Should I only be training it

Re: General rules for training bayes

2014-10-23 Thread Matus UHLAR - fantomas
but do you train mail from mass-mailers? Staples? Facebook? Banks? why not? of course I train if I want such mail to be properly classified later. On 22.10.14 14:36, Alex Regan wrote: The problem I've had with doing this is that it's often so difficult to determine which bulk message shoul

Re: General rules for training bayes

2014-10-22 Thread Alex Regan
Hi, I'm having some trouble with my bayes database, and thought it would be a good time to just rebuild it. I'm wondering if anyone has any good suggestions for the type of mail that should be used for training. be careful about forwarded mail, if possible. if you get many spam from your old a

Re: General rules for training bayes

2014-10-22 Thread RW
On Wed, 22 Oct 2014 14:44:24 +0200 Reindl Harald wrote: > > On 22.10.2014 at 14:30, Benny Pedersen wrote: > > On October 22, 2014 1:30:44 PM Matus UHLAR - fantomas > > wrote: > > > >> oh, does BAYES take care about these? > >> we are still talking about manually feeding BAYES, aren't we? > > >

Re: General rules for training bayes

2014-10-22 Thread Benny Pedersen
On October 22, 2014 3:05:56 PM Matus UHLAR - fantomas wrote: >>On October 22, 2014 1:30:44 PM Matus UHLAR - fantomas >> wrote: >>>oh, does BAYES take care about these? >>>we are still talking about manually feeding BAYES, aren't we? >On 22.10.2014 at 14:30, Benny Pedersen wrote: >>Sorry, yes

Re: General rules for training bayes

2014-10-22 Thread Matus UHLAR - fantomas
On October 22, 2014 1:30:44 PM Matus UHLAR - fantomas wrote: oh, does BAYES take care about these? we are still talking about manually feeding BAYES, aren't we? On 22.10.2014 at 14:30, Benny Pedersen wrote: Sorry, yes, bayes can ignore all headers if one doesn't want it to track origin sende

Re: General rules for training bayes

2014-10-22 Thread Reindl Harald
On 22.10.2014 at 14:30, Benny Pedersen wrote: On October 22, 2014 1:30:44 PM Matus UHLAR - fantomas wrote: oh, does BAYES take care about these? we are still talking about manually feeding BAYES, aren't we? Sorry, yes, bayes can ignore all headers if one doesn't want it to track origin send

Re: General rules for training bayes

2014-10-22 Thread Benny Pedersen
On October 22, 2014 1:30:44 PM Matus UHLAR - fantomas wrote: oh, does BAYES take care about these? we are still talking about manually feeding BAYES, aren't we? Sorry, yes, bayes can ignore all headers if one doesn't want it to track origin senders or IPs

Re: General rules for training bayes

2014-10-22 Thread RW
On Wed, 22 Oct 2014 13:30:44 +0200 Matus UHLAR - fantomas wrote: > >>be careful about forwarded mail, if possible. if you get many spam > >>from your old account, it may start to classify ALL mail forwarded > >>through that > > On 22.10.14 13:15, Benny Pedersen wrote: > >This only correct if inte

Re: General rules for training bayes

2014-10-22 Thread Matus UHLAR - fantomas
be careful about forwarded mail, if possible. if you get many spam from your old account, it may start to classify ALL mail forwarded through that On 22.10.14 13:15, Benny Pedersen wrote: This is only correct if internal networks and/or trusted networks are not configured correctly oh, does BAYES t

Re: General rules for training bayes

2014-10-22 Thread Reindl Harald
On 22.10.2014 at 13:15, Benny Pedersen wrote: On October 22, 2014 1:08:45 PM Matus UHLAR - fantomas: be careful about forwarded mail, if possible. if you get many spam from your old account, it may start to classify ALL mail forwarded through that This is only correct if internal networks and

Re: General rules for training bayes

2014-10-22 Thread Benny Pedersen
On October 22, 2014 1:08:45 PM Matus UHLAR - fantomas wrote: be careful about forwarded mail, if possible. if you get many spam from your old account, it may start to classify ALL mail forwarded through that This is only correct if internal networks and/or trusted networks are not configured co

Re: General rules for training bayes

2014-10-22 Thread Matus UHLAR - fantomas
On 21.10.14 21:29, Alex Regan wrote: I'm having some trouble with my bayes database, and thought it would be a good time to just rebuild it. I'm wondering if anyone has any good suggestions for the type of mail that should be used for training. be careful about forwarded mail, if possible. if

Re: General rules for training bayes

2014-10-22 Thread Axb
On 10/22/2014 03:29 AM, Alex Regan wrote: I have the database in a replicated mysql database for now. I'd like to go to redis, but it's not quite ready for distributed configurations, correct? What do you mean by "distributed configurations"? - many clients querying a central Redis DB? - real

General rules for training bayes

2014-10-21 Thread Alex Regan
Hi all, I'm having some trouble with my bayes database, and thought it would be a good time to just rebuild it. I'm wondering if anyone has any good suggestions for the type of mail that should be used for training. I understand individually-crafted emails would make the best ham, but do you

Re: Training Bayes On A Gateway

2014-10-09 Thread Ted Mittelstaedt
I collect spam this way, periodically I scan the mail logs looking for "unknown user" entries and sort the results - usernames/email addresses that are repeatedly being "guessed" get an alias entry added that forwards the spam to a spam mailbox. I have about 20 of these now that are aliased

Re: Training Bayes On A Gateway

2014-10-09 Thread Jason W.
On Thu, Oct 9, 2014 at 4:14 PM, John Hardin wrote: > On Thu, 9 Oct 2014, John Traweek CCNA, Sec+ wrote: > > I've built a gateway server using sa-exim to filter email for our >> > > This topic comes up fairly regularly. Did you search the list archives on > terms like "exchange bayes" ? > Since

Re: Training Bayes On A Gateway

2014-10-09 Thread John Hardin
On Thu, 9 Oct 2014, John Traweek CCNA, Sec+ wrote: I've built a gateway server using sa-exim to filter email for our corporate Microsoft Exchange environment. It's working pretty good, but I have Bayes turned off due to the fact that I am unsure on how to train it in this type of environment.

Re: Training Bayes On A Gateway

2014-10-09 Thread Reindl Harald
On 09.10.2014 at 21:43, John Traweek CCNA, Sec+ wrote: I’ve built a gateway server using sa-exim to filter email for our corporate Microsoft Exchange environment. It’s working pretty good, but I have Bayes turned off due to the fact that I am unsure on how to train it in this type of environme

Training Bayes On A Gateway

2014-10-09 Thread John Traweek CCNA, Sec+
I've built a gateway server using sa-exim to filter email for our corporate Microsoft Exchange environment. It's working pretty good, but I have Bayes turned off due to the fact that I am unsure on how to train it in this type of environment. Has someone written a how to article on how to efficie

Re: Training Bayes on outbound mail

2011-01-28 Thread Karsten Bräckelmann
On Fri, 2011-01-28 at 18:10 +, Dominic Benson wrote: > Recently, in order to balance the ham/spam ratio given to sa-learn, I > have started to pass mail submitted by authenticated users to sa-learn > --ham. > The thinking here is that users would generally want to receive mail > that they se

Re: Training Bayes on outbound mail

2011-01-28 Thread Benny Pedersen
On Fri, 28 Jan 2011 18:10:08 +, Dominic Benson wrote: > The approach, if anyone is interested, is to use an "unseen" Exim router > to pipe mail to sa-learn --ham using the pipe transport, on the > condition that an acl_m variable, set for authenticated users in > acl_check_rcpt, evaluates

Re: Training Bayes on outbound mail

2011-01-28 Thread David B Funk
On Fri, 28 Jan 2011, David F. Skoll wrote: > On Fri, 28 Jan 2011 18:10:08 + > Dominic Benson wrote: > > > Recently, in order to balance the ham/spam ratio given to sa-learn, I > > have started to pass mail submitted by authenticated users to > > sa-learn --ham. > > > I haven't seen any mentio

Re: Training Bayes on outbound mail

2011-01-28 Thread Dominic Benson
On 28 Jan 2011, at 18:39, Lawrence @ Rogers wrote: > On 28/01/2011 2:53 PM, David F. Skoll wrote: >> On Fri, 28 Jan 2011 18:10:08 + >> Dominic Benson wrote: >> >>> Recently, in order to balance the ham/spam ratio given to sa-learn, I >>> have started to pass mail submitted by authenticated

Re: Training Bayes on outbound mail

2011-01-28 Thread Lawrence @ Rogers
On 28/01/2011 2:53 PM, David F. Skoll wrote: On Fri, 28 Jan 2011 18:10:08 + Dominic Benson wrote: Recently, in order to balance the ham/spam ratio given to sa-learn, I have started to pass mail submitted by authenticated users to sa-learn --ham. I haven't seen any mention of this strategy

Re: Training Bayes on outbound mail

2011-01-28 Thread David F. Skoll
On Fri, 28 Jan 2011 18:10:08 + Dominic Benson wrote: > Recently, in order to balance the ham/spam ratio given to sa-learn, I > have started to pass mail submitted by authenticated users to > sa-learn --ham. > I haven't seen any mention of this strategy on-list or on the web, so > I'm inter

Training Bayes on outbound mail

2011-01-28 Thread Dominic Benson
Hi - Recently, in order to balance the ham/spam ratio given to sa-learn, I have started to pass mail submitted by authenticated users to sa-learn --ham. The thinking here is that users would generally want to receive mail that they send, and many messages will either be replies or replied to,

Re: Training bayes

2008-12-27 Thread Matus UHLAR - fantomas
On 27.12.08 22:49, JC Putter wrote: > I'd like to know if you can train bayes with *.msg format or must it be eml > format? I know that Outlook Express uses eml but Office Outlook uses msg > format? the message must be raw, plaintext message body as is transferred over the network. msg is afaik a b

Training bayes

2008-12-27 Thread JC Putter
I'd like to know if you can train bayes with *.msg format or must it be eml format? I know that Outlook Express uses eml but Office Outlook uses msg format?

Re: Training Bayes ham messages when they are sent out of the server

2007-02-06 Thread maillist
Philip Seccombe wrote: Does anyone have any ideas how I can get the emails back on the server, or keep a copy on the server to create a bayes database on? I thought of forwarding emails back, but then its a forwarded email and not the actual one which will mess up the database. I've no

Training Bayes ham messages when they are sent out of the server

2007-02-06 Thread Philip Seccombe
Hi there, Apologies if this has been answered, I cannot find on the web anything to say about this and not being a Linux guru I'm a little bewildered. Basically a previous technician setup SpamAssassin on a server for us and has since left the company on bad terms so is not able to support

Re: Training Bayes properly

2006-07-02 Thread jdow
ically). --Will -Original Message- From: jdow [mailto:[EMAIL PROTECTED] Sent: Friday, June 30, 2006 7:36 PM To: users@spamassassin.apache.org Subject: Re: Training Bayes properly From: "Stefan Jakobs" <[EMAIL PROTECTED]> Am Freitag, 30. Juni 2006 02:09 schrieb Rick

RE: Training Bayes properly

2006-07-02 Thread Will Nordmeyer
... I've got the sa-stats (and a modified version) that I run periodically). --Will -Original Message- From: jdow [mailto:[EMAIL PROTECTED] Sent: Friday, June 30, 2006 7:36 PM To: users@spamassassin.apache.org Subject: Re: Training Bayes properly From: "Stefan Jakobs" <[EM

Re: Training Bayes properly

2006-06-30 Thread jdow
From: "Stefan Jakobs" <[EMAIL PROTECTED]> On Friday, 30 June 2006 02:09, Rick Macdougall wrote: Hi, Hello, And my hit rates are For HAM RANK RULE NAME COUNT %OFRULES %OFMAIL %OFSPAM %OFHAM 1 BAYES_00 2281924.15 54.61 1.65 96.70 And SPAM RANK RULE NAME

RE: Training Bayes properly

2006-06-30 Thread Randal, Phil
[mailto:[EMAIL PROTECTED] Sent: 30 June 2006 00:45 To: users Subject: Training Bayes properly So it looks like I have to reset my Bayes and re-train it. I want to do it properly this time. I will be making sure I personally review every message that our users put into the spam folder

Re: Training Bayes properly

2006-06-30 Thread Stefan Jakobs
On Friday, 30 June 2006 02:09, Rick Macdougall wrote: > Hi, Hello, > And my hit rates are > > For HAM > RANK RULE NAME COUNT %OFRULES %OFMAIL %OFSPAM %OFHAM > 1 BAYES_00 2281924.15 54.61 1.65 96.70 > > And SPAM > RANK RULE NAME COUNT %OFRULES %OFMAIL %OFSPAM

Re: Training Bayes properly

2006-06-30 Thread Anthony Peacock
e. I also occasionally learn a handful of hams to keep them up to date. Loren - Original Message - *From:* Leigh Sharpe <mailto:[EMAIL PROTECTED]> *To:* users <mailto:users@spamassassin.apache.org> *Sent:* Thursday, June 29, 2006 4:45 PM *Su

Re: Training Bayes properly

2006-06-29 Thread jdow
From: "Rick Macdougall" <[EMAIL PROTECTED]> jdow wrote: I don't know if it's a good example of YMMV, I think both of our bayes are operating at respectable levels given the data they have to deal with. I may wish I could get better results but I really don't think it's possible in the enviro

Re: Training Bayes properly

2006-06-29 Thread Loren Wilton
pe To: users Sent: Thursday, June 29, 2006 4:45 PM Subject: Training Bayes properly So it looks like I have to reset my Bayes and re-train it. I want to do it properly this time. I will be making sure I personally review every message that our users put into the spam folder firs

Re: Training Bayes properly

2006-06-29 Thread Rick Macdougall
jdow wrote: From: "Rick Macdougall" <[EMAIL PROTECTED]> K! I bet you are running system wide Bayes for a very non-homogeneous collection of people. I've appended my figures (not the best I have seen but very good) below yours. Your BAYES_00 is better than mine only if you do not consider the

Re: Training Bayes properly

2006-06-29 Thread jdow
From: "Rick Macdougall" <[EMAIL PROTECTED]> Nigel Frankcom wrote: On Fri, 30 Jun 2006 09:45:07 +1000, "Leigh Sharpe" So it looks like I have to reset my Bayes and re-train it. I want to do it properly this time. I will be making sure I personally review every message that our users put into th

Re: Training Bayes properly

2006-06-29 Thread jdow
200 is OK. 2000 is enough. Over the years from 2.43 forward my entire spam and ham corpus contents amount to under 2000 each and Bayes is running remarkably smoothly for me. I am "tempted" to enable automatic learning to see what will happen. I'll take a snapshot of my Bayes first, though. (The "g

Re: Training Bayes properly

2006-06-29 Thread Rick Macdougall
Nigel Frankcom wrote: On Fri, 30 Jun 2006 09:45:07 +1000, "Leigh Sharpe" <[EMAIL PROTECTED]> wrote: So it looks like I have to reset my Bayes and re-train it. I want to do it properly this time. I will be making sure I personally review every message that our users put into the spam folder firs

Re: Training Bayes properly

2006-06-29 Thread Nigel Frankcom
On Fri, 30 Jun 2006 09:45:07 +1000, "Leigh Sharpe" <[EMAIL PROTECTED]> wrote: >So it looks like I have to reset my Bayes and re-train it. I want to do >it properly this time. I will be making sure I personally review every >message that our users put into the spam folder first, to make sure they >

Training Bayes properly

2006-06-29 Thread Leigh Sharpe
So it looks like I have to reset my Bayes and re-train it. I want to do it properly this time. I will be making sure I personally review every message that our users put into the spam folder first, to make sure they haven't put spam into the wrong folder. However, I have a couple of question