Re: how to make a custom ruleset

2009-03-05 Thread Adi Nugroho
On Thursday 05 March 2009 23:44:39 Benny Pedersen wrote: > ups > > header SELF_FROM From =~ /\...@my.address/i > header SELF_TO To =~ /\...@my.address/i > meta SELF (SELF_FROM && SELF_TO) > describe SELF Trap mail with forged sender the same as recipient > score SELF 3.0 I have tried above syntax

Re: Dealing with low scoring spam - tighter MTA integration

2009-03-05 Thread Kenneth Porter
--On Thursday, March 05, 2009 10:31 PM +0100 Andrzej Adam Filip wrote: I try hard to preach that SA methodology of creating "spam score" based on weighted tests *CAN* be applied at this point too. I would like too apply such test in milter (MIMEDefang) that uses SA anyway in my installation.

Re: how to make a custom ruleset

2009-03-05 Thread LuKreme
On Mar 5, 2009, at 7:28, Martin Gregorie wrote: On Thu, 2009-03-05 at 21:31 +0800, Adi Nugroho wrote: I found that a lot of spam is using recipient email address as the sender. (from a...@internux.co.id to a...@internux.co.id, or from i...@apache.org to i...@apache.org). The only disadvan

Re: Dealing with low scoring spam - tighter MTA integration

2009-03-05 Thread Andrzej Adam Filip
James Wilkinson wrote: > Andrzej Adam Filip wrote: >> At "RCPT TO:" stage there are available: >> * connecting client IP address (last mail hop) >> so big part of DNSBL and DNSWL tests *CAN* be used >> * envelope sender for SPF based tests >> * envelope sender and envelope recipient for auto wh

Re: Dealing with low scoring spam - tighter MTA integration

2009-03-05 Thread James Wilkinson
Andrzej Adam Filip wrote: > At "RCPT TO:" stage there are available: > * connecting client IP address (last mail hop) > so big part of DNSBL and DNSWL tests *CAN* be used > * envelope sender for SPF based tests > * envelope sender and envelope recipient for auto white/black listing > (producing

Re: Dealing with low scoring spam - tighter MTA integration

2009-03-05 Thread Andrzej Adam Filip
Kenneth Porter wrote: > --On Thursday, March 05, 2009 7:43 AM +0100 Andrzej Adam Filip > wrote: > >> What I would like to see is a option to make spam assassin to produce >> "weighted scores" based on subset of all tests capable to work on subset >> of the "final data" available *before* message

Re: Dealing with low scoring spam - tighter MTA integration

2009-03-05 Thread Kenneth Porter
--On Thursday, March 05, 2009 7:43 AM +0100 Andrzej Adam Filip wrote: What I would like to see is a option to make spam assassin to produce "weighted scores" based on subset of all tests capable to work on subset of the "final data" available *before* message headers&body are transfered in SMT

Re: 2 + 2 != 4 - Spamassassin needs a new paradigm

2009-03-05 Thread Justin Mason
On Thu, Mar 5, 2009 at 11:12, decoder wrote: > Justin Mason wrote: >> >> Thanks for doing this!  couple of q's: >> >> 1. I can offer a bigger ham/spam corpus if you'd like to test against >> that as well; >> corpora from multiple contributors can sometimes expose training set bias. >> > > That wou

Re: SpamAssassin Doesn't Appear to be working

2009-03-05 Thread Mark Martinec
Jason, > Got it tweaked the settings > set $sa_tag_level_deflt = ; Good. > and the header now shows... I feel better even if it was working before Thanks for figuring out the solution by yourself. The $sa_tag_level_deflt controls the spam score at which X-Spam-* header fields start to be

Re: 2 + 2 != 4 - Spamassassin needs a new paradigm

2009-03-05 Thread John Hardin
On Thu, 5 Mar 2009, decoder wrote: John Hardin wrote: Would there be any benefit to having an offline version - i.e. something that evaluates the log or a corpus to generate new meta rules, that could be added onto the default ruleset? For instance: cron @ 0200: sa_meta_eval > /etc/mail/s

Re: how to make a custom ruleset

2009-03-05 Thread Benny Pedersen
On Thu, March 5, 2009 17:31, John Hardin wrote: >> header SELF_FROM From =~ /\...@my.address/i >> header SELF_TO To =~ /\...@my.address/i > > Are you sure you want to give 1 point to each of those cases in > addition to whatever points the meta adds? it was not me that maked the rules, just edit

Re: 2 + 2 != 4 - Spamassassin needs a new paradigm

2009-03-05 Thread decoder
John Hardin wrote: Would there be any benefit to having an offline version - i.e. something that evaluates the log or a corpus to generate new meta rules, that could be added onto the default ruleset? For instance: cron @ 0200: sa_meta_eval > /etc/mail/spamassassin/metarules.cf /etc/init.d/spa

Re: 2 + 2 != 4 - Spamassassin needs a new paradigm

2009-03-05 Thread decoder
Marc Perkel wrote: I suppose what I was thinking was that you still used the SA result but added or subtracted from the SA result based on your SVM code, sort of the way bayes does. Or are you letting SVM make the final determination? At the moment, I am only using the SVM answer. What you fi

Re: how to make a custom ruleset

2009-03-05 Thread John Hardin
On Thu, 5 Mar 2009, Benny Pedersen wrote: header SELF_FROM From =~ /\...@my.address/i header SELF_TO To =~ /\...@my.address/i Are you sure you want to give 1 point to each of those cases in addition to whatever points the meta adds? If not, then they should be named __SELF_FROM and __SELF_T

Re: SpamAssassin Doesn't Appear to be working

2009-03-05 Thread JasonHirsh
Got it tweaked the settings set $sa_tag_level_deflt = ; and the header now shows... I feel better even if it was working before -- View this message in context: http://www.nabble.com/SpamAssassin-Doesn%27t-Appear-to-be-working-tp22341459p22355132.html Sent from the SpamAssassin - U

Re: Some emails pass spamassassin unprocessed

2009-03-05 Thread Monky
Karsten Bräckelmann-2 wrote: > > Mentioning some numbers is good, though too qualitative. How many mails > is that per day, in absolute numbers? > Maybe 1-10 a day in total. I have several email accounts there and it happens with all of them although not in a serious amount per account. Kars

Re: how to make a custom ruleset

2009-03-05 Thread Benny Pedersen
On Thu, March 5, 2009 16:27, Adi Nugroho wrote: > describe SELF Trap mail with forged sender the same as recipient > header SELF From =~ /\...@my.address/i > header SELF_TO To =~ /\...@my.address/i > meta SELF 5.0 ups header SELF_FROM From =~ /\...@my.address/i header SELF_TO To =~ /\..

Re: 2 + 2 != 4 - Spamassassin needs a new paradigm

2009-03-05 Thread Marc Perkel
decoder wrote: Marc Perkel wrote: Good work so far but sounds like you need to throw more data at it. Also even though you indicate "over 99% accuracy" can you break that down better? 99.9% is 10 times as accurate as 99%. What do you mean by more data? Of course, some additional data might

Re: how to make a custom ruleset

2009-03-05 Thread Adi Nugroho
On Thursday 05 March 2009 22:28:23 Martin Gregorie wrote: > describe SELF Trap mail with forged sender the same as recipient > header SELF From =~ /\...@my.address/i > header SELF To =~ /\...@my.address/i > meta SELF 5.0 Dear Martin, Thank you for the rule... I made a file self.cf in /et

Re: NOTICE: mail delivery status.

2009-03-05 Thread Matus UHLAR - fantomas
On 05.03.09 16:01, Geert Batsleer wrote: > I added this to my local.cf, is this syntax OK or will this block 'Club' > 'Casion' and 'Vegas' if used seperate? > > > header FROM_CASINO From:name =~ /\Vegas Club Casino\b/i > descrbibe FROM_CASINO Casino Club Casino filter 04/03/09 > score FRO

Re: NOTICE: mail delivery status.

2009-03-05 Thread Benny Pedersen
On Thu, March 5, 2009 16:01, Geert Batsleer wrote: > header FROM_CASINO From:name =~ /\Vegas Club Casino\b/i header FROM_CASINO From:name =~ /\bVegas Club Casino\b/i -- http://localhost/ 100% uptime and 100% mirrored :)

Re: NOTICE: mail delivery status.

2009-03-05 Thread Geert Batsleer
I added this to my local.cf, is this syntax OK or will this block 'Club' 'Casion' and 'Vegas' if used seperate? header FROM_CASINO From:name =~ /\Vegas Club Casino\b/i descrbibe FROM_CASINO Casino Club Casino filter 04/03/09 score FROM_CASINO 10.0 I got the following response from

Re: how to make a custom ruleset

2009-03-05 Thread Benny Pedersen
On Thu, March 5, 2009 14:31, Adi Nugroho wrote: > I found that a lot of spam is using recipient email address as the > sender. (from a...@internux.co.id to a...@internux.co.id, or from > i...@apache.org to i...@apache.org). all this happends on domains that have no spf and or testing spf in mta,

Re: how to make a custom ruleset

2009-03-05 Thread Daniel J McDonald
On Thu, 2009-03-05 at 21:31 +0800, Adi Nugroho wrote: > Dear all, > > I found that a lot of spam is using recipient email address as the sender. > (from a...@internux.co.id to a...@internux.co.id, or from i...@apache.org to > i...@apache.org). > > Since if we mail to our self, usually we have ve

Re: how to make a custom ruleset

2009-03-05 Thread Martin Gregorie
On Thu, 2009-03-05 at 21:31 +0800, Adi Nugroho wrote: > I found that a lot of spam is using recipient email address as the sender. > (from a...@internux.co.id to a...@internux.co.id, or from i...@apache.org to > i...@apache.org). > The only disadvantage is that you'll label test messages as spam.

how to make a custom ruleset

2009-03-05 Thread Adi Nugroho
Dear all, I found that a lot of spam is using recipient email address as the sender. (from a...@internux.co.id to a...@internux.co.id, or from i...@apache.org to i...@apache.org). Since if we mail to our self, usually we have very low score, I hope it is save to give a BIG score (probably 2 or

Re: 2 + 2 != 4 - Spamassassin needs a new paradigm

2009-03-05 Thread decoder
Marc Perkel wrote: Good work so far but sounds like you need to throw more data at it. Also even though you indicate "over 99% accuracy" can you break that down better? 99.9% is 10 times as accurate as 99%. What do you mean by more data? Of course, some additional data might help. One should

Re: SpamAssassin Doesn't Appear to be working

2009-03-05 Thread JasonHirsh
>> In this set up I am lead top believe that Amavisd-new handles the SA >> config >> but I did not see a process for spamd so i enabled in rc.conf. > > There is no need for a spamd process in this setup - think of amavisd > proces as an equivalent of spamd (in that it calls a SpamAssassin libra

Re: 2 + 2 != 4 - Spamassassin needs a new paradigm

2009-03-05 Thread decoder
Justin Mason wrote: Thanks for doing this! couple of q's: 1. I can offer a bigger ham/spam corpus if you'd like to test against that as well; corpora from multiple contributors can sometimes expose training set bias. That would be cool :) Is this corpus already processed by spamassassin (i

Re: Bye Bye Bayes

2009-03-05 Thread Matus UHLAR - fantomas
On 04.03.09 06:17, John Hardin wrote: > I used to have a couple of users who treated their Trash folder as > long-term read-message storage. After reading most messages they'd move > them to Trash, and _never_ _purge_ _it_. I couldn't break them of this > habit, even after purging their Trash fo

Re: 2 + 2 != 4 - Spamassassin needs a new paradigm

2009-03-05 Thread Justin Mason
On Thu, Mar 5, 2009 at 00:23, decoder wrote: > decoder wrote: >> Justin Mason wrote: >>> So you're volunteering to code it up, then? ;) >> >> I was planning to do at least some brainstorming+experiements as to what >> learning methods would seem suitable and how well the method performs, >> whenev