RE: spam scores low (Sendmail + smtp-vilter + SA )

2006-01-16 Thread David B Funk
On Mon, 16 Jan 2006, Mike Sassaman wrote: > % spamassassin --lint shows no output, so I'm thinking that means no > problems in my local.cf. Good, 'spamassassin --lint' should show no outout, it ony barks when there's something wrong. Now 'spamassassin --lint -D' gives -tons- of output, but any er

Re: Bayes - how bad is a small ham corpus with a big spam corpus?

2006-01-16 Thread Robert Menschel
Hello srunschke, Monday, January 16, 2006, 2:33:42 AM, you wrote: sad> I'm currently trying to build up a new bayes DB here, ... sad> ... yet it poses a problem to build up the ham part. sad> ... Much of the inbound smtp mail either contains private or sad> confidential information, so I cannot u

Re: Gain an extra 25%! (was "Purging the Spamassassin Database")

2006-01-16 Thread Andrew Donkin
> ALTER TABLE bayes_vars MODIFY id SMALLINT UNSIGNED NOT NULL; Whoops - doing that will prevent any new users from being created in your Bayes database. Add auto_increment: ALTER TABLE bayes_vars MODIFY id SMALLINT UNSIGNED NOT NULL AUTO_INCREMENT; -- ___

Re: Pump and dump stock Blacklist?

2006-01-16 Thread ajc85
sounds like a business plan for turning underpants into profit. btw: if anybody is interested ive had an experimental c# implimentation of SpamD running about about a week now on Win32 with great performance on a 10mb global bayes database. If any interest is shown i might release some code.

Re: Pump and dump stock Blacklist?

2006-01-16 Thread Brian Godette
On Monday 16 January 2006 04:27 pm, [EMAIL PROTECTED] wrote: > Hi! > > > Im curious if there are any intitiatives to collect pump and dump stock > > symbols and names to check against incomming spam. I've looked around > > but have yet to find anything. I think some sort of database would be > >

Re: Gain an extra 25%! (was "Purging the Spamassassin Database")

2006-01-16 Thread Justin Mason
-BEGIN PGP SIGNED MESSAGE- Hash: SHA1 "M.S. Lucas" writes: > From: "Jason Frisvold" <[EMAIL PROTECTED]> > > On 1/15/06, Andrew Donkin <[EMAIL PROTECTED]> wrote: > > > If you have fewer than 65,000 accounts you could halve the size of the > > > id columns. > > > > > > That took 25% off my

Re: Pump and dump stock Blacklist?

2006-01-16 Thread raymond
Hi! Im curious if there are any intitiatives to collect pump and dump stock symbols and names to check against incomming spam. I've looked around but have yet to find anything. I think some sort of database would be nice to have to do lookups on and just have it check agains the body of the me

Pump and dump stock Blacklist?

2006-01-16 Thread Spam Ass
Hello everyone,Im curious if there are any intitiatives to collect pump and dump stock symbols and names to check against incomming spam.  I've looked around but have yet to find anything.  I think some sort of database would be nice to have to do lookups on and just have it check agains the body o

Re: Gain an extra 25%! (was "Purging the Spamassassin Database")

2006-01-16 Thread Chr. v. Stuckrad
On Mon, Jan 16, 2006 at 04:09:37PM +0100, M.S. Lucas wrote: > Could this be made a default with the small size of the id columns and a > note in the installation file for the big users? > There are more users of SA with less then 65k users then with more. Does it mean '65k is the largest User-Num

Re: Spamassassin & Bayes

2006-01-16 Thread Markus Braun
It's more like "I know all machines in 10.2.3.xxx belong to my network which is safe and which I trust, so I'll add 10.2.3.xxx/24 to trusted_networks" I don't think it's safe to add a external network like gmx to your trusted_networks list. But what is my internal network. I have one serv

RE: spam scores low (Sendmail + smtp-vilter + SA )

2006-01-16 Thread Mike Sassaman
> -Original Message- > From: jdow [mailto:[EMAIL PROTECTED] > Sent: Monday, January 16, 2006 3:24 PM > To: users@spamassassin.apache.org > Subject: Re: spam scores low (Sendmail + smtp-vilter + SA ) > > > From: "Mike Sassaman" <[EMAIL PROTECTED]> > >> On Fri, 13 Jan 2006, Mike Sassaman w

RE: Spamassassin & Bayes

2006-01-16 Thread Bowie Bailey
Markus Braun wrote: > > The trusted_networks setting specifies which mail servers you trust > > not to forge header information. Normally, this will just be the > > servers that you directly control. You can either list your > > mailservers individually, or you can list your entire network so > >

Re: Spamassassin & Bayes

2006-01-16 Thread jdow
From: "François Conil" <[EMAIL PROTECTED]> Markus Braun wrote: The trusted_networks setting specifies which mail servers you trust not to forge header information. Normally, this will just be the servers that you directly control. You can either list your mailservers individually, or you can

RE: Spamassassin & Bayes

2006-01-16 Thread Bowie Bailey
Markus Braun wrote: > > > About the rulesets in rulesemporium.com what can you suggest for > > > me? > > > > I would suggest at least the "0" files for you, and probably the > > "1" files. I suspect you get German-language email, and the SARE > > rules are really only checked thoroughly aginst En

Re: spam scores low (Sendmail + smtp-vilter + SA )

2006-01-16 Thread jdow
From: "Mike Sassaman" <[EMAIL PROTECTED]> On Fri, 13 Jan 2006, Mike Sassaman wrote: > Thanks everyone for your suggestions. This is what I've done: > > required_score 4 > rewrite_header Subject *SPAM* > add_header all Report _REPORT_ > use_razor2 1 > razor_config /etc/mail/spamassassin/

Re: Spamassassin & Bayes

2006-01-16 Thread François Conil
Markus Braun wrote: The trusted_networks setting specifies which mail servers you trust not to forge header information. Normally, this will just be the servers that you directly control. You can either list your mailservers individually, or you can list your entire network so that you automa

Re: Autolearn: works from command-line, not via milter

2006-01-16 Thread Jim Maul
Don Levey wrote: Don Levey wrote: Don Levey wrote: Jim Maul wrote: Failed means it didnt work for some reason. No means it simply didnt even try to autolearn (score wasnt high enough, spam/ham threshold not reached, etc.) In short, failed points to a potential problem, whereas no doesnt. -J

RE: Spamassassin & Bayes

2006-01-16 Thread Markus Braun
The trusted_networks setting specifies which mail servers you trust not to forge header information. Normally, this will just be the servers that you directly control. You can either list your mailservers individually, or you can list your entire network so that you automatically trust any mai

Re: Spamassassin & Bayes

2006-01-16 Thread Markus Braun
Are you running German-language rules on 3.0.x? If so this is (unfortunately) expected. If you are only running English-language rules files, then this would probably indicate that you had rules from 2.6x on a 3.x SA install. I have spamassassin installed on a debian sarge version with apt-g

RE: Autolearn: works from command-line, not via milter

2006-01-16 Thread Don Levey
Don Levey wrote: > Don Levey wrote: >> Jim Maul wrote: >>> >>> Failed means it didnt work for some reason. No means it simply >>> didnt even try to autolearn (score wasnt high enough, spam/ham >>> threshold not reached, etc.) In short, failed points to a >>> potential problem, whereas no doesnt.

RE: spam scores low (Sendmail + smtp-vilter + SA )

2006-01-16 Thread Mike Sassaman
> On Fri, 13 Jan 2006, Mike Sassaman wrote: > > > Thanks everyone for your suggestions. This is what I've done: > > > > required_score 4 > > rewrite_header Subject *SPAM* > > add_header all Report _REPORT_ > > use_razor2 1 > > razor_config /etc/mail/spamassassin/.razor/razor-agent.conf >

RE: Autolearn: works from command-line, not via milter

2006-01-16 Thread Don Levey
Don Levey wrote: > Jim Maul wrote: >> >> Failed means it didnt work for some reason. No means it simply didnt >> even try to autolearn (score wasnt high enough, spam/ham threshold >> not reached, etc.) In short, failed points to a potential problem, >> whereas no doesnt. >> >> -Jim > > That I und

Re: Autolearn: works from command-line, not via milter

2006-01-16 Thread Jim Maul
Don Levey wrote: Jim Maul wrote: Do you have a copy of a message that causes the autolearn=failed? I'd run it through SA with debug enabled to see why its failing..permissions of the bayes files are a likely cause. It looks to be failing only when it actually tries to do the autolearn so i'd c

RE: Autolearn: works from command-line, not via milter

2006-01-16 Thread Don Levey
Jim Maul wrote: > Don Levey wrote: >> Don Levey wrote: >> >>> Messages coming in and autoscanned via spamass-milter/spamd all fail >>> autolearn. To pick one example from this list (full headers >>> available if it will help): >>> >>> >>> X-Spam-Status: No, score=-4.9 required=5.0 tests=BAYES_00 >

RE: (OT) Actualizar spamassassin

2006-01-16 Thread Steven Manross
Group, Does someone have an SA RH8 upgrade guide link? Please? He's using RH8 and coming from SA 2.31-16... If someone also speaks Spanish, please forgive my butchery of the language. No translators were used -- it's all my fault. Fabian, Pardone me, pero mi Espanol es muy malo, y no tengo in

Re: Autolearn: works from command-line, not via milter

2006-01-16 Thread Jim Maul
Don Levey wrote: Don Levey wrote: Messages coming in and autoscanned via spamass-milter/spamd all fail autolearn. To pick one example from this list (full headers available if it will help): X-Spam-Status: No, score=-4.9 required=5.0 tests=BAYES_00 autolearn=failed version=3.0.4 X-Sp

RE: Autolearn: works from command-line, not via milter

2006-01-16 Thread Don Levey
Don Levey wrote: > > Messages coming in and autoscanned via spamass-milter/spamd all fail > autolearn. To pick one example from this list (full headers > available if it will help): > > > X-Spam-Status: No, score=-4.9 required=5.0 tests=BAYES_00 > autolearn=failed version=3.0.4 > X-Spam-Che

Re: What's up with these URLs?

2006-01-16 Thread mouss
Daniel Quinlan a écrit : > Theo Van Dinter <[EMAIL PROTECTED]> writes: > > >>http://gz8l9a2f7cg3/?ra=3DXXDD >>http://kzbrdz2lj80ym/?ra=3DXXDD >> >>Clearly if it's just a hostname w/out domain it's not going to work >>... So does this get parsed into something useful by some >>MUA/browser/etc? B

Re: Bayes - how bad is a small ham corpus with a big spam corpus?

2006-01-16 Thread Matt Kettler
[EMAIL PROTECTED] wrote: > Hi list, > > I'm currently trying to build up a new bayes DB here, since the autobuilt > DB fubared (as expected, no need to throw things at me ;)). It's rather > easy > to build up the spam part, as we are getting right enough of it, yet it > poses > a problem to buil

Re: Running spamassassin in two-pass

2006-01-16 Thread Nicolas Boullis
Randal, Phil wrote: > You might want to save yourself effort in reinventing the wheel and take > a look at MaliScanner 4.50.x which caches spamassassin scores > (http://www.mailscanner.info). Thanks for the information. I had a look at the website but could not find much information. Does MailScan

Autolearn: works from command-line, not via milter

2006-01-16 Thread Don Levey
I just moved/upgraded my home server this weekend, leading to less hair on my head and more ulcers elsewhere. While I've worked my way through many problems in the past few days, this one seems to be eluding me and googling, archives, etc haven't yet helped me. Here's the scoop: Messages coming i

Re: Gain an extra 25%! (was "Purging the Spamassassin Database")

2006-01-16 Thread M.S. Lucas
From: "Jason Frisvold" <[EMAIL PROTECTED]> On 1/15/06, Andrew Donkin <[EMAIL PROTECTED]> wrote: > If you have fewer than 65,000 accounts you could halve the size of the > id columns. > > That took 25% off my bayes_token data file and index. Since MySQL > likes to keep indexes in VM, I think it w

RE: Spamassassin & Bayes

2006-01-16 Thread Bowie Bailey
Markus Braun wrote: > > debug: tests=ALL_TRUSTED,MISSING_DATE,MISSING_SUBJECT,NO_REAL_NAME > > > Here is a BIG problem! You got an ALL_TRUSTED on this spam. > > Assuming you fed in a spam that came from outside and the headers > > are still there showing that, it shoudl NEVER get an ALL_TRUSTED

(OT) Actualizar spamassassin

2006-01-16 Thread Fabian Donoso C.
Hola a todos. Tengo la version (binario, rpm) spamassassin-2.31-16 en RH8 y necesito actualizarla, alguien sabe como se hace?

Re: Gain an extra 25%! (was "Purging the Spamassassin Database")

2006-01-16 Thread Jason Frisvold
On 1/15/06, Andrew Donkin <[EMAIL PROTECTED]> wrote: > If you have fewer than 65,000 accounts you could halve the size of the > id columns. > > That took 25% off my bayes_token data file and index. Since MySQL > likes to keep indexes in VM, I think it was worthwhile. Excellent information, thanks

Re: no bayes check applied in MTA

2006-01-16 Thread Paco Yepes
El vie, 06-01-2006 a las 09:55 +0100, mouss escribió: > Paco Yepes a écrit : > > Hello. > > > > Im using postfix 2.1.5 + spamassassin 3.1.0a-2 + amavisd-new > > 20030616p10-5 > > > > In the last weeks the size of false-negatives (fn) is grow > > significatelly. All this fn do not have scores of t

RE: Running spamassassin in two-pass

2006-01-16 Thread Randal, Phil
You might want to save yourself effort in reinventing the wheel and take a look at MaliScanner 4.50.x which caches spamassassin scores (http://www.mailscanner.info). Cheers, Phil Phil Randal Network Engineer Herefordshire Council Hereford, UK > -Original Message- > From: Nicolas

Running spamassassin in two-pass

2006-01-16 Thread Nicolas Boullis
Hi, I think that some part of spamassassin is highly user-specific (such as BAYES, AWL or UNWANTED_LANGUAGE_BODY). But I receive some e-mails that are sent to hundreds of our users, and I consider it is some waste of CPU-time and bandwidth to run the same tests on the same message hundreds of time

Re: What's up with these URLs?

2006-01-16 Thread M.S. Lucas
From: "Daniel Quinlan" <[EMAIL PROTECTED]> Theo Van Dinter <[EMAIL PROTECTED]> writes: http://gz8l9a2f7cg3/?ra=3DXXDD http://kzbrdz2lj80ym/?ra=3DXXDD Clearly if it's just a hostname w/out domain it's not going to work ... So does this get parsed into something useful by some MUA/browser/etc?

Re: What's up with these URLs?

2006-01-16 Thread Daniel Quinlan
Theo Van Dinter <[EMAIL PROTECTED]> writes: > http://gz8l9a2f7cg3/?ra=3DXXDD > http://kzbrdz2lj80ym/?ra=3DXXDD > > Clearly if it's just a hostname w/out domain it's not going to work > ... So does this get parsed into something useful by some > MUA/browser/etc? Both were found in a spam with a

Bayes - how bad is a small ham corpus with a big spam corpus?

2006-01-16 Thread srunschke
Hi list, I'm currently trying to build up a new bayes DB here, since the autobuilt DB fubared (as expected, no need to throw things at me ;)). It's rather easy to build up the spam part, as we are getting right enough of it, yet it poses a problem to build up the ham part. Much of our mail comin