Now I don't expect SA to know dutch; that would be unfair. But what I
would
like is some way to score those english terms way higher than an american
would or could.  For an american, mortgage does not spell spam per se. But
for ME it does, and I can practically guarantee I will not ever get an
email
that mentions "mortgage" together with "you have been approved" which
won't
be spam.
At the risk of being repetitive, this is precisely the sort of thing bayes
excels at. Give it a shot (hopefully you have some ham'n'spam saved up
already), I think you will be pleased.


Well, none of this is your concern of course. But I would really really

Perhaps it's true that your success is not directly anyone's concern but your own. However, the regulars on this list are basically a buncha SA users who are trying to improve their results and help others do the same along the way.


And herin lies the problem, sure anybody who is willing to spend time tweaking their personal setups, training bayes etc. will have great success at filtering out spam.


Some of us though are system administrators and need a solution to offer to the end users. The typical end user wants to open their email and see no spam, period.

Presently without the tweaks and training all we can do is reduce his spam by about 50 - 60%. Settings have to be left at conservative in order not to get the phone calls complaining about false positives.

The bayes filtering works great, but the typical user is not going to want to jump through what he would consider the huge obstacles to train a corpus. Furthermore implementing bayes on a system that incorporates thousands of users can be a daunting task, and isn't even an available option to some of us.

Therefore when someone asks if there is a method to improve on the basic ruleset we should pay more attention, not just recommend he use bayes.

tm.


really
like if there was a way to have those typical english spam-words score way
higher than they do now.  Could we maybe envision two rulesets, one for
english-speaking residents and one for non-english speaking residents...?
I edited the score file myself but not only is it a hard, long and
error-prone
task, but by editing it I throw away much of the valueable knowhow which
assembled that score-list in the first place.  But I am faced with the
fact
that over 95% of my spam is in english and that I cannot sit back while
the
online pharmacies fly around me, so to speak.
Put yourself in my (our, if i'd be speaking for all non-english countries)
place and ask yourself this question: Would you accept a score of only 0.5
for a rule that says "gratis hypotheekadvies" or "vijf miljoen
emailadressen"
??  No, of course you wouldn't, because you'd know that a company that
pretends to sell you a mortgage from 12000 miles away will never ever be a
genuine offer...



Knowing that there are regulars on this list who's primary language is NOT English, anyone care to share how their setup handles English and non-English spam?








------------------------------------------------------- This SF.Net email sponsored by: ApacheCon 2003, 16-19 November in Las Vegas. Learn firsthand the latest developments in Apache, PHP, Perl, XML, Java, MySQL, WebDAV, and more! http://www.apachecon.com/ _______________________________________________ Spamassassin-talk mailing list [EMAIL PROTECTED] https://lists.sourceforge.net/lists/listinfo/spamassassin-talk

Reply via email to