Re: A New Approach: Find the Ham

Burak Ueda Sat, 10 Feb 2007 18:57:06 -0800

Good point, but will cause trouble UNLESS we find a way to recognizeham 100%. And it must me exactly 100% (99% won't be enough).As other users said, with current system, if we can filter 70-80 of thespam, remaining 20-30% will only be an annoyance, but ham will be delivered.

But with the new approach event if the spam stopped 100%, only 1%undelivered ham will cause a lot of trouble.


Just my 1 Yen  :-)




Dan wrote:

I've developed a new approach to scoring that I want to 1) share witheveryone and 2) make into a working system thats as accurate as whatI've already built, but easier to use. First, the theory:
SITUATION
In the beginning, all email was ham. When spam came along, we leftthe ham alone and targeted the annoyance (spam).
ASSUMPTION
All messages are ham unless x,y,z score says they're spam.

APPROACH
Block nothing, then create rules to catch what you don't want. ie,build tests that target the spam, then score the millions of ways spamcan occur.
RESULT
Huge time spent tuning and retuning weights, catching everything insight (including much ham).
NEW SITUATION
Ham is now the tiniest minority of all email.

NEW ASSUMPTION
All messages are spam unless x,y,z score says they're ham.

NEW APPROACH
Block everything, then create rules to not catch what you do want.ie, build tests that target the spam (keeping all the tests you'vealready built), then score the thousands of ways ham triggers on thosetests.
NEW RESULT
Spend less time and energy while catching more of what you do want andless of what you don't.
CHALLENGE
All filtering software is written to score for results that equal spam-> catch the bad
SOLUTION
Make filtering software score for results that equal ham -> uncatchthe good.
Your thoughts?

Dan


BTW, is there a better forum for this level of question?

Re: A New Approach: Find the Ham

Reply via email to