Good point, but will cause trouble UNLESS we find a way to recognize
ham 100%. And it must me exactly 100% (99% won't be enough).
As other users said, with current system, if we can filter 70-80 of the
spam, remaining 20-30% will only be an annoyance, but ham will be delivered.
But with the new approach event if the spam stopped 100%, only 1%
undelivered ham will cause a lot of trouble.
Just my 1 Yen :-)
Dan wrote:
I've developed a new approach to scoring that I want to 1) share with
everyone and 2) make into a working system thats as accurate as what
I've already built, but easier to use. First, the theory:
SITUATION
In the beginning, all email was ham. When spam came along, we left
the ham alone and targeted the annoyance (spam).
ASSUMPTION
All messages are ham unless x,y,z score says they're spam.
APPROACH
Block nothing, then create rules to catch what you don't want. ie,
build tests that target the spam, then score the millions of ways spam
can occur.
RESULT
Huge time spent tuning and retuning weights, catching everything in
sight (including much ham).
NEW SITUATION
Ham is now the tiniest minority of all email.
NEW ASSUMPTION
All messages are spam unless x,y,z score says they're ham.
NEW APPROACH
Block everything, then create rules to not catch what you do want.
ie, build tests that target the spam (keeping all the tests you've
already built), then score the thousands of ways ham triggers on those
tests.
NEW RESULT
Spend less time and energy while catching more of what you do want and
less of what you don't.
CHALLENGE
All filtering software is written to score for results that equal spam
-> catch the bad
SOLUTION
Make filtering software score for results that equal ham -> uncatch
the good.
Your thoughts?
Dan
BTW, is there a better forum for this level of question?