Re: Experimental Plugin: MetaSVM

Marc Perkel Sun, 15 Mar 2009 07:59:07 -0700


decoder wrote:

LuKreme wrote:
This is an excellent idea, but it also needs rule hits on ham, right?
You're right if you're saying that the method would work better if there were more ham rules. From what I have seen in my experiments, however, the results are also very precise with the current SA ruleset. But any rule that adds some information to the feature set might yet increase the performance (especially the performance on unrecognized spam; on ham/spam which is detected by SA as well, the algorithm performs nearly as well as SA itself).

What I'm thinking, once this gets working, is to write what I'll call "informational rules". These rules would by themselves be 0 point rules and might at best be only slight indicators of spam vs. ham, but when combined with other rules would enhance the ability to form accurate metarules. And perhaps tokens can come from other things than just rules. Like the countries the message has passed through. Or individual word rules that we stopped using a long time ago. Marketing phrases.
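
To make the idea concrete, here is a rough sketch of how 0 point informational rules could leave the SA score alone while still adding tokens to the feature set a classifier sees. All rule names and scores below are made up for illustration, and this is not how MetaSVM or SpamAssassin actually represents hits; it is just plain Python under those assumptions.

```python
# Hypothetical rule names and scores, for illustration only.
# These are NOT taken from the real SpamAssassin ruleset.
SCORED_RULES = {"HTML_MESSAGE": 0.1, "MISSING_DATE": 1.4}

# Hypothetical "informational rules": they carry no score of their own.
INFO_RULES = ["INFO_RELAY_COUNTRY_XX", "INFO_RED_TEXT", "INFO_MARKETING_PHRASE"]

# Fixed feature order: scored rules first, then informational rules.
FEATURES = sorted(SCORED_RULES) + sorted(INFO_RULES)


def sa_score(hits):
    """Sum the scores of the rules that hit; informational rules add 0."""
    return sum(SCORED_RULES.get(name, 0.0) for name in hits)


def feature_vector(hits):
    """Encode rule hits as a 0/1 vector that a classifier could consume."""
    hit_set = set(hits)
    return [1 if name in hit_set else 0 for name in FEATURES]


# Two messages with the same score but different informational hits.
msg_a = ["HTML_MESSAGE"]
msg_b = ["HTML_MESSAGE", "INFO_RED_TEXT", "INFO_RELAY_COUNTRY_XX"]

print(sa_score(msg_a), feature_vector(msg_a))
print(sa_score(msg_b), feature_vector(msg_b))
```

Both messages get the same score, but their feature vectors differ, so a learner trained on these vectors has something extra to form metarules from.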

I remember when Bayes first came out that we discovered that RED text was a stronger indicator of spam than words like viagra. I'm hopeful that this is going to give us a breakthrough like that where we find that interesting combinations change the way we see spam filtering.


I'm looking forward to seeing what comes of this.
