On 08/17/16 03:43, Matus UHLAR - fantomas wrote:
> On 16.08.16 20:06, Marc Perkel wrote:
>> What I'm doing is looking for fingerprints in email that intersect HAM and not in SPAM - which would be a HAM result.
>> If it matches SPAM and does NOT match HAM - then it's SPAM.
>>
>> The magic is in the NOT matching on the other side.
>
> so, if mail matches both hammy and spammy tokens (or token sets), you don't
> classify at all?


For that fingerprint, if it matches both, it creates no score on that item. The idea is to generate a lot of fingerprints so that something scores. If you look at enough stuff to generate hundreds of fingerprints and you have big reference corpora, then you will usually get a result on something. Usually a big result in one direction.

But ignoring a fingerprint when it's in both makes it more immune to poisoning.
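
To make the rule concrete, here is a rough Python sketch of the idea, not the actual filter code. The function names, the use of plain sets for the reference corpora, and the "bigger count wins" decision at the end are all assumptions made for illustration.

```python
def score_message(fingerprints, ham_corpus, spam_corpus):
    """Count fingerprints seen on exactly one side (hypothetical helper)."""
    ham_only = 0
    spam_only = 0
    for fp in set(fingerprints):
        in_ham = fp in ham_corpus
        in_spam = fp in spam_corpus
        if in_ham and not in_spam:
            ham_only += 1
        elif in_spam and not in_ham:
            spam_only += 1
        # In both corpora, or in neither: no score on this fingerprint.
    return ham_only, spam_only


def classify(fingerprints, ham_corpus, spam_corpus):
    """Assumed decision rule: whichever side has more one-sided hits wins."""
    ham_only, spam_only = score_message(fingerprints, ham_corpus, spam_corpus)
    if ham_only > spam_only:
        return "ham"
    if spam_only > ham_only:
        return "spam"
    return "unsure"


# Toy reference corpora (made-up fingerprints).
ham_corpus = {"meeting notes", "see attached", "click here"}
spam_corpus = {"cheap pills", "act now", "click here"}

message = ["click here", "cheap pills", "act now", "never seen before"]
print(score_message(message, ham_corpus, spam_corpus))
print(classify(message, ham_corpus, spam_corpus))
```

In the toy data "click here" is in both corpora, so it adds nothing either way, and the two spam-only fingerprints decide the result.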

--
Marc Perkel - Sales/Support
supp...@junkemailfilter.com
http://www.junkemailfilter.com
Junk Email Filter dot com
415-992-3400
