On Wed, Jul 26, 2006 at 07:43:51AM -0700, John Rudd wrote:
> When that score is developed, how is it decided that the scores have
> settled? When a "95% of the spam in the corpus got ranked 5 or
> higher"? 80%? 100%? That's the comparison I'm looking for.

It's a learning system, so it's done when it runs out of results to learn from. ;)

I have some bits on the perceptron in my presentation from AC 2004:

http://people.apache.org/~felicity/AC2004/

Check out page 29. Looking at the STATISTICS* files in the rules directory may be useful too, btw.

-- 
Randomly Generated Tagline:
"I hate going to the dentist. Everytime I go my tongue gets depressed."
        - Home Movies, "Therapy"
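
P.S. For illustration only, here is a toy sketch of the idea: a perceptron keeps adjusting per-rule scores while it still makes mistakes on the corpus, and stops when a pass produces no corrections or the pass budget runs out. This is not the actual SpamAssassin perceptron code. The function names, the corpus layout (a list of rule indices that hit, plus a spam flag), the learning rate, and the pass limit are all assumptions made up for the example; the threshold of 5 is the figure from the question above.

```python
def train_scores(corpus, n_rules, threshold=5.0, lr=1.0, max_passes=50):
    """Toy perceptron: learn one score per rule (hypothetical layout).

    corpus is a list of (hits, is_spam) pairs, where hits is a list of
    rule indices that fired on the message.
    """
    scores = [0.0] * n_rules
    for _ in range(max_passes):
        mistakes = 0
        for hits, is_spam in corpus:
            total = sum(scores[i] for i in hits)
            predicted_spam = total >= threshold
            if predicted_spam != is_spam:
                # Push the scores of the rules that hit toward the right side.
                step = lr if is_spam else -lr
                for i in hits:
                    scores[i] += step
                mistakes += 1
        if mistakes == 0:
            # Nothing left to learn from this corpus.
            break
    return scores


def spam_hit_rate(corpus, scores, threshold=5.0):
    """Fraction of spam messages whose total score reaches the threshold."""
    spam = [hits for hits, is_spam in corpus if is_spam]
    caught = sum(1 for hits in spam if sum(scores[i] for i in hits) >= threshold)
    return caught / len(spam)


if __name__ == "__main__":
    # Made-up corpus: rules 0 and 1 lean spam, rule 2 leans ham.
    corpus = [([0, 1], True), ([0], True), ([2], False), ([1, 2], False)]
    scores = train_scores(corpus, n_rules=3)
    print(scores, spam_hit_rate(corpus, scores))
```

In this sketch a figure like "95% of the spam got ranked 5 or higher" is something you measure after training with spam_hit_rate, not the condition that ends training.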