Re: [SAtalk] scoring individual words

2002-06-02 Thread Craig R Hughes
Daniel, look in the wordfreqs/ directory of the distribution. C Daniel Quinlan wrote: DQ> Craig R Hughes writes: DQ> DQ> > Better than a straight dictionary of single words is a dictionary of DQ> > phrases, weighted by their frequency in spam vs nonspam. Hmm, wait, DQ> > that sounds familiar

Re: [SAtalk] scoring individual words

2002-06-01 Thread Tony L. Svanstrom
On Sat, 1 Jun 2002 the voices made Craig R Hughes write: > Tony L. Svanstrom wrote: > > TLS> > TLS> Just a thought... wouldn't it be a good idea to have a "dictionary" with > TLS> common words in spam; each word could have a low score, but it'd add up pretty > TLS> quickly whenever you get a 3 p

Re: [SAtalk] scoring individual words

2002-06-01 Thread Daniel Quinlan
Craig R Hughes writes: > Better than a straight dictionary of single words is a dictionary of > phrases, weighted by their frequency in spam vs nonspam. Hmm, wait, > that sounds familiar somehow... ;) I suppose we ought to turn spam > phrases back on I'll work on that right now, and check

Re: [SAtalk] scoring individual words

2002-06-01 Thread Craig R Hughes
Better than a straight dictionary of single words is a dictionary of phrases, weighted by their frequency in spam vs nonspam. Hmm, wait, that sounds familiar somehow... ;) I suppose we ought to turn spam phrases back on I'll work on that right now, and check it in once working. C Tony L

Re: [SAtalk] scoring individual words

2002-05-29 Thread Matt Sergeant
Tony L. Svanstrom wrote: > Just a thought... wouldn't it be a good idea to have a "dictionary" with > common words in spam; each word could have a low score, but it'd add up pretty > quickly whenever you get a 3 page pornmail... That's how the spam phrases stuff is supposed to work. Matt. __

[SAtalk] scoring individual words

2002-05-29 Thread Tony L. Svanstrom
Just a thought... wouldn't it be a good idea to have a "dictionary" with common words in spam; each word could have a low score, but it'd add up pretty quickly whenever you get a 3 page pornmail... /Tony -- # Per scientiam ad libertatem! // Through knowledge towards freedom! # # Genom