Daniel,
look in the wordfreqs/ directory of the distribution.
C
Daniel Quinlan wrote:
DQ> Craig R Hughes writes:
DQ>
DQ> > Better than a straight dictionary of single words is a dictionary of
DQ> > phrases, weighted by their frequency in spam vs nonspam. Hmm, wait,
DQ> > that sounds familiar
On Sat, 1 Jun 2002 the voices made Craig R Hughes write:
> Tony L. Svanstrom wrote:
>
> TLS>
> TLS> Just a thought... wouldn't it be a good idea to have a "dictionary" with
> TLS> common words in spam; each word could have a low score, but it'd add up pretty
> TLS> quickly whenever you get a 3 p
Craig R Hughes writes:
> Better than a straight dictionary of single words is a dictionary of
> phrases, weighted by their frequency in spam vs nonspam. Hmm, wait,
> that sounds familiar somehow... ;) I suppose we ought to turn spam
> phrases back on I'll work on that right now, and check
Better than a straight dictionary of single words is a dictionary of phrases,
weighted by their frequency in spam vs nonspam. Hmm, wait, that sounds familiar
somehow... ;) I suppose we ought to turn spam phrases back on I'll work
on that right now, and check it in once working.
C
Tony L
Tony L. Svanstrom wrote:
> Just a thought... wouldn't it be a good idea to have a "dictionary" with
> common words in spam; each word could have a low score, but it'd add up pretty
> quickly whenever you get a 3 page pornmail...
That's how the spam phrases stuff is supposed to work.
Matt.
__
Just a thought... wouldn't it be a good idea to have a "dictionary" with
common words in spam; each word could have a low score, but it'd add up pretty
quickly whenever you get a 3 page pornmail...
/Tony
--
# Per scientiam ad libertatem! // Through knowledge towards freedom! #
# Genom