--On Sunday, September 21, 2008 18:39 -0600 Bob Proulx <[EMAIL PROTECTED]> wrote:

OVERALL    SPAM%     HAM%     S/O    RANK   SCORE  NAME
  1.116   1.5957   0.2705    0.855   0.51    2.08  SUBJ_ALL_CAPS

Am I reading that correctly to see that in spam all caps showed up in
1.60% of the regression corpus and only in 0.27% of the non-spam?
Gosh that seems like a very small indicator.


No, it's high.  Only 1.87% had all caps subject, but of those 85%
were spam: 1.60 / 1.87.

If I am reading correctly.

Joseph Brennan
Columbia University Information Technology


Reply via email to