On Thu, 21 Jan 2016 12:11:15 +0000 RW <rwmailli...@googlemail.com> wrote:
> "ambulatory care" -> only in ham ... > is that you have discarded the count information. And his assertion is not necessarily true, either. According to our statistics, we've seen "ambulatory care" in 1400 spams, but also in 22 spams. While 1400/1422 still makes the token useful for Bayes, his algorithm would discount it altogether because it's not "pure" ham. Regards, Dianne.