On 7/7/2023 11:04 AM, Richard wrote:
For example, here I run it against a file containing just over 2100 spam:

In the end, I ran it on about four dozen files of ham and about 6 or so files of spam emails, carefully curated. In all these files, I NEVER saw it say it examined more than 1 message and EVERY time it said it examined a message it also said it "learned" 1 token.


I believe the default format is Maildir.  You  mention a single file w/ multiple emails which suggests you might be running MBox format? If so, try the --mbox command line switch.

-- Jared Hall

Reply via email to