Re: training SpamAssassin without updating bayes*

Loren Wilton Sat, 04 Mar 2006 19:31:36 -0800

> During testing, I can see spamassassin create a "bayes_journal" file and
> write to it continuously. I understand this is spamassassin's way of


If the journal is only growing it isn't being learned from.

Typically at some point if auto-learn were enabled one of the spam mail runs
would take some extra time and update the bayes store.  Since you have
auto-learn disabled, this isn't happening.  Typically in a real-world
situation, the admin would have set up a cron job to run sa-learn and do the
update.


I would strongly suggest in your paper when comparing your Bayes technique
to SA's, that you EMPHASIZE that your comparison was ONLY on Bayes ability,
and not the overall ability of the product.  A very large part of SA is the
ability to classify spam based on rules, and quite a lot of that benefit can
come from running net tests.

So if your thing is only a Bayes tool and it is twice as good as SA's Bayes
module, it would be somewhat disingenious to claim "our spam tool is twice
as good as SpamAssassin", rather than "our tool is twice as good as the
Bayes module of SpamAssassin".

        Loren

Re: training SpamAssassin without updating bayes*

Reply via email to