> During testing, I can see spamassassin create a "bayes_journal" file and > write to it continuously. I understand this is spamassassin's way of
If the journal is only growing it isn't being learned from. Typically at some point if auto-learn were enabled one of the spam mail runs would take some extra time and update the bayes store. Since you have auto-learn disabled, this isn't happening. Typically in a real-world situation, the admin would have set up a cron job to run sa-learn and do the update. I would strongly suggest in your paper when comparing your Bayes technique to SA's, that you EMPHASIZE that your comparison was ONLY on Bayes ability, and not the overall ability of the product. A very large part of SA is the ability to classify spam based on rules, and quite a lot of that benefit can come from running net tests. So if your thing is only a Bayes tool and it is twice as good as SA's Bayes module, it would be somewhat disingenious to claim "our spam tool is twice as good as SpamAssassin", rather than "our tool is twice as good as the Bayes module of SpamAssassin". Loren