On Wed, 21 Mar 2012 13:44:49 +0100 Matus UHLAR - fantomas <uh...@fantomas.sk> wrote:
> Mangling data by exchange is a big. problem when trying to filter > spam in front of it. I see two ways to avoid this problem: > - use spam server for exchange. We use one from GFI, with quite good > results. > - you can use spam filter in front of exchange, store copies on it > and learn from them. However, you will probably be the only one who > can train spamfilter in such case. Actually, there's a third way and it's what we do (but difficult to set up with pure SpamAssassin.) We tokenize inbound messages and store the tokens on the server. In each message, we add links for doing training. When you click on a training link, the system trains the message based on the tokens stored on the server. In that way, you are training using exactly the tokens that the Bayes code saw. Regards, David.