Justin Mason wrote:
> http://spamassassin.apache.org/full/3.1.x/doc/Mail_SpamAssassin_Plugin.html#item_bayes_learn

Thanks!

> by the way, a nice, working plugin that does this would be quite useful

Since it was so straight-forward I made a small plugin that collects the raw 
tokens in a SQL table.

I've only been using it for about an hour, so there may be well be problems 
with it. It ought to work though :-)
I've only tested it with MySQL, but it should work without mods with SQLite as 
well I think, and should be trivial to modify for other SQL servers.

If anyone wants to test it, it's called CollectTokens.pm and is available at 
<http://whatever.frukt.org/spamassassin.text.shtml>. Please tell me when yopu 
find any problems.

What to actually do with the collected data is up to you, but here's two 
example queries:

Top 10 ham tokens:
SELECT bayes_token.ham_count,bayes_rawtoken.rawtoken 
  FROM bayes_rawtoken,bayes_token 
  WHERE bayes_rawtoken.token=bayes_token.token
  ORDER BY bayes_token.ham_count DESC LIMIT 10;

Top 10 spam tokens:
SELECT bayes_token.spam_count,bayes_rawtoken.rawtoken 
  FROM bayes_rawtoken,bayes_token 
  WHERE bayes_rawtoken.token=bayes_token.token
  ORDER BY bayes_token.spam_count DESC LIMIT 10;

Not sure that this is useful for anything at all, but curiosity is part of 
human nature. :-)

Regards
/Jonas

-- 
Jonas Eckerman, FSDB & Fruktträdet
http://whatever.frukt.org/
http://www.fsdb.org/
http://www.frukt.org/

Reply via email to