I installed SAProxy yesterday for use with an Outlook Express POP3 account.
It works, but when I try to teach it my old spam collection, it does not
learn from it! I use the --dir option and save the e-mails from OE in .eml
format. Sometimes it learns a couple of messages out of a hundred. I also
got some spam that SA didn't detect, saved those messages into a Spam
folder, ran sa-learn, and again it said 0 messages.
The only explanation I can think of is that perhaps they all have the same
message id? In that case sa-learn should have an option to ignore the
message id and just add the message. Adding the same spam message twice
shouldn't matter either, because then it just gets extra weight?
Also, shouldn't SA detect that messages have the same id and mark the
message as spam? It could keep a dynamic database of, for example, the
latest one million messages that I got.
I don't know whether sa-learn fails because of duplicate message ids,
though. Any other suggestions are welcome. I also tried the -D switch once,
but it didn't say anything about the ignored messages.
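
One way I could test the duplicate message id theory is a small Python
script like the one below. It is only a sketch: the function names are made
up for illustration, it assumes the saved .eml files all sit in one folder,
and it only reads the Message-ID header. It says nothing about how sa-learn
itself decides whether to skip a message.

```python
import email
import sys
from collections import Counter
from pathlib import Path


def count_message_ids(folder):
    """Count how often each Message-ID value occurs among the .eml files in folder."""
    counts = Counter()
    for path in sorted(Path(folder).glob("*.eml")):
        with open(path, "rb") as handle:
            msg = email.message_from_binary_file(handle)
        # Messages without the header are grouped under a placeholder key.
        msg_id = str(msg.get("Message-ID") or "<missing>").strip()
        counts[msg_id] += 1
    return counts


def duplicated_ids(counts):
    """Keep only the ids that occur more than once."""
    return {msg_id: n for msg_id, n in counts.items() if n > 1}


if __name__ == "__main__":
    # Folder to scan; defaults to the current directory.
    folder = sys.argv[1] if len(sys.argv) > 1 else "."
    counts = count_message_ids(folder)
    print(f"{sum(counts.values())} messages, {len(counts)} distinct ids")
    for msg_id, n in sorted(duplicated_ids(counts).items()):
        print(f"{n} x {msg_id}")
```

If this reports as many distinct ids as messages, duplicate ids would not
explain the "0 messages" result and the cause must be somewhere else.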
How does the --forget option work? If I use it on all my spam, do the
messages get learned as separate messages, or does it always remove the old
data from the database first?
--
Harri