Hi,
If these are hit rates with a very minimal daily corpus, don't know if the
present ruleset is ready for production unless you have 0 tolerance for any
bulk, period
I'm afraid I must agree. I don't have a confirmed and sorted corpus per se,
but after a single night's live testing with very low scores I can confirm
that, as I suspected, many of these rules hit genuine opt-in newsletters and
even things like ebay notifications in French.
Thanks for the feedback. I do not have any ebay subscriptors in my users,
except one power-seller who has ebay thingies in whitelist.
I will however keep the ruleset live for a while, to see whether the
online meds and onling gambling rules actually hit anything.
They should, they do on my machines. But actually, they are only useful
for a "new" spam that has not been caught yet by RBL. When I wrote them,
it was because spam *was* getting through, now they just push towards
"almost-probably-spam". Another note is that much of this particular spam
is auto and badly translated (much "pidgin-French" if I can say so).
My personal tolerance for bulk mail is pretty low, and in a way I'd love to
use rules like these, with just a bit of fine tuning - the rules do also hit
a fair bit of French spam. But unfortunately my users actually want to
receive their newsletters and even complain if it ends up in their spam
folder.
I think I have a newbye simple problem of philosophy/strategy: my
approach, for what it's worth, was that I flag anything that contains some
unsubscribe links and French law reminders because anyway all the ones I
receive are spam, and I add the opt-in mailing/newsletter I receive to
whitelist_from in user_prefs, i.e. I kill everything except those
explicitly allowed.
If that is not the correct approach, I can garantee you the current way
the rules are written is bad (too harsh), and I need strategy advice on
how to manage opt-in lists.
John