Adam Katz wrote:
On 02/28/2011 08:24 AM, Kris Deugau wrote:
Mail reported by a customer as falsely tagged showed these rule hits.
I've scored these rules down for now.
Checking through the message text showed these likely matches:
FRT_APPROV: approuvé
FRT_EXPERIENCE: Expérience
I'm pretty sure it's the accented 'e' in each word that's the trigger.
I agree. I have fixed those two specific examples on SA trunk at svn
revision 1075489.
Thanks!
Please note that this sort of thing is better handled as a bug request,
and complaints directed at this list tend not to get such prompt
attention. Try filing it in https://issues.apache.org/SpamAssassin/
next time.
Noted. I wasn't sure I was looking at the problem the right way though;
I tried a couple of things to fix the rule but none of them worked.
This does indeed seem likely. Extra eyes from those of us versed in
non-English Latin-character languages would be quite helpful.
This could get you started:
grep -riE '^(raw|body|header.*subject).*\(\?![a-z?]{2,}\)' rules*
Nothing further jumps out from that list for French, but I'm not nearly
as familiar with any other Latin-character languages. And my last
extensive contact with French was almost 20 years ago in high school...
-kgd