Re: FRT_APPROV, FRT_EXPERIENCE FPs on French text

Kris Deugau Mon, 28 Feb 2011 12:55:13 -0800

Adam Katz wrote:

On 02/28/2011 08:24 AM, Kris Deugau wrote:

Mail reported by a customer as falsely tagged showed these rule hits.
I've scored these rules down for now.


Checking through the message text showed these likely matches:

FRT_APPROV:    approuvé

FRT_EXPERIENCE:    Expérience

I'm pretty sure it's the accented 'e' in each word that's the trigger.
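
A minimal sketch of how this kind of false positive can arise, assuming the rule pairs a negative lookahead on the plain spelling with look-alike character classes that also accept accented letters. The pattern, the E class, and the hits() helper are hypothetical stand-ins, not the actual FRT_EXPERIENCE definition, and the "patched" variant is only one possible exemption, not necessarily the change made on trunk:

```python
import re

# Hypothetical look-alike class for the letter "e" (an assumption for
# illustration; the real rule's character set may differ).
E = "[eèéêë3]"

# Skip the plain English spelling, then match any look-alike spelling.
ORIGINAL = re.compile(
    rf"\b(?!experience){E}xp{E}r[i1]{E}nc{E}\b", re.IGNORECASE
)

# One possible exemption: let the lookahead also cover the French spelling.
PATCHED = re.compile(
    rf"\b(?!exp[eé]rience){E}xp{E}r[i1]{E}nc{E}\b", re.IGNORECASE
)


def hits(pattern: re.Pattern, text: str) -> bool:
    """Return True if the pattern fires anywhere in the text."""
    return pattern.search(text) is not None


for word in ("experience", "Expérience", "exp3rience"):
    print(word, hits(ORIGINAL, word), hits(PATTERN := PATCHED, word))
```

In this sketch the plain spelling is excluded by the lookahead, while the accented spelling slips past the lookahead and then satisfies the look-alike classes, so it is treated like an obfuscation.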


I agree.  I have fixed those two specific examples on SA trunk at svn
revision 1075489.


Thanks!

Please note that this sort of thing is better handled as a bug request,
and complaints directed at this list tend not to get such prompt
attention.  Try filing it in https://issues.apache.org/SpamAssassin/
next time.

Noted.  I wasn't sure I was looking at the problem the right way though;
I tried a couple of things to fix the rule but none of them worked.

This does indeed seem likely.  Extra eyes from those of us versed in
non-English Latin-character languages would be quite helpful.

This could get you started:

grep -riE '^(raw|body|header.*subject).*\(\?![a-z?]{2,}\)' rules*
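
To show what that grep command selects, here is a rough Python equivalent of the same expression applied to a few made-up rule lines. The rule names and patterns in SAMPLE_LINES are invented for illustration and do not come from the SpamAssassin rule set:

```python
import re

# Same expression as the grep command; re.IGNORECASE stands in for -i.
LOOKAHEAD_RULE = re.compile(
    r"^(raw|body|header.*subject).*\(\?![a-z?]{2,}\)",
    re.IGNORECASE,
)

# Invented rule lines, for illustration only.
SAMPLE_LINES = [
    r"body   DEMO_APPROVED  /\b(?!approved)[a@]ppr[o0]v[e3]d\b/i",
    r"header DEMO_SUBJ      Subject =~ /\b(?!free)fr[e3][e3]\b/i",
    r"body   DEMO_PLAIN     /\bviagra\b/i",
    r"score  DEMO_APPROVED  1.0",
]


def candidates(lines):
    """Return body/raw/Subject rule lines with a negative lookahead of 2+ letters."""
    return [line for line in lines if LOOKAHEAD_RULE.search(line)]


for line in candidates(SAMPLE_LINES):
    print(line)
```

Only the first two sample lines are selected: each is a body or Subject rule containing a negative lookahead on a whole word, which is the shape of rule most likely to misfire on an accented spelling of that word.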

Nothing further jumps out from that list for French, but I'm not nearly
as familiar with any other Latin-character languages.  And my last
extensive contact with French was almost 20 years ago in high school...


-kgd
