On 05.01.10 07:19, cmvhk wrote:
> An email sent to me containing a book review in French was recently falsely
> classified as spam, largely because it failed the LOCAL_BODY_CIALIS rule:
> 
> 2.0 LOCAL_BODY_CIALIS      BODY: Mentions viagra clone 'cialis'
> 
> I quote offending part of the message:
> 
> ... de\s sa sortie en 1978,
> comme un outil de travail de premier plan pour les spe/cialistes de
> langue et d'e/pigraphie e/trusques, mais e/tait devenue avec le temps ....
> 
> e/ is a standard way of transliterating e-acute. Could the rule be rewritten
> so as not to catch instances such as this? (I recall a rule which used to
> object to 'Best wishes, Virginia' because of the proximity of 'best' and
> 'virgin', which was rewritten so as not to match if the string 'virgin' was
> part of 'Virginia'.

another rule depending on languages used.

the fastest workaround should be rule that matches "specialistes" in any form
and meta-rule that gives -2 when LOCAL_BODY_CIALIS and the rule above are
hit.

If french language is detected, this could be also a part of the meta-rule.

-- 
Matus UHLAR - fantomas, uh...@fantomas.sk ; http://www.fantomas.sk/
Warning: I wish NOT to receive e-mail advertising to this address.
Varovanie: na tuto adresu chcem NEDOSTAVAT akukolvek reklamnu postu.
Micro$oft random number generator: 0, 0, 0, 4.33e+67, 0, 0, 0...

Reply via email to