Hi,

I'm not sure whether this is the right mailing list for reporting
issues. If not, sorry.

Recently I noticed that some Japanese emails are being detected as spam,
and these are actually false positives. The cause seems to be
TVD_SPACE_ENCODED and TVD_SPACE_RATIO_MINFP.

Both rules use __TVD_SPACE_RATIO, which matches Japanese text easily
because Japanese doesn't use spaces as word separators.
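
As a rough illustration of the difference in space density (a minimal
sketch only; I have not checked how __TVD_SPACE_RATIO is actually
computed, and the space_ratio helper and sample strings here are
hypothetical):

```python
def space_ratio(text: str) -> float:
    """Return the fraction of characters in text that are ASCII spaces.

    Hypothetical helper for illustration; not the SpamAssassin check.
    """
    if not text:
        return 0.0
    return text.count(" ") / len(text)


# Sample strings chosen only for this illustration.
english = "This is a short test message"
japanese = "これは短いテストメッセージです"

print(f"English:  {space_ratio(english):.2f}")
print(f"Japanese: {space_ratio(japanese):.2f}")
```

The Japanese sample contains no spaces at all, so any check based on
the number of spaces sees it very differently from English text.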

So my question is this: the TVD_SPACE_RATIO rule combines
__TVD_SPACE_RATIO with !__ISO_2022_JP_DELIM, but TVD_SPACE_ENCODED and
TVD_SPACE_RATIO_MINFP use __TVD_SPACE_RATIO without
!__ISO_2022_JP_DELIM.

Is this intended?


BTW, when I added an "&& !__ISO_2022_JP_DELIM" check to the
__TVD_SPACE_ENCODED and TVD_SPACE_RATIO_MINFP rules in
.spamassassin/user_prefs (as shown below), it seemed to prevent the
false positives for me.

Is there a recommended way to prevent these false positives?


meta        __TVD_SPACE_ENCODED    (__TVD_SPACE_RATIO && __SUBJECT_ENCODED_B64 && !__SUBJECT_UTF8_B_ENCODED && !__ISO_2022_JP_DELIM)
meta        TVD_SPACE_RATIO_MINFP  __TVD_SPACE_RATIO && !__LCL__ENV_AND_HDR_FROM_MATCH && !__SUBSCRIPTION_INFO && !__RCD_RDNS_MAIL && !__SUBJECT_ENCODED_QP && !__THREADED && !__TO_EQ_FROM_DOM && !__BOTH_INR_AND_REF && !__X_CRON_ENV && !__HAS_THREAD_INDEX && !__HDRS_LCASE_KNOWN && !__ISO_2022_JP_DELIM
-- 
OGAWA Hirofumi <hirof...@mail.parknet.co.jp>
