I'd like to score anything in Windows-1251 fairly high, as I don't expect to get anything legitimate in that charset. How can I read the charset declared in a Subject header, or in a MIME part, for matching in a rule?

The only tools I see are ok_locales and CHARSET_FARAWAY, but those seem like heavy hammers as they blacklist everything and then require me to whitelist what I want. I'd rather the reverse: let me list which codepages to reject.

I tried this rule but it's not firing and I'm not sure why:

describe KP_CYRILLIC Cyrillic code page
header   KP_CYRILLIC Subject =~ /Windows-1251/
score    KP_CYRILLIC 0.1

Reply via email to