Hey folks,
I hacked together a quick test for words with weird characters in them, e.g.


b;uy m`ore ...

Add to EvalTests.pm

sub weird_words {
    my ($self, $fulltext) = @_;
    my $count = 0;
    # Get the text
    $fulltext = $self->get_decoded_body_text_array();
    foreach my $line ( @{$fulltext} ) {
        my @words = split(' ',$line);
        foreach my $word ( @words ) {
            if ( $word =~ /([a-z,A-Z]+)[EMAIL PROTECTED]:;|\\,.]([a-z,A-Z]+)/ ) {
                                        $count++;
            }   
        }
   }
   return $count;
}

and

body WEIRD_WORDS                eval:weird_words()
describe WEIRD_WORDS    Count words with weird characters in them

the appropriate .cf file.

I appologize if this duplicates effort that others have done, I didn't see it anywhere. Also this is obviously a rough cut, a better version would look up the word (minus characters) in a bad-word dictionary...

--brendan



-------------------------------------------------------
This SF.net email is sponsored by: IBM Linux Tutorials.
Become an expert in LINUX or just sharpen your skills.  Sign up for IBM's
Free Linux Tutorials.  Learn everything from the bash shell to sys admin.
Click now! http://ads.osdn.com/?ad_id=1278&alloc_id=3371&op=click
_______________________________________________
Spamassassin-talk mailing list
[EMAIL PROTECTED]
https://lists.sourceforge.net/lists/listinfo/spamassassin-talk

Reply via email to