On 23/11/11 16:21, Martin Gregorie wrote:
> On Wed, 2011-11-23 at 15:13 +0100, Simon Loewenthal wrote:
>> I have spam that hits on these rules.
>>
>> X-Spam-Report:
>>     *  1.7 URIBL_BLACK Contains an URL listed in the URIBL blacklist
>>     *      [URIs: europjobs.eu]
>>     *  1.2 URIBL_JP_SURBL Contains an URL listed in the JP SURBL blocklist
>>     *      [URIs: europjobs.eu]
>>     *  0.0 UNPARSEABLE_RELAY Informational: message has unparseable
>>     *      relay lines
>>     *  0.8 BAYES_50 BODY: Bayes spam probability is 40 to 60%
>>     *      [score: 0.5000]
>>     *  1.1 DCC_CHECK Listed in DCC (http://rhyolite.com/anti-spam/dcc/)
>>     *  1.4 PYZOR_CHECK Listed in Pyzor (http://pyzor.sf.net/)
>>     *  0.3 DIGEST_MULTIPLE Message hits more than one network digest check
>>     *  0.8 RDNS_NONE Delivered to internal network by a host with no rDNS
>>
>> What I fail to understand is why it did not hit on this local.cf rule:
>>
>> describe RBODY_JOB_DOMAINS1 English language job opportunity1
>> rawbody RBODY_JOB_DOMAINS1 /\@(?:axeabout|career-lists|careers-consult|eur-exlusive|europe-career|europ-exlusive|it-jobsearch\.com|uk-exlusive|tech-newposition|new-joboffers|joblists|web-newcarer|world-jobsearch|gb-totaljob|simple-jobneed|sprytex-it|europjobs.eu|businesinsiders.com)\./
>> score    RBODY_JOB_DOMAINS1 4.5
>>
>> ( I tried the same by replacing |europjobs.eu| with |europjobs\.eu| in
>> case it helped, but made no difference)
>>
> What Axb said. I'd just add that your rule description appears to be
> misleading in that it seems to be a list of partial domain names rather
> than any specifically English words or phrases and that you'll get fewer
> FPs and, probably, a better hit rate if you use a meta to combine
> generic job offer phrases with something else, along the lines of:
>
> describe JOB_OFFERS Phrases typical of English language job offers
> body     JOB_OFFERS /(my client|(contract|permanent) jobs)/i
> score    JOB_OFFERS 0.01
>
> describe UNWANTED_JOB_OFFERS Jobs at blacklisted sites
> meta     UNWANTED_JOB_OFFERS JOB_OFFERS && (URIBL_BLACK || URIBL_JP_SURBL)
> score    UNWANTED_JOB_OFFERS 4.5
>
> because your rule is in effect a private blacklist that duplicates what
> the URIBLs are already doing. Of course my JOB_OFFERS rule is merely an
> example. In Real Life (tm) it would be a set of rather more elaborate
> rules that you've built to recognise your particular jobspam stream.
>
>
> Martin
>
>
Oh, this is a far better idea, and it uses the results of already
existing rules.  Thank you. I shall work on something like this.
Cheers.
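
For the record, a minimal sketch of why the original rawbody rule could
not fire on this message, assuming the domain only shows up inside a URL
(as the URIBL hits suggest). Python's re module stands in for
SpamAssassin's Perl regex engine here, the pattern is a shortened copy of
RBODY_JOB_DOMAINS1, and the sample strings are made up for illustration:

```python
import re

# Shortened copy of the RBODY_JOB_DOMAINS1 pattern: a literal "@", one
# alternative, then a mandatory literal "." after the alternative.
RULE = re.compile(r"@(?:axeabout|europjobs.eu|businesinsiders.com)\.")

# Hypothetical sample bodies, not taken from the real message.
samples = {
    "url_only": "Apply at http://europjobs.eu/apply today",
    "address_with_tld": "Write to hr@europjobs.eu for details",
    "address_bare_label": "Write to hr@axeabout.com for details",
}

# True where the pattern matches somewhere in the text.
results = {name: bool(RULE.search(text)) for name, text in samples.items()}

for name, hit in results.items():
    print(f"{name}: {'hit' if hit else 'miss'}")
```

In this sketch only the bare label preceded by "@" matches. The URL has
no "@" in front of the domain, and any alternative that already carries
its TLD (europjobs.eu, it-jobsearch\.com, businesinsiders.com) would
need yet another dot after the TLD because of the trailing \. in the
pattern.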

-- 
        Email  simon AT klunky DOT co DOT uk   
        PGP is optional: 4BA78604
        I won't accept your confidentiality
        agreement, and your Emails are kept.
                       ~Ö¿Ö~
