On 23/11/11 16:21, Martin Gregorie wrote:
> On Wed, 2011-11-23 at 15:13 +0100, Simon Loewenthal wrote:
>> I have spam that hits on these rules.
>>
>> X-Spam-Report:
>> * 1.7 URIBL_BLACK Contains an URL listed in the URIBL blacklist
>> * [URIs: europjobs.eu]
>> * 1.2 URIBL_JP_SURBL Contains an URL listed in the JP SURBL blocklist
>> * [URIs: europjobs.eu]
>> * 0.0 UNPARSEABLE_RELAY Informational: message has unparseable
>> relay lines
>> * 0.8 BAYES_50 BODY: Bayes spam probability is 40 to 60%
>> * [score: 0.5000]
>> * 1.1 DCC_CHECK Listed in DCC (http://rhyolite.com/anti-spam/dcc/)
>> * 1.4 PYZOR_CHECK Listed in Pyzor (http://pyzor.sf.net/)
>> * 0.3 DIGEST_MULTIPLE Message hits more than one network digest check
>> * 0.8 RDNS_NONE Delivered to internal network by a host with no rDNS
>>
>> What I fail to understand is why it did not hit on this local.cf rule:
>>
>> describe RBODY_JOB_DOMAINS1 English language job opportunity1
>> rawbody RBODY_JOB_DOMAINS1
>> /\@(?:axeabout|career-lists|careers-consult|eur-exlusive|europe-career|europ-exlusive|it-jobsearch\.com|uk-exlusive|tech-newposition|new-joboffers|joblists|web-newcarer|world-jobsearch|gb-totaljob|simple-jobneed|sprytex-it|europjobs.eu|businesinsiders.com)\./
>> score RBODY_JOB_DOMAINS1 4.5
>>
>> ( I tried the same by replacing |europjobs.eu| with |europjobs\.eu| in
>> case it helped, but made no difference)
>>
> What Axb said. I'd just add that your rule description appears to be
> misleading in that it seems to be a list of partial domain names rather
> than any specifically English words or phrases and that you'll get fewer
> FPs and, probably, a better hit rate if you use a meta to combine
> generic job offer phrases with something else, along the lines of:
>
> describe JOB_OFFERS Phrases typical of English language job offers
> body JOB_OFFERS /(my client|(contract|permanent) jobs))/i
> score JOB_OFFERS 0.01
>
> describe UNWANTED_JOB_OFFERS Jobs at blacklisted sites
> meta UNWANTED_JOB_OFFERS JOB_OFFERS && (URIBL_BLACK ||
> URIBL_JP_SURBL)
> score UNWANTED_JOB_OFFERS 4.5
>
> because your rule is in effect a private blacklist that duplicates what
> the URIBLs are already doing. Of course my JOB_OFFERS rule is merely an
> example. In Real Life (tm) it would be a set of rather more elaborate
> rules that you've built to recognise your particular jobspam stream.
>
>
> Martin
>
>
Oh, this is a far better idea, and it uses the results of an already
existing rule. Thank-you. I shall work on something like this.
Cheers.
--
Email simon AT klunky DOT co DOT uk
PGP is optional: 4BA78604
I won't accept your confidentiality
agreement, and your Emails are kept.
~Ö¿Ö~