I will update the script so that it can handle both maxweight and an optional exclude file.

I don't think a separate variable for the weight of rows/entries is necessary. The purpose is to stop processing at the first match, so the maxweight option would set the row/entry weight to the same value as the maxweight.

Scott, what will happen if all entries in a filter file have the weight 0 and I use MAXWEIGHT 0? Will the processing stop immediately or at the first match?

/Roger


As for the maxweight, perhaps someone can do it better, but this works for me:

In the variables section add:
set V_Maxweight=20

In the code, after the line:
if not "%v_weight%"=="" echo SKIPIFWEIGHT %v_weight%>> surbl.filter.tmp
add:
if not "%v_maxweight%"=="" echo MAXWEIGHT %v_maxweight%>> surbl.filter.tmp
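
For anyone adapting this, the two lines together might look like the sketch below. It is only a sketch: the variable and file names follow this thread, and the rest of Roger's script is not shown. One caveat: as far as I know, cmd.exe reads a single digit directly before >> as a file handle number, so with a one-digit value such as 4 the digit would be lost from the output. Putting the redirection first avoids that.

```bat
@echo off
rem Sketch only: names follow the thread, the real script may differ.
set V_Weight=20
set V_Maxweight=20

rem Start from an empty temporary filter file.
if exist surbl.filter.tmp del surbl.filter.tmp

rem Write each directive only when its variable is set.
rem The redirection comes first so that a one-digit value is not
rem mistaken for a file handle number.
if not "%V_Weight%"=="" >>surbl.filter.tmp echo SKIPIFWEIGHT %V_Weight%
if not "%V_Maxweight%"=="" >>surbl.filter.tmp echo MAXWEIGHT %V_Maxweight%
```

Leaving V_Maxweight empty skips the MAXWEIGHT line entirely, which keeps the option optional.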


Scott Fisher
Director of IT
Farm Progress Companies

[EMAIL PROTECTED] 04/16/04 12:25AM >>>
Roger,

Thanks for the fine work.  I finally got around to setting this up after
figuring out that it wasn't thousands of URLs long and my server seems
to be handling it well enough for now.

I have two suggestions for the script.

1) Add a MAXWEIGHT variable.  If you score each line at say 4 points and
set the MAXWEIGHT to 4, then the filter will stop processing on the
first hit and save resources.  I tried playing around with this to get
it to work, but I'm totally clueless when it comes to batch file
programming and I think I was hitting some sort of a reserved word.

2) Add the ability to remove listings contained in a text file (an
exclude list).  From looking over the current list of domains, there are
the following:
        - norton.com
        - webhosting.yahoo.com

These were probably in spam, but they are not unique to spam.  I also
found an entry for "pe.kg" in the list which doesn't resolve and seems
like it was probably from a parsing error.  A list of top sites from a
page linked to from the project's site shows that yahoo.com is one of
the most frequently spamvertised domains, though this is clearly not
listed in this file due to an exception on their end.

This type of test is definitely very vulnerable to pollution and it
would be great to have a way to detect such problems and add them to a
list for exclusion.
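
A rough sketch of how such an exclude list might be applied with findstr, assuming the downloaded list holds one domain per line. The file names surbl.txt, exclude.txt and surbl.clean.txt and the V_Exclude variable are made up for illustration and are not taken from Roger's script.

```bat
@echo off
rem Sketch only: file names are hypothetical, one domain per line assumed.
set V_Exclude=exclude.txt

if exist %V_Exclude% (
    rem /v keeps lines that do not match, /x requires a whole-line match,
    rem /i ignores case, /l treats patterns as literal text,
    rem /g: reads the patterns from the exclude file.
    findstr /v /x /i /l /g:%V_Exclude% surbl.txt > surbl.clean.txt
) else (
    rem No exclude file present: pass the list through unchanged.
    copy /y surbl.txt surbl.clean.txt > nul
)
```

The /x switch limits removal to exact line matches, so excluding norton.com should not also drop a longer entry that merely contains it.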

Long-term this is best suited for a DNS lookup due to various
limitations of doing a contains filter, but for now, it seems to be
working very well at adding points to things that are coming in below my
drop weight, in fact it might very well be tagging the majority of what
is scoring in my Hold level and pushing it over the top.

Matt


---
This E-mail came from the Declude.JunkMail mailing list.  To
unsubscribe, just send an E-mail to [EMAIL PROTECTED], and
type "unsubscribe Declude.JunkMail".  The archives can be found
at http://www.mail-archive.com.
