A plan for HAM - White list for ham domains

Marc Perkel Tue, 03 Jul 2007 07:30:30 -0700

A little play on words spoofing "A plan for spam".

I have been testing a new technique for detecting ham that is workingquite well. It's nearly (or possibly at) 100% accurate in that what itidentifies is ham.

First of all you get a verified RDNS lookup on the host. Verified meansthat you do a reverse lookup and then look up the host name to see if itresolves to the same IP that you looked up. That's something spammerscan't spoof. Then you separate the name at the registrar barrier andlook up that name from a list of host domains that never send spam. Forexample, all hosts that end in apache.org are considered spam.

This idea is different that an IP based whitelist in that you are reallywhitelisting based on a list of blessed host names rather than justunnamed IP addresses.

Also - a dynamic whitelist could be generated in the fly if someonecould write a custom DNS server. Here's how it would work. You send arequest about an IP address. If the server doesn't already know the IPthen it does a reverse DNS to get the name and them looks up the name toverify the name resolves to the same IP address. If it does you thenbreak the name at the registrar barrier and do a lookup to see if thename is on the blessed list. If it is you return a cude indicating it iswhitelisted and you cache the IP of the lookup.

The master list of blessed host names could be dynamically generated bysome sort of automated reputation system where ham and spam are reportedby IP address from some trusted sources. Those domains that areconsistently producing nothing but ham make the list.

The advantage of this is increased accuracy and lower system load.Domains that are whitelisted need not be further tested and can beinstantly classified as ham and fed into the bayes learner. This shouldgreatly reduce false positives.


Who likes this idea?

A plan for HAM - White list for ham domains

Reply via email to