On Tue, Jan 14, 2003 at 03:46:02PM -0500, Robert J. Accettura wrote: > 1. Just posed a bunch of rule ideas. Should keep you busy for a while > ;). Deductions mostly at this point. But I plan to work on some good > spam detection ideas later tonight.
If you can do me a favor -- go back to the bug after posting and set the milestone to be 2.6? Saves me from having to do it. :) > But why waste the extra 30 bytes, when most people (especially if you Why waste the extra 30 bytes??? Why waste the extra 4k? I had to specially write procmail rules to take html only mails and convert them into text. 1) my MUA is text based and I don't want to spawn a browser to read the message, 2) why do you feel like you should suck up my bandwidth and disk space because you don't want to send text? Messages that are just text should remain just text. I don't need to see "Hi Theo!" in bold purple font, just send me an URL to pictures of your baby I don't want them included in your mail, and I'm smart enough to not need the HTML anchor tag around the URL. This is getting off into another discussion. I personally hate HTML email, but that doesn't change the fact that at the moment it's a great spam sign. As/if that changes, the rule score will go down, the rule will get improved, or it will die. That's just how the life cycle goes. > know they can recieve HTML mail). Most clients automatically reply in > the format they recieve in. Most users don't even realize the > difference between formats. agreed, but that's another issue. > capabilities as well. As HTML becomes more common for use, plain text > will disappear, since there is no need for it. It doesnt't take to much Except that text is the standard. HTML is in addition. :) > Good, then you'll get a lot of them as I analyze more mail. As I elluded to -- just because it works for you doesn't mean it works well for everyone. FYI: Your suggestions are likely to be looked at a lot faster if you suggest rules/code to implement them. The way we generate our rules and scores is communal: people run the latest code over their corpus, we look at the results. If the rule, for instance, has too many FPs, it gets fixed or trashed. As I said before, 59% of my spam and ~0% of nonspam is HTML only for me. So it would get a strong positive score. But being communal, if you submit results that are reversed from mine, the score would go down. > Very true, but how many heed that advice. And how many check their Well, to be honest, that's not my problem. ;) In the true UNIX philosophy, we give you enough rope to hang yourself. If you want to delete the messages automatically or put them in some folder you never look at, that's your decision. Don't blame us if you start wondering why you never got that stock trade confirmation or the letter from Grammy. -- Randomly Generated Tagline: "The difference between war and sex is that sex is a lot more fun ... I don't know if you've had sex, but it's really fantastic!" - Jake Johannsen, Politically Incorrect 8/10/2001
msg12176/pgp00000.pgp
Description: PGP signature