On Sun, Oct 06, 2002 at 10:59:08AM +0200, martin f krafft wrote: > also sprach Tom Marshall <[EMAIL PROTECTED]> [2002.10.06.0608 +0200]: > > I've written a Bayesian e-mail filter, > > http://sourceforge.net/projects/bmf/ From the project page: > > > > bmf is a self contained and extremely efficient Bayesian mail > > filter. See Paul Graham's article "A Plan for Spam" for background > > information. It aims to be faster, smaller, and more versatile > > than similar applications. > > bogofilter already implements this. > > > I would like to become a Debian developer to maintain my own package > > (and a few others) and find a sponsor. Toward that end, I am also > > in the process of contacting a current Debian developer to sign my > > GPG key but he has not yet responded. If anyone reading this > > message is in or near Seattle, WA and would like to sign my key, > > please let me know. > > I'll sponsor it if you can tell me why it's better than bogofilter. > Where are the Debian sources?
- It is small. bogofilter is over 30 times its size. spamprobe is over 7 times its size. - It is versatile. It supports text files (compatible with bogofilter 0.6), libdb v1 to v4 (compatible with bogofilter 0.7.x), and mysql. bogofilter and spamprobe support only recent versions of libdb. - It is efficient. The incoming text is not copied to form a data structure. Sorted vectors are used to store data. The number of calls to memory allocation functions is orders of magnitude less than bogofilter. - It supports scoring and updating the word lists in a single invocation. This was finally added less than two days ago in bogofilter cvs. - It includes a utility to convert between supported formats. bogofilter and spamprobe have no facility to import/export lists. - The parser is handcrafted and is easily made to recognize the unique format of an email message (multiline headers, case insensitivity in header names, etc.) This would be difficult in a lex grammar. - It does not rely on external data structure libraries. The current release of bogofilter uses libJudy, which is a pain to download and compile. Yes, I realize that CVS and the current .deb have removed libJudy. - It is highly portable. It's written in C and compiles cleanly with no compiler warnings on several architectures. - Its author promises not to break backward compatibility without providing a clean upgrade path. My initial shot at a Debian package is available on sourceforge.net along with the tarball and rpm package. I freely admit that the current version is hacked up from looking at existing packages. I'm here because I want to learn the Right Way to do it. -- Majority, n.: That quality that distinguishes a crime from a law.
pgp1dB4FnmeIY.pgp
Description: PGP signature