Spamassassin auto-whitelist research

2004-10-21 Thread Daniel Olmedilla
Hi, I sent one mail one week ago asking for collaboration in order to do some research on distributions of ham and spam mail among users on the Web. In order to increase the probability that people trust on what I am saying, I uploaded a page on the web server of my research institution. The p

Re: Spamassassin auto-whitelist research

2004-10-16 Thread Daniel Olmedilla
Hi to all again, please read till the end. There I show some of the results I get from the autowhitelist. Just for you to which kind of information I get from it :-). > >In fact he is right in the fact that the autowhitelist is kept in memory. > >However, it keeps only the e-mail addres and two

Re: Spamassassin auto-whitelist research

2004-10-15 Thread Matt Kettler
At 03:31 PM 10/15/2004, Daniel Olmedilla wrote: In fact he is right in the fact that the autowhitelist is kept in memory. However, it keeps only the e-mail addres and two numbers for each row so assuming for example an average of 50 bytes per e-mail address (quite long) and two numbers, even if you

Re: Spamassassin auto-whitelist research

2004-10-15 Thread Daniel Olmedilla
Thank you Mark for your checks. In fact he is right in the fact that the autowhitelist is kept in memory. However, it keeps only the e-mail addres and two numbers for each row so assuming for example an average of 50 bytes per e-mail address (quite long) and two numbers, even if you had 100.00

Re: Spamassassin auto-whitelist research

2004-10-15 Thread Matt Kettler
At 11:19 AM 10/15/2004, Martin Hepworth wrote: hmm great, perl malware :-) More seriously, anyone checked the code for nasties... Disclaimer: I'm no perl expert, so treat my analysis that of someone with limited experience. I inspected the code and saw nothing terribly suspicious in my limited u

Re: Spamassassin auto-whitelist research

2004-10-15 Thread Daniel Olmedilla
Humm. I didn't expect this :-). I don't know what is "aka" but of course you can check my publications list at http://www.l3s.de/~olmedilla/pub/publications.html As you can see I have been working in e-learning and in information retrieval (with ranking algorithms and personalized web search). N

Re: Spamassassin auto-whitelist research

2004-10-15 Thread Martin Hepworth
hmm great, perl malware :-) More seriously, anyone checked the code for nasties... -- Martin Hepworth Snr Systems Administrator Solid State Logic Tel: +44 (0)1865 842300 Daniel Olmedilla wrote: Dear all, I am a Ph.D. student that works in Hanover (Germany). I am currently studying the distributio

Re: Spamassassin auto-whitelist research

2004-10-15 Thread sahil
Quoting Alex Broens <[EMAIL PROTECTED]>: [Daniel's request] > aka harvesting? His request does seem a bit shady, but I'd google Dan's previous papers before jumping to conclusions. -- Sahil Tandon

Re: Spamassassin auto-whitelist research

2004-10-15 Thread Alex Broens
Daniel Olmedilla wrote: Dear all, I am a Ph.D. student that works in Hanover (Germany). I am currently studying the distribution of spam mails and e-mail addresses. For that I am gathering some information from e-mail distributions of institutions and also individuals. I developed a script that

Spamassassin auto-whitelist research

2004-10-15 Thread Daniel Olmedilla
Dear all, I am a Ph.D. student that works in Hanover (Germany). I am currently studying the distribution of spam mails and e-mail addresses. For that I am gathering some information from e-mail distributions of institutions and also individuals. I developed a script that gathers the information