Dear mail server admins :-)

I am trying to analyse the size distribution of emails, and whether it
is Zipf-distributed or if there is a better approximation. 
I have my own mails obviously, and found a old analysis at [0], but I
would like to have a up-to-date and somewhat representative sample.

For this, I would like to ask for your assistence: Could you, if you
have access to a large number of mail boxes, obtain the file sizes of
each mail, and email me that list? 

E.g. if every mail is a file in the folder inbox/
 $ find inbox/ -type f -printf '%s\n'

If you'd prefer to use a backup tar, you can do something like
 $ tar -tvzf mails.tar.gz |awk '{ print $3;}'

Any compressed format would be fine. The more files, the better ;-)
Spam-free would also be preferred. Please also let me know if
your user base might be a slightly biased sample.

If you wish, I can let you know my findings. It would be a help me a
lot.

Best regards,
Johannes
(Student at the TU Vienna, Austria)

[0] http://osdir.com/ml/freebsd.devel.net/2002-10/msg00203.html
-- 
Johannes Buchner
mail: e0625...@student.tuwien.ac.at
xmpp: buchner.johan...@amessage.at
icq:  163390666
skype:johannes_buchner
Ich freue mich über PGP/GPG verschlüsselte/signierte Mails!

Attachment: pgpDSsb2dgZ7G.pgp
Description: PGP signature

Reply via email to