On Thu, 29 Mar 2007, Marc Perkel wrote:

> The question was about a corpus of email. I assume that it means
> that the email is from multiple sources.

Correct. Assume for the sake of argument that the distribution of domains being checked somewhat reflects the distribution of ISP sizes. For example, there would be more aol.com and hotmail.com addresses than most other domains. Also, duplicates would be collapsed, so caching isn't really beneficial.

> So I doubt that someone running it would even be detectable by
> anyone else.

Well, yes, but whether or not you get caught does not affect the morality or courtesy of an act...

--
 John Hardin KA7OHZ    http://www.impsec.org/~jhardin/
 [EMAIL PROTECTED]    FALaholic #11174    pgpk -a [EMAIL PROTECTED]
 key: 0xB8732E79 -- 2D8C 34F4 6411 F507 136C  AF76 D822 E6E6 B873 2E79
-----------------------------------------------------------------------
 You are in a maze of twisty little protocols, all written by Microsoft.
-----------------------------------------------------------------------
 15 days until Thomas Jefferson's 264th Birthday