There is the Berkeley Web Term Frequency database which contains over 30 million unique terms extracted from 50 million webpages.
http://elib.cs.berkeley.edu/docfreq/index.html On 8/31/06, Jason Pump <[EMAIL PROTECTED]> wrote:
Is there a large list of words and their frequency in the english language? Obviously it would differ by corpus but I would like to see what's already available. --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]
-- Dave Kor, PhD Candidate Center for Information Mining and Extraction School of Computing National University of Singapore. --------------------------------------------------------------------- To unsubscribe, e-mail: [EMAIL PROTECTED] For additional commands, e-mail: [EMAIL PROTECTED]