There is the Berkeley Web Term Frequency database which contains over
30 million unique terms extracted from 50 million webpages.

http://elib.cs.berkeley.edu/docfreq/index.html

On 8/31/06, Jason Pump <[EMAIL PROTECTED]> wrote:
Is there a large list of words and their frequency in the english
language? Obviously it would differ by corpus but I would like to see
what's already available.

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]



--
Dave Kor, PhD Candidate
Center for Information Mining and Extraction
School of Computing
National University of Singapore.

---------------------------------------------------------------------
To unsubscribe, e-mail: [EMAIL PROTECTED]
For additional commands, e-mail: [EMAIL PROTECTED]

Reply via email to