Re: OT: word frequency analysis

2005-01-17 Thread Thomas Arend
-BEGIN PGP SIGNED MESSAGE- Hash: SHA1 Am Montag, 17. Januar 2005 06:34 schrieb Loren Wilton: > > Probably want to nuke punctuation and capitalization before doing > > the sort. I'm too braindead at the moment, but some perl incantation > > might be the way to go, or if you're old school t

Re: OT: word frequency analysis

2005-01-17 Thread Loren Wilton
> Probably want to nuke punctuation and capitalization before doing > the sort. I'm too braindead at the moment, but some perl incantation > might be the way to go, or if you're old school then awk would probably > work. Yea, that occurred to me. Since I was pasting a spam into a text file anywa

Re: OT: word frequency analysis

2005-01-17 Thread Steve Prior
Probably want to nuke punctuation and capitalization before doing the sort. I'm too braindead at the moment, but some perl incantation might be the way to go, or if you're old school then awk would probably work. Steve Rich Puhek wrote: Loren Wilton wrote: I'm not a unix type, so how to do this is

Re: OT: word frequency analysis

2005-01-17 Thread Rich Puhek
Loren Wilton wrote: I'm not a unix type, so how to do this isn't obvious to me, but it is probably trivial. Given a file with a few paragraphs of words (multiple words per line, obviously) I want to generate a list of the individual words in descending order of occurance frequency. I'd like the fr