Re: [CentOS] Unicode related query

2010-02-02 Thread Rajagopal Swaminathan
Greetings, On Wed, Feb 3, 2010 at 11:03 AM, Joseph L. Casale wrote: > > You don't say much as to what bounds the words, spaces? Give more info, but > http://www.regular-expressions.info/unicode.html leads to some Perl solutions. Thanks for the quick reply. I have started perusing it. Perl is c

Re: [CentOS] Unicode related query

2010-02-02 Thread Joseph L. Casale
>I am able to get a english word list in by using the following command > >cat | tr -sc A-Za-z '\012' > >My question is how to specify unicode character and ASCII. >Specifically text text file containing 3 byte sequence starting with >\x0e in the tr command. > >I am able to see the character usin