Re: Grep with non-ascii

2023-02-09 Thread Tomoaki AOKI
On Thu, 09 Feb 2023 03:24:14 +0100 "Julian H. Stacey" wrote: > > The one positive development in the world of computing that I would > > credit to Java is the earliest big push toward the adoption of UTF-8. > > I strongly hope UTF-8 becomes universally used sooner rather than > > later.

Re: Grep with non-ascii

2023-02-08 Thread Julian H. Stacey
> The one positive development in the world of computing that I would > credit to Java is the earliest big push toward the adoption of UTF-8. > I strongly hope UTF-8 becomes universally used sooner rather than > later. -- George No idea What migh

Re: Grep with non-ascii

2023-02-04 Thread George Mitchell
On 2/3/23 23:16, Tomoaki AOKI wrote: [...] And FreeBSD already has UTF-8. ;-) Drawbacks of UTF-8 are... *Han unification. Not exactly same but lookalike characters in Japanese, Chinese and Korean are fatally missingly unified. *Lack of proper support for variant forms of characters.

Re: Grep with non-ascii

2023-02-04 Thread Eivind Nicolay Evensen
Den Sat, 4 Feb 2023 08:41:17 +0700 skrev Eugene Grosbein : > 03.02.2023 21:18, Eivind Nicolay Evensen wrote: > > > Den Fri, 3 Feb 2023 19:12:32 +0700 > > skrev Eugene Grosbein : > > > >> 03.02.2023 17:06, Eivind Nicolay Evensen wrote: > >>> Hello. > >>> > >>> I just noticed this today: > >>>

Re: Grep with non-ascii

2023-02-03 Thread Tomoaki AOKI
On Fri, 3 Feb 2023 12:36:47 -0500 George Mitchell wrote: > On 2/3/23 11:06, Tomoaki AOKI wrote: > > [...] > > If this is the case like above, the only solution is to move to > > character set containing ALL characters all over the world. > > > > AFAIK, the only candidates are only two, TRON code

Re: Grep with non-ascii

2023-02-03 Thread Tomoaki AOKI
On Fri, 3 Feb 2023 17:31:55 +0100 Eivind Nicolay Evensen wrote: > Den Sat, 4 Feb 2023 01:06:05 +0900 > skrev Tomoaki AOKI : > > > On Fri, 3 Feb 2023 15:18:53 +0100 > > Eivind Nicolay Evensen wrote: > > > > > Den Fri, 3 Feb 2023 19:12:32 +0700 > > > skrev Eugene Grosbein : > > > > > > > 03.0

Re: Grep with non-ascii

2023-02-03 Thread Eugene Grosbein
03.02.2023 21:18, Eivind Nicolay Evensen wrote: > Den Fri, 3 Feb 2023 19:12:32 +0700 > skrev Eugene Grosbein : > >> 03.02.2023 17:06, Eivind Nicolay Evensen wrote: >>> Hello. >>> >>> I just noticed this today: >>> >>> elg!ene[~]> printf "bø\nhei\nøl\n" | grep ø >>> grep: trailing backslash (

Re: Grep with non-ascii

2023-02-03 Thread George Mitchell
On 2/3/23 11:06, Tomoaki AOKI wrote: [...] If this is the case like above, the only solution is to move to character set containing ALL characters all over the world. AFAIK, the only candidates are only two, TRON code [1] and Unicode (UCS, ISO/IEC 10646) [2]. And TRON code is very rarely used, a

Re: Grep with non-ascii

2023-02-03 Thread Eivind Nicolay Evensen
Den Sat, 4 Feb 2023 01:06:05 +0900 skrev Tomoaki AOKI : > On Fri, 3 Feb 2023 15:18:53 +0100 > Eivind Nicolay Evensen wrote: > > > Den Fri, 3 Feb 2023 19:12:32 +0700 > > skrev Eugene Grosbein : > > > > > 03.02.2023 17:06, Eivind Nicolay Evensen wrote: > > > > Hello. > > > > > > > > I just n

Re: Grep with non-ascii

2023-02-03 Thread Tomoaki AOKI
On Fri, 3 Feb 2023 15:18:53 +0100 Eivind Nicolay Evensen wrote: > Den Fri, 3 Feb 2023 19:12:32 +0700 > skrev Eugene Grosbein : > > > 03.02.2023 17:06, Eivind Nicolay Evensen wrote: > > > Hello. > > > > > > I just noticed this today: > > > > > > elg!ene[~]> printf "bø\nhei\nøl\n" | grep ø >

Re: Grep with non-ascii

2023-02-03 Thread Eivind Nicolay Evensen
Den Fri, 3 Feb 2023 20:39:48 +0900 skrev Tomoaki AOKI : > On Fri, 3 Feb 2023 11:06:42 +0100 > Eivind Nicolay Evensen wrote: > > > Hello. > > > > I just noticed this today: > > > > elg!ene[~]> printf "bø\nhei\nøl\n" | grep ø > > grep: trailing backslash (\) > > elg!ene[~]> echo $LC_CTYPE

Re: Grep with non-ascii

2023-02-03 Thread Eivind Nicolay Evensen
Den Fri, 3 Feb 2023 19:12:32 +0700 skrev Eugene Grosbein : > 03.02.2023 17:06, Eivind Nicolay Evensen wrote: > > Hello. > > > > I just noticed this today: > > > > elg!ene[~]> printf "bø\nhei\nøl\n" | grep ø > > grep: trailing backslash (\) > > elg!ene[~]> echo $LC_CTYPE $LANG > > nb_NO.I

Re: Grep with non-ascii

2023-02-03 Thread Eugene Grosbein
03.02.2023 17:06, Eivind Nicolay Evensen wrote: > Hello. > > I just noticed this today: > > elg!ene[~]> printf "bø\nhei\nøl\n" | grep ø > grep: trailing backslash (\) > elg!ene[~]> echo $LC_CTYPE $LANG > nb_NO.ISO8859-1 nb_NO.ISO8859-1 > > While I have the result I envisioned with gnugrep: > >

Re: Grep with non-ascii

2023-02-03 Thread Tomoaki AOKI
On Fri, 3 Feb 2023 11:06:42 +0100 Eivind Nicolay Evensen wrote: > Hello. > > I just noticed this today: > > elg!ene[~]> printf "bø\nhei\nøl\n" | grep ø > grep: trailing backslash (\) > elg!ene[~]> echo $LC_CTYPE $LANG > nb_NO.ISO8859-1 nb_NO.ISO8859-1 > > While I have the result I envisioned w