Re: [dev] [sbase] [PATCH-UPDATE] Rewrite tr(1) in a sane way

2015-01-11 Thread FRIGN
On Sat, 10 Jan 2015 22:47:09 +0100 Markus Wichmann wrote: > You wanted to be Unicode compatible, right? Because in that case I > expect [:alpha:] to be the class of all characters in General Category L > (that is, Lu, Ll, Lt, Lm, or Lo). That includes a few more characters > than just A-Z and a-z

Re: [dev] [sbase] [PATCH-UPDATE] Rewrite tr(1) in a sane way

2015-01-10 Thread Dimitris Papastamos
On Sat, Jan 10, 2015 at 11:19:20PM +0100, Dmitrij D. Czarkoff wrote: > FRIGN said: > > On Sat, 10 Jan 2015 02:52:09 +0100 > > "Dmitrij D. Czarkoff" wrote: > > > > > > +#define UPPER "A-Z" > > > > +#define LOWER "a-z" > > > > +#define PUNCT "!\"#$%&'()*+,-./:;<=>?@[\\]^_`{|}~" > > > > > > These d

Re: [dev] [sbase] [PATCH-UPDATE] Rewrite tr(1) in a sane way

2015-01-10 Thread random832
On Sat, Jan 10, 2015, at 19:11, Ian D. Scott wrote: > On Sat, Jan 10, 2015 at 06:56:45PM -0500, random...@fastmail.us wrote: > Actually, ẞ, capital of ß, was added in Unicode 5.1. There are probably > others letters with this issue, however. My main point was that you've got to be careful that

Re: [dev] [sbase] [PATCH-UPDATE] Rewrite tr(1) in a sane way

2015-01-10 Thread Ian D. Scott
On Sat, Jan 10, 2015 at 06:56:45PM -0500, random...@fastmail.us wrote: > On Sat, Jan 10, 2015, at 16:47, Markus Wichmann wrote: > > You wanted to be Unicode compatible, right? Because in that case I > > expect [:alpha:] to be the class of all characters in General Category L > > (that is, Lu, Ll, L

Re: [dev] [sbase] [PATCH-UPDATE] Rewrite tr(1) in a sane way

2015-01-10 Thread random832
On Sat, Jan 10, 2015, at 16:47, Markus Wichmann wrote: > You wanted to be Unicode compatible, right? Because in that case I > expect [:alpha:] to be the class of all characters in General Category L > (that is, Lu, Ll, Lt, Lm, or Lo). That includes a few more characters > than just A-Z and a-z. And

Re: [dev] [sbase] [PATCH-UPDATE] Rewrite tr(1) in a sane way

2015-01-10 Thread Dmitrij D. Czarkoff
FRIGN said: > On Sat, 10 Jan 2015 02:52:09 +0100 > "Dmitrij D. Czarkoff" wrote: > > > > +#define UPPER "A-Z" > > > +#define LOWER "a-z" > > > +#define PUNCT "!\"#$%&'()*+,-./:;<=>?@[\\]^_`{|}~" > > > > These definitions hugely misrepresent corresponding character classes. > > I interpreted the

Re: [dev] [sbase] [PATCH-UPDATE] Rewrite tr(1) in a sane way

2015-01-10 Thread Markus Wichmann
On Sat, Jan 10, 2015 at 08:51:03PM +0100, FRIGN wrote: > On Sat, 10 Jan 2015 02:52:09 +0100 > "Dmitrij D. Czarkoff" wrote: > > > > +#define UPPER "A-Z" > > > +#define LOWER "a-z" > > > +#define PUNCT "!\"#$%&'()*+,-./:;<=>?@[\\]^_`{|}~" > > > > These definitions hugely misrepresent corresponding

Re: [dev] [sbase] [PATCH-UPDATE] Rewrite tr(1) in a sane way

2015-01-10 Thread FRIGN
On Sat, 10 Jan 2015 02:52:09 +0100 "Dmitrij D. Czarkoff" wrote: > > +#define UPPER "A-Z" > > +#define LOWER "a-z" > > +#define PUNCT "!\"#$%&'()*+,-./:;<=>?@[\\]^_`{|}~" > > These definitions hugely misrepresent corresponding character classes. I interpreted the character classes by default for

Re: [dev] [sbase] [PATCH-UPDATE] Rewrite tr(1) in a sane way

2015-01-09 Thread Dmitrij D. Czarkoff
FRIGN said: > +#define UPPER "A-Z" > +#define LOWER "a-z" > +#define PUNCT "!\"#$%&'()*+,-./:;<=>?@[\\]^_`{|}~" These definitions hugely misrepresent corresponding character classes. -- Dmitrij D. Czarkoff

Re: [dev] [sbase] [PATCH-UPDATE] Rewrite tr(1) in a sane way

2015-01-09 Thread FRIGN
On Fri, 9 Jan 2015 20:39:48 +0100 FRIGN wrote: > sin just told me the patch was missing chartorunearr.c which in fact is the case. Here's an updated patch which should cleanly apply to a vanilla codebase at HEAD. Cheers FRIGN -- FRIGN >From f626eecfb757ab46cab7f16dc439258a6a497f1b Mon Sep