On Jul 1, 8:42 pm, Jim <[EMAIL PROTECTED]> wrote:
> On Jul 1, 8:29 pm, John Machin <[EMAIL PROTECTED]> wrote:
> > Comments on the above grep output:
> > 1. You have SOFT HYPHEN twice, mapping it to u'-' and '-'
>
> Hmph. I'll correct that. Thanks.

Well, maybe not. I forgot that I got the by-hand conversions from three different sources, and that's why that character appears in two different places. (I thought that listing all cases for each source was less confusing. Arguable, for sure.)

> > 2. The idea of a soft hyphen is as a hint to a hyphenator about where
> > to insert a hyphen if one is necessary and the hyphenator is suspected
> > of acting cluelessly without the hint. IMHO, asciification should
> > substitute u'', not u'-'.
>
> Thanks also here. I'll think about it.

Googling "soft hyphen" showed me that the question is not perfectly clear (some people seem to have very elaborate opinions on the topic), but I've gone with your suggestion. Thank you.

Again, I'd appreciate additional corrections. Not only do I speak only ASCII :-( but I admit to entering the data while watching a basketball game, so no doubt there are some real blunders.

Thanks,
Jim

--
http://mail.python.org/mailman/listinfo/python-list
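
For anyone following along, here is a minimal sketch of what that suggestion amounts to: SOFT HYPHEN (U+00AD) maps to the empty string, while visible hyphens and dashes map to '-'. The names `ASCII_MAP` and `asciify` and the handful of entries in the table are made up for illustration and are not taken from the module under discussion. The sketch is written for Python 3, where plain string literals are already Unicode, so the `u''` prefix used in the thread is not needed.

```python
# Illustrative only: ASCII_MAP and asciify are hypothetical names,
# and the table holds just a few sample entries.
SOFT_HYPHEN = "\u00ad"

ASCII_MAP = {
    SOFT_HYPHEN: "",   # only a hyphenation hint, so drop it entirely
    "\u2010": "-",     # HYPHEN
    "\u2013": "-",     # EN DASH
    "\u00e9": "e",     # LATIN SMALL LETTER E WITH ACUTE
}


def asciify(text, table=ASCII_MAP):
    """Replace each mapped character; pass unmapped characters through."""
    return "".join(table.get(ch, ch) for ch in text)


if __name__ == "__main__":
    # The soft hyphen vanishes instead of becoming a visible '-'.
    print(asciify("co\u00adoperate"))
```

Keeping one dictionary with one entry per character would also make a duplicate like the SOFT HYPHEN one harder to miss, although merging the three sources is a separate design choice.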