On Wed, Jan 08, 2003 at 10:23:58AM +0900, Tomohiro KUBOTA wrote: > > Anyway, though I don't know such a module, your way can be very easily > > implemented. I think the easiest one is like following: > > > > $name =~ s/([\x80-\xff])/"&#".ord($1).";"/eg; > > I wrote a new filter which > - assume the input string is UTF-8 if it can be interpreted as such, > - assume it is ISO-8859-1 if not. > > Since UTF-8 encoding method is relatively strict, it is not likely that > ISO-8859-1-intended string is wrongly assumed to be UTF-8. I confirmed > that people.names has no octet stream which can be interpreted as UTF-8. > (Individual 8bit character must not be UTF-8; in UTF-8, 8bit character > must appear in series.) > > With this filter, my concern is completely solved. Also you don't need > to think about future maintainance labor when a new maintainer uses 8bit > characters for his/her name.
Sounds very good, thanks. -- 2. That which causes joy or happiness.