On Wed, 11 Oct 2017, at 11:16 AM, Gianguido SorĂ wrote: > Uhm, so the Replacer sees it as two separate entities, and replaces > the part of the composite that matches one of the cases. Sort of. The emoji is really just the "\xE2\x83\xA3" part (or "\U000020e3") which puts a keycap symbol around the previous character. The "\x32" is just the digit "2". > > What could I do to make the Replacer ignore UTF-8 composites? Is that > even possible or should I handle the presence of these empty square > boxes after the substitution phase?
Depends on what you are trying to achieve. You could replace the "\x32\xE2\x83\xA3" sequence with something else first, then do your actual replacement and restore the original after. Ian -- You received this message because you are subscribed to the Google Groups "golang-nuts" group. To unsubscribe from this group and stop receiving emails from it, send an email to golang-nuts+unsubscr...@googlegroups.com. For more options, visit https://groups.google.com/d/optout.