On Wed, Mar 21, 2018 at 11:40 AM, Anne van Kesteren <[email protected]> wrote: > On Wed, Mar 21, 2018 at 10:27 AM, Henri Sivonen <[email protected]> wrote: >> * A bunch of things atomicize URL components. I'd hope that the URLs >> were converted from UTF-16 to UTF-8 at some prior point ensuring UTF-8 >> validity, but it's hard to be sure. > > At least per the specification all URL components end up with only > ASCII code points after parsing the URL.
Good point. Thanks. I'll go ahead with doing WHATWG-compliant "with replacement" when trying to atomize invalid UTF-8 (which shouldn't happen anyway). -- Henri Sivonen [email protected] https://hsivonen.fi/ _______________________________________________ dev-platform mailing list [email protected] https://lists.mozilla.org/listinfo/dev-platform

