BartC wrote: > On 12/03/2016 19:26, Thomas 'PointedEars' Lahn wrote: >> BartC wrote: >>> On 12/03/2016 12:13, Marko Rauhamaa wrote: >>>> Why, look at the *English* page on Hillary Clinton: >>>> >>>> Hillary Diane Rodham Clinton /ˈhɪləri daɪˈæn ˈrɒdəm ˈklɪntən/ >>>> (born October 26, 1947) is an American politician. >>>> <URL: https://en.wikipedia.org/wiki/Hillary_Clinton> >>>> >>>> You couldn't get past the first sentence in ASCII. >>> >>> I saved that page locally as a .htm file in UTF-8 encoding. I ran a ^^^^^^^^^^^^^^^^^^^^^^ >>> modified version of my benchmark, and it appeared that 99.7% of the >>> bytes had ASCII codes. ^^^^^^^^^^^^^^^^^^^^^ >> That is a contradiction in terms. Obviously you do not know what ASCII >> is. > > What does your own analysis show of that page? > > If you had it in memory as fully expanded 32-bit Unicode values, what > proportion of those would have values below 128?
You are missing the point. -- PointedEars Twitter: @PointedEars2 Please do not cc me. / Bitte keine Kopien per E-Mail. -- https://mail.python.org/mailman/listinfo/python-list