Re: Lying in the bath, but telling the truth.

Mark Waddingham Wed, 15 Jun 2016 09:46:15 -0700

Hi Richmond,

On 2016-06-15 18:27, Richmond wrote:

So, obviously, I will have to set a "bot" to trawl its way through my code
and replace every incidence of *numToChar* to *numToCodePoint*, and
replace the surrogate pairs in the upcoming *Grantha* interface
with "standard" Unicode addresses. The first of which should (?) be relatively simple if the global search-N-replace behaves itself, the second will be a
bother, but nothing insurmountable.

If all your instances of numToChar are where useUnicode is 'true' then you probably *won't* have to do this.

When useUnicode is true, numToChar() works as it always did - it produces two bytes which are the binary encoding of the specified unicode code unit (not codepoint - see in a minute) as UTF-16.

Now, numToChar() (with useUnicode true) never supported unicode codepoints above 65535 - however I think you already figured out how to decompose a character outside of the BMP (i.e. > 65535) into a surrogate pair: two code units which are each < 65535 and thus supported by numToChar().
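
For reference, that decomposition is plain arithmetic on the code point. A minimal sketch, written in Python rather than LiveCode script purely for illustration (the function name to_surrogate_pair is made up for this example); each returned value is a code unit below 65536, so each should be acceptable to numToChar() with useUnicode true:

```python
def to_surrogate_pair(codepoint):
    """Split a code point outside the BMP into its two UTF-16 code units."""
    if not 0x10000 <= codepoint <= 0x10FFFF:
        raise ValueError("code point is not outside the BMP")
    offset = codepoint - 0x10000        # leaves a 20-bit value
    high = 0xD800 + (offset >> 10)      # top 10 bits -> high (lead) surrogate
    low = 0xDC00 + (offset & 0x3FF)     # bottom 10 bits -> low (trail) surrogate
    return high, low


# 0x11305 lies in the Unicode Grantha block (U+11300 to U+1137F).
high, low = to_surrogate_pair(0x11305)
print(hex(high), hex(low))
```

The same two numbers, in the same order, are what you would pass to two successive numToChar() calls.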

You mention that Devawriter Pro was written against 4.5.x - if I recall correctly then this was *before* the field became more intelligent at handling unicode. Around 5.5 we changed the field so that it *understood* that a unicode code unit (any unicode char <= 65535; a surrogate pair is two code units) was a single 'char'. Prior to 5.5, the field used 'char' to mean byte (so char 1 of field 1, where the first character in a field was a unicode character, would return you the first byte of the code unit, not the code unit itself - which you would get with char 1 to 2 of field 1).
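
To make the difference concrete, here is a small model of the two views of the same UTF-16 data. It is a Python sketch, not LiveCode, and the sample text and the little-endian byte order are arbitrary choices for the example; it only illustrates 'char means byte' versus 'char means code unit'.

```python
text = "\u0915\u093e"               # two Devanagari characters, both inside the BMP
data = text.encode("utf-16-le")     # 4 bytes: two 16-bit code units

# Pre-5.5 view: 'char' is a byte, so "char 1" is only half a code unit
# and "char 1 to 2" is needed to get the whole first character.
old_char_1 = data[0:1]
old_char_1_to_2 = data[0:2]

# 5.5 view: 'char' is a whole 16-bit code unit.
units = [data[i:i + 2] for i in range(0, len(data), 2)]
new_char_1 = units[0]

print(old_char_1_to_2 == new_char_1)
```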

This latter fact probably means you will need to spend some time looking at the code which manipulates fields as, if you are using 'char' on your fields containing unicode and computing indices thereof (e.g. char 3 to 4 of field 1), you'll need to adjust for that.
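
As a rough guide to that adjustment: if the old code always addressed whole code units (ranges that start on an odd byte and end on an even one, which is an assumption about your code and not something I can check), the mapping from old byte indices to new 'char' indices is simple. A sketch in Python, with a made-up helper name:

```python
def byte_range_to_unit_range(first_byte, last_byte):
    """Map a 1-based, inclusive byte range onto 1-based code-unit indices."""
    if last_byte < first_byte:
        raise ValueError("empty or reversed range")
    if first_byte % 2 == 0 or last_byte % 2 == 1:
        raise ValueError("range does not fall on code-unit boundaries")
    return (first_byte + 1) // 2, last_byte // 2


# Old "char 3 to 4 of field 1" becomes "char 2 of field 1".
print(byte_range_to_unit_range(3, 4))
```

Any range that trips the boundary check was reading half a code unit under the old behaviour and needs looking at by hand.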

So, to sum up, the changes introduced around 5.5 are likely to cause you *more* trouble than those introduced with 7.0 - if you fix your code so it works with the 5.5 functioning of the field and make sure you put text into the field using 'set the unicodeText of <field chunk>' or 'put unicode ... into <field chunk>', then you *should* find that there is little or no need to update your unicode construction code - which has all the instances of numToChar.


Hope this helps!

Warmest Regards,

Mark.

--
Mark Waddingham ~ m...@livecode.com ~ http://www.livecode.com/
LiveCode: Everyone can create apps

_______________________________________________
use-livecode mailing list
use-livecode@lists.runrev.com
Please visit this url to subscribe, unsubscribe and manage your subscription 
preferences:
http://lists.runrev.com/mailman/listinfo/use-livecode
