Re: Lying in the bath, but telling the truth.

Richmond Wed, 15 Jun 2016 10:39:51 -0700


On 15.06.2016 19:43, Mark Waddingham wrote:

Hi Richmond,

On 2016-06-15 18:27, Richmond wrote:
So, obviously, I will have to set a "bot" to trawl its way through my code
and replace every instance of *numToChar* with *numToCodePoint*, and
replace the surrogate pairs in the upcoming *Grantha* interface
with "standard" Unicode addresses. The first of which should (?) be relatively
simple if the global search-N-replace behaves itself, the second will be a
bother, but nothing insurmountable.
If all your instances of numToChar are where useUnicode is 'true' then you probably *won't* have to do this.
When useUnicode is true, numToChar() works as it always did - it produces two bytes which are the binary encoding of the specified unicode code unit (not codepoint - see in a minute) as UTF-16.
Now, numToChar() (with useUnicode true) never supported unicode codepoints above 65535 - however I think you already figured out how to decompose a character outside of the BMP (i.e. > 65535) into a surrogate pair: two code units, each of which is < 65535 and thus supported by numToChar().
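
For readers who have not done the decomposition by hand, here is a minimal sketch of the surrogate arithmetic described above. It is written in Python rather than LiveCode purely to show the numbers; the function names to_surrogate_pair and from_surrogate_pair are made up for this illustration and are not part of any LiveCode API. Each of the two resulting values is the kind of 16-bit code unit that numToChar() with useUnicode true is described as accepting.

```python
def to_surrogate_pair(codepoint):
    """Split a codepoint above 0xFFFF into (high, low) UTF-16 surrogates."""
    if not 0x10000 <= codepoint <= 0x10FFFF:
        raise ValueError("codepoint is not outside the BMP")
    offset = codepoint - 0x10000        # leaves a 20-bit value
    high = 0xD800 + (offset >> 10)      # top 10 bits -> high surrogate
    low = 0xDC00 + (offset & 0x3FF)     # bottom 10 bits -> low surrogate
    return high, low


def from_surrogate_pair(high, low):
    """Recombine a (high, low) surrogate pair into a single codepoint."""
    return 0x10000 + ((high - 0xD800) << 10) + (low - 0xDC00)


# U+11305 sits in the Grantha block (U+11300 to U+1137F).
high, low = to_surrogate_pair(0x11305)
print(hex(high), hex(low))  # prints: 0xd804 0xdf05
```

Going the other way, from_surrogate_pair(0xD804, 0xDF05) gives back 0x11305, which is the single value a codepoint-based function would take instead of the pair.
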
You mention that Devawriter Pro was written against 4.5.x - if I recall correctly then this was *before* the field became more intelligent at handling unicode. Around 5.5 we changed the field so that it *understood* that a unicode code unit (any unicode char <= 65535, surrogate pairs are two code units) was a single 'char'. Prior to 5.5, the field used 'char' to mean byte (so char 1 of field 1, where the first character in a field was a unicode character, would return you the first byte of the code unit, not the code unit itself - which you would get with char 1 to 2 of field 1).
This latter fact probably means you will need to spend some time looking at the code which manipulates fields as, if you are using 'char' on your fields containing unicode and computing indices thereof (e.g. char 3 to 4 of field 1), you'll need to adjust for that.
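
The index adjustment can be sketched as follows. This is only an illustration in Python, not LiveCode, and it assumes the field holds nothing but two-byte UTF-16 code units; the helper names byte_range_to_unit_range and unit_range_to_byte_range are hypothetical. Indices are 1-based, as in 'char 3 to 4 of field 1'.

```python
def byte_range_to_unit_range(first_byte, last_byte):
    """Map a 1-based byte range (the pre-5.5 meaning of 'char')
    to the 1-based range of code units that contains those bytes.

    Assumes every code unit occupies exactly two bytes (UTF-16).
    """
    first_unit = (first_byte + 1) // 2
    last_unit = (last_byte + 1) // 2
    return first_unit, last_unit


def unit_range_to_byte_range(first_unit, last_unit):
    """Map a 1-based code-unit range back to the 1-based bytes it covers."""
    return 2 * first_unit - 1, 2 * last_unit


# 'char 3 to 4' counted in bytes is the second code unit.
print(byte_range_to_unit_range(3, 4))  # prints: (2, 2)
print(unit_range_to_byte_range(2, 2))  # prints: (3, 4)
```

So, under that assumption, a chunk expression written as char 3 to 4 against the byte-based field would become char 2 against a field that counts code units. The mapping does not depend on byte order.
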
So, to sum up, the changes introduced around 5.5 are likely to cause you *more* trouble than those introduced with 7.0 - if you fix your code so it works with the 5.5 functioning of the field and make sure you put text into the field using 'set the unicodeText of <field chunk>' or 'put unicode ... into <field chunk>', then you *should* find that there is little or no need to update your unicode construction code - which has all the instances of numToChar.


This is rather interesting as all my code currently features

set the unicodeText of fld "XYZ" to the unicodeText of fld "XYZ" & numToChar(12345)


in LC/RR 4.5, to which I should add:

1. That works 100% in LC 4.5

2. I thought that was "the way" in 4.5, so don't entirely understand "the changes introduced around 5.5 are likely to cause you *more* trouble than those introduced with 7.0".


Having said that, we'll see soon enough if I come-a-cropper or not :)

Richmond.


Hope this helps!


Very much so.


Warmest Regards,

Mark.



