Re: Importing Unicode text to a field .. How?

Richmond Mathewson Fri, 27 May 2011 13:05:11 -0700

Something just occurred to me . . .

#  THE FOLLOWING IS A SINGLE LINE IN THE LC FIELD:
Converted from மயிலை text in/Users/sivakatirswami/Documents/Tamil/Natchintanai in Unicode/3Thannai Thannaal.txt தன்னைத்தன்னால்தன்னைத் தன்னால் அறிந்திட வேண்டுமேதானாயெங்குஞ் செறிந்திட வேண்டுமே[snip]
on mouseup
   answer file "Choose the Unicode for this song" with "OK"
   put url ("binfile:/"&  it) into tUnicode
   set the useUnicode to true
  set the unicodetext of  fld "Unicode_Script" to  tUnicode
  replace numtochar(13) with numtochar(10) in fld "unicode_Script"
# the above line restores the line breaks but destroys thetext... or rather, converts it to some other encoding displaysJapanese characters
end mouseup

Unicode text is double-byte stuff, while the CR and LF are single-bytethings.

SO . . . double-byte strings SHOULD always consist of an even number ofbytes,

AND, if a text field is flagged as containing unicodeText when theengine starts reading its contents

it will start taking "double-byte bites" of the string.

THEREFORE, inserting either numToChar(10) or numToChar(13) into adouble-byte text willthrow the engine out of kilter because it will start "biting"double-bytes "off".


TAKE A LOOK at this:

http://en.wikipedia.org/wiki/Newline

and, just possibly, you need to replace you CR/LF with :

numToChar(8232)

worth a try . . .  :)

_______________________________________________
use-livecode mailing list
[email protected]
Please visit this url to subscribe, unsubscribe and manage your subscription 
preferences:
http://lists.runrev.com/mailman/listinfo/use-livecode

Re: Importing Unicode text to a field .. How?

Reply via email to