AndroidBuild

Tobias Klein Sun, 07 Feb 2021 06:01:56 -0800

Hi Troy,

Thanks once more for all the details! I appreciate it.

I just grepped quickly in the SWORD source code (grep -r "upperUTF8" . | grep -v ".svn") and the method upperUTF8 appears to be used only in the following places:

./src/keys/versekey.cpp: stringMgr->upperUTF8(abbr, (unsigned int)(strlen(abbr)*2));
./src/keys/versekey.cpp: stringMgr->upperUTF8(abbr, (unsigned int)(strlen(abbr)*2));
./utilities/imp2gbs.cpp: StringMgr::getSystemStringMgr()->upperUTF8(keyBuffer.getRawData(), size-2);

I think neither of those is currently used in Ezra Project, though. At the moment I do not have the use case to parse verse keys based on any special Unicode inputs. I am only using the standard English abbreviations for verse keys and that only happens internally. So, in this case I may just process the locales.d files directly in node.js / JavaScript.

Regarding node-sword-interface and the build process for mobile platforms ... currently I have only tried Android, which works fine. iOS should technically work as well, but I have not tried that yet. The boilerplate work to make all that happen smoothly is provided by the nodejs-mobile <https://code.janeasystems.com/nodejs-mobile> Cordova plugin. That plugin contains build scripts that seamlessly compile any native node.js addons like node-sword-interface or the sqlite3 module that I am also using.

And since I am now using an API-compatible runtime environment both for Electron/nodejs and Cordova/nodejs-mobile, I did not have to add any additional glue code. One risk I see with this approach is that the guys who provide nodejs-mobile discontinue their work for some reason. It's essentially a completely separately maintained fork of nodejs (it has nothing to do with V8 actually). Originally it is based on the ChakraCore JavaScript engine of the Microsoft Edge browser. But the nodejs-mobile guys ported it to Android and iOS ...

Regarding the StringMgr native callback possibility ... yes, technically this is possible with a node native addon like node-sword-interface. I am already using such functionality for the InstallMgr and search progress feedback.

So, long story short ... if in the future a use case comes up to parse Unicode-based VerseKeys, I will implement a special StringMgr binding as you suggested. But for now I'll focus on handling the locales.d content directly in JavaScript / node.js.


I will keep you posted.

Best regards,
Tobias

On 2/6/21 11:59 PM, Troy A. Griffitts wrote:

The data is pulled from the locales.d/ files, but the toUpper logic is necessary in a number of places in the engine. Two come to mind immediately:
parsing verse references not sensitive to case

parsing LD module keys not sensitive to case
To be able to get an uppercase representation of any Unicode character, it takes a pretty hefty dataset of all known human languages -- that's why we leave it up to an external library. And yeah, because ICU is so large, that's why I don't compile it into my binaries in Bishop. Bishop is about 13MB total, which includes ~8MB of default module data (KJV, SME, StrongsGreek, StrongsHebrew). That's about 5MB for the app. If I included ICU, it would greatly increase the size. And both iOS and Android (Swift and Java) already have facilities for getting the toUpper of a string.
I hope you can steal the few lines from Bishop's native SWORD code which tell SWORD to call either Java or Swift when toUpperUTF is called.
I am sorry that this might break the nice ability to have exactly the same code on both iOS and Android (I am surprised that absolutely no changes were required for you to interface to a native library on both iOS and Android! Cordova required me to provide: Android: Java-jni layer; iOS: Swift layer. I am jealous.)
If you can think of an alternative, I am happy to listen. We could provide a better StringMgr default (I think we simply have a latin-1 single-byte transformation table for basically ASCII characters), one that includes an SW_u32 hash covering German characters, but that's going to limit the languages we support to only the ones we add to our toUpper hash, and that's not really a dataset I want to maintain.
Open to suggestions,

Troy


On 2/6/21 2:56 PM, Tobias Klein wrote:
Dear Troy,

Thank you for these explanations! I appreciate it!
For Ezra Project on Android, I am at this point simply compiling node-sword-interface with the Android cross compilers and it works. However, as I wrote, I have issues with the German Bible book names now.
Is the StringMgr functionality only used to handle the locales.d files? Or also for some content inside any SWORD modules?
If it is only used for handling the locales.d files then I would consider handling the SWORD locales.d files directly from JavaScript / node.js, which already supports Unicode.
I also checked whether I can cross-compile the ICU library and that worked, but this is a huge binary (I think 20-30 MB) and I would rather keep the APK size as small as possible.
Best regards,
Tobias

*From: *Troy A. Griffitts <mailto:scr...@crosswire.org>
*Sent: *Sunday, January 31, 2021 18:20
*To: *sword-devel@crosswire.org <mailto:sword-devel@crosswire.org>
*Subject: *Re: [sword-devel] Sword Locales / German Umlaut Issues / AndroidBuild
Dear Tobias,
My apologies for taking so long to respond to this, but I wanted to give a thorough answer. See the summary at the end if you don't care about the details.
So, SWORD has a class StringMgr, which manages strings within SWORD, and by default SWORD includes a very basic implementation, which doesn't necessarily know about or support anything beyond what the basic C string methods support.
I am sure this evokes a sense of horror in you at first, so let me explain a bit how we properly handle character sets. First, a short background: since we existed well before the Unicode world, we have multiple locale files for each language, which you will still see in the locales.d/ folder, each specifying its character encoding. Most of the time SWORD doesn't need to manipulate characters, so simply holding data, passing that data to a display frontend, and specifying a font which will handle that encoding was enough in the old world. IMPORTANT: the one place we do need to manipulate character data is to perform case-insensitive comparisons. We did this in the past by converting a string to uppercase before comparison. You'll notice this in the section for Bible book abbreviations in each locale -- the partial match key must be in a toupper state.
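To make the uppercase-before-compare idea concrete, here is a minimal, self-contained sketch. It is not SWORD's actual lookup code: the function names, the sample table and the matching rule (first key that starts with the uppercased input) are assumptions chosen for illustration only. It also shows why an ASCII-only toupper breaks down for a name like Römer: the two UTF-8 bytes of "ö" are left untouched, so the input never matches the uppercase key.

```cpp
#include <map>
#include <string>

// Uppercase ASCII letters only. Every other byte, including the bytes of
// multi-byte UTF-8 sequences such as "ö" (0xC3 0xB6), is left untouched.
std::string upperAscii(std::string s) {
    for (char &c : s) {
        if (c >= 'a' && c <= 'z') c = static_cast<char>(c - 'a' + 'A');
    }
    return s;
}

// Hypothetical lookup: return the value of the first key that starts with
// the uppercased input, or an empty string if no key matches.
std::string lookupAbbrev(const std::map<std::string, std::string> &table,
                         const std::string &input) {
    const std::string needle = upperAscii(input);
    if (needle.empty()) return "";
    for (const auto &entry : table) {
        if (entry.first.compare(0, needle.size(), needle) == 0) return entry.second;
    }
    return "";
}

// Illustrative table; keys are stored already uppercased, as UTF-8.
std::map<std::string, std::string> sampleAbbrevs() {
    return {
        {"GENESIS", "Gen"},
        {"ROMANS", "Rom"},
        {"R\xC3\x96MER", "Rom"},  // "RÖMER"
    };
}
```

With this table, "rom" and "gen" resolve as expected, while lowercase "römer" finds nothing because only its ASCII letters were uppercased. A Unicode-aware toupper is what closes that gap.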
Today, everything in SWORD prefers Unicode, specifically encoded as UTF-8. To support this:
First, we have utility functions within SWORD for working with Unicode encoded strings, see:
http://crosswire.org/svn/sword/trunk/include/utilstr.h

Specifically:

SWBuf assureValidUTF8(const char *buf);
SW_u32 getUniCharFromUTF8(const unsigned char **buf, bool skipValidation = false);
SWBuf *getUTF8FromUniChar(SW_u32 uchar, SWBuf *appendTo);
SWBuf utf8ToWChar(const char *buf);
SWBuf wcharToUTF8(const wchar_t *buf);
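
As a rough illustration of what a helper of this kind has to do, the following self-contained sketch decodes one code point from a UTF-8 byte sequence and advances the pointer. It is not the code from utilstr.h; the name decodeOne and the error convention (returning U+FFFD for a malformed byte) are assumptions made for this example, and it does not reject overlong encodings or surrogates.

```cpp
#include <cstdint>

// Decode one code point starting at *buf and advance *buf past it.
// Returns 0 at the end of the string and 0xFFFD for a malformed lead or
// continuation byte (a convention chosen for this sketch only).
std::uint32_t decodeOne(const unsigned char **buf) {
    const unsigned char *p = *buf;
    if (*p == 0) return 0;

    std::uint32_t cp;
    int extra;  // number of continuation bytes expected
    if (*p < 0x80)                { cp = *p;        extra = 0; }
    else if ((*p & 0xE0) == 0xC0) { cp = *p & 0x1F; extra = 1; }
    else if ((*p & 0xF0) == 0xE0) { cp = *p & 0x0F; extra = 2; }
    else if ((*p & 0xF8) == 0xF0) { cp = *p & 0x07; extra = 3; }
    else { *buf = p + 1; return 0xFFFD; }  // stray continuation or invalid lead byte

    ++p;
    for (int i = 0; i < extra; ++i, ++p) {
        if ((*p & 0xC0) != 0x80) { *buf = p; return 0xFFFD; }  // truncated sequence
        cp = (cp << 6) | (*p & 0x3F);
    }
    *buf = p;
    return cp;
}
```

For the bytes of "Röm" this yields 'R', then U+00F6 for the two-byte "ö", then 'm', which is the kind of code-point view a Unicode-aware toupper works on.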
To wrap this up, by subclassing StringMgr, SWORD supports implementing character encoding by linking to other libraries, e.g., ICU, Qt, etc. to handle full Unicode support. And while the StringMgr interface allows implementation of many string functions, upperUTF8 is the only real method the SWORD engine needs to work completely. Some utilities use the other methods in there, but the engine only needs this method.
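A minimal sketch of such a subclass might look like the following. To keep it self-contained, it declares a small stand-in base class instead of including the SWORD headers; the stand-in, the class names and the interpretation of max as an upper bound on the bytes processed are all assumptions. The two-argument shape of upperUTF8 follows the calls quoted from versekey.cpp. The uppercasing itself only covers ASCII and the two-byte Latin-1 letters à through þ, which is enough for German umlauts but nowhere near full Unicode.

```cpp
#include <cstring>

// Stand-in for the engine's interface, declared here only so the sketch
// compiles by itself. It is not the SWORD header.
class StringMgrLike {
public:
    virtual ~StringMgrLike() {}
    virtual char *upperUTF8(char *text, unsigned int max = 0) const = 0;
};

// Uppercases ASCII letters and the Latin-1 letters U+00E0..U+00FE in place.
// In UTF-8 those letters are 0xC3 followed by 0xA0..0xBE, and the matching
// uppercase letter is the same sequence with 0x20 subtracted from the second
// byte. U+00F7 (the division sign, 0xC3 0xB7) is not a letter and is skipped.
class Latin1UpperMgr : public StringMgrLike {
public:
    char *upperUTF8(char *text, unsigned int max = 0) const override {
        std::size_t len = std::strlen(text);
        if (max != 0 && max < len) len = max;  // assumption: max bounds the bytes touched
        unsigned char *p = reinterpret_cast<unsigned char *>(text);
        for (std::size_t i = 0; i < len; ++i) {
            if (p[i] >= 'a' && p[i] <= 'z') {
                p[i] = static_cast<unsigned char>(p[i] - 0x20);
            } else if (p[i] == 0xC3 && i + 1 < len) {
                unsigned char &t = p[i + 1];
                if (t >= 0xA0 && t <= 0xBE && t != 0xB7) {
                    t = static_cast<unsigned char>(t - 0x20);
                }
                ++i;  // skip the continuation byte just handled
            }
        }
        return text;
    }
};
```

Because the transformation never changes the byte length, it can safely work in place. Anything outside the small covered range (for example ÿ or ß, whose uppercase forms need different byte sequences) is left as-is, which is exactly the kind of gap a delegated platform or ICU implementation avoids.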
In summary, on Android, you are likely not linking to ICU when you build the native SWORD binary -- which I don't do either for Bishop. The Cordova SWORD plugin uses the SWORD java-jni bindings, which use the Java VM to implement StringMgr:
https://crosswire.org/svn/sword/trunk/bindings/java-jni/jni/swordstub.cpp
Search for: AndroidStringMgr
And on iOS the Cordova plugin uses the Swift libraries to do the same. This is done by using the SWORD flatapi call to org_crosswire_sword_StringMgr_setToUpper to provide a Swift implementation to uppercase a string.
http://crosswire.org/svn/sword/trunk/bindings/cordova/cordova-plugin-crosswire-sword/src/ios/SWORD.swift
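The general pattern behind both bindings, delegating toUpper to whatever the host platform provides, can be sketched roughly as below. This is not the flatapi or the java-jni code: the class CallbackUpper, the ToUpperFn type, the setToUpper method and the treatment of max as the buffer capacity in bytes (including the terminator) are all hypothetical names and assumptions for illustration.

```cpp
#include <cstring>
#include <functional>
#include <string>
#include <utility>

// Hypothetical callback type: the host (Java, Swift, JavaScript, ...) supplies
// a function that returns the uppercase form of a UTF-8 string.
using ToUpperFn = std::function<std::string(const std::string &)>;

class CallbackUpper {
public:
    void setToUpper(ToUpperFn fn) { toUpper_ = std::move(fn); }

    // Uppercase text in place by asking the registered callback.
    // Assumption: max is the capacity of the buffer in bytes, terminator
    // included; 0 means "just the current string plus its terminator".
    char *upperUTF8(char *text, unsigned int max = 0) const {
        if (!toUpper_) return text;  // no callback registered: leave text unchanged
        const std::string upper = toUpper_(text);
        const std::size_t capacity = max ? max : std::strlen(text) + 1;
        // Copy as much as fits. A production version would also avoid
        // cutting a multi-byte UTF-8 sequence in half when truncating.
        const std::size_t n = upper.size() < capacity - 1 ? upper.size() : capacity - 1;
        std::memcpy(text, upper.data(), n);
        text[n] = '\0';
        return text;
    }

private:
    ToUpperFn toUpper_;
};
```

The engine side stays tiny and carries no case-mapping data of its own; all of the Unicode knowledge lives in the callback, which is the trade-off described above for keeping the binary small.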
I hope this gives you the information you need to get things working for you. Please don't hesitate to ask if you need help,
Troy

On 1/17/21 11:59 AM, Tobias Klein wrote:

Dear Troy,
I'm playing with an Android build of SWORD and I get issues with the German umlauts.
So I have issues with Bible book names like Römer, Könige, etc.

The Umlauts are shown as ?.

I'm configuring the SWORD build with CMake like below (without ICU!)

I remember having similar issues on Linux when building without ICU.

How do you build SWORD for Bishop? Any suggestions?

Best regards,
Tobias
-- Check for working CXX compiler: /opt/Android/SDK/ndk/r21b/toolchains/llvm/prebuilt/linux-x86_64/bin/clang++
-- Check for working CXX compiler: /opt/Android/SDK/ndk/r21b/toolchains/llvm/prebuilt/linux-x86_64/bin/clang++ -- works
-- Detecting CXX compiler ABI info
-- Detecting CXX compiler ABI info - done
-- Detecting CXX compile features
-- Detecting CXX compile features - done
-- Check for working C compiler: /opt/Android/SDK/ndk/r21b/toolchains/llvm/prebuilt/linux-x86_64/bin/clang
-- Check for working C compiler: /opt/Android/SDK/ndk/r21b/toolchains/llvm/prebuilt/linux-x86_64/bin/clang -- works
-- Detecting C compiler ABI info
-- Detecting C compiler ABI info - done
-- Detecting C compile features
-- Detecting C compile features - done
-- Configuring your system to build libsword.
-- SWORD Version 1008900000


_______________________________________________
sword-devel mailing list: sword-devel@crosswire.org
http://crosswire.org/mailman/listinfo/sword-devel
Instructions to unsubscribe/change your settings at above page
