Re: OT: Programming portability

Darren Tucker Sat, 18 Jun 2005 23:30:49 -0700

Chris Zakelj wrote:

I'm curious as to how programs actually get ported from one OS toanother,

Yes, some techniques make the job easier, but it depends on what theprogram does and whether you're doing a one-way port or an ongoing port.The following aren't necessarily the only ways to do porting, but it'show the Open* Portable projects are done.

If it's a one-way port (ie the port will be done once and thereaftermaintained separately), the usual method is to just change what needschanging to suit the target platform. This is effectively what happenedwhen the original ssh-1.2.12 code was 'ported' to OpenBSD.

In the case of OpenSSH and OpenNTPD Portable, they're ongoing ports (iechanges are regularly sync'ed from OpenBSD's to Portable's). There are2 main ways to accomplish this: sprinkle the code with #ifdefs orimplement the missing functionality in a compatibilty layer.

In the Portable projects, the first preference is leave the common codealone and implement the required functionality in a portability layer.This has the advantage of keeping the common code clean and if doneproperly the components are reusable. (A number of functions used byOpenNTPD Portable came unmodified from OpenSSH Portable). Sometimesthat's not possible or more effort than it's worth, so in those cases an#ifdef is used which imposes an ongoing maintenance cost (ie next time achange is made in that area in the main code, you'll have to manuallyresolve conflict when syncing changes).

For example: OpenNTPD Portable was ported to run on QNX4 (a POSIXishembedded system) by Anthony O.Zabelin. The 2 main missing pieces werethe adjtime() and poll() calls. Simplifying somewhat, the code thatused adjtime looked like this:


        d_to_tv(d, &tv);
        if (adjtime(&tv, NULL) == -1)
                log_warn("adjtime failed");

If we had used the #ifdef technique, that would have changed tosomething like this:


#ifndef __QNX__
        d_to_tv(d, &tv);
        if (adjtime(&tv, NULL) == -1)
                log_warn("adjtime failed");
#else
        usec = (int)(d * 1000000);
        if (qnx_adj_time(usec, ADJUST_RATE, NULL, NULL) == -1)
                log_warn("qnx_adj_time failed");
#endif

Now one or two of those aren't too bad, but it rapidly becomes difficultto follow once you add a few more to the same piece of code.

Instead, Anthony wrote a stand-alone replacement adjtime() functionwhich is in the portability layer (openbsd-compat/port-qnx.c). This hada higher initial cost (it's 23 lines of code in a single function plus aMakefile change instead of 6 lines listed above) but it leaves the maincode unchanged. It can also be tested separately and is reusable.

I took the same approach with poll() and built a replacement on top ofselect(), which QNX4 did have. Hey presto, it now worked on QNX4, andthe codebase is no harder to maintain.

and if certain directions are easier than others.

In general, the difficulty is directly proportional to how different thetarget platform is compared to the platforms already supported in thearea in which the program operates. In most current OSes there's a slowconvergance toward common APIs for standard languages so if you stick tothose standard APIs you life will be easier.

Newer/more featureful -> older/less featureful is usually harder thanthe other way around unless the program was originally written to stickto a common subset.

Beyond that it depends on the program. Porting a simple text filterfrom a bleeding-edge Linux to 10-year old BSD is likely to be simple,but other programs may be difficult to impossible.

That is, how does one figure out what needs to be changed in order to
make OpenNTPD work on Linux?

I had the advantage of having worked on Portable OpenSSH for a couple ofyears so had a reasonable idea what to expect, so for OpenNTPD I justcopied the code onto a Linux box, hacked the BSDisms out of the Makefileand tried to compile it. This highlighted some obvious problems (egmissing strl* functions, the lack of sa_len in struct sockaddr). Ifixed these (stole the strl* functions from OpenSSH and changedsa->sa_len into SA_LEN(sa)) and tried again. After a few iterations ofthis process it compiled and after a couple more, amazingly enough, itworked. At that point I added basic autoconf support, put a tarball onmy web page and mentioned it on [EMAIL PROTECTED]

After that, other folks and myself repeated the process on otherplatforms, slowly expanding the list that it would run on. (Theplatform list on openntpd.org is in chronological order, earliest first).

Is it generally easier to move a program from $some_bsdto $some_other_os, or from $some_other_os to $some_bsd?

Depends on the type of program, and in particular what OS-level servicesit uses.

OpenSSH, for example, has to deal with user authentication for whichthere is a large amount of variance between platforms, so the diffbetween OpenBSD's and Portable's is large.

OpenNTPD has to deal with far less variance between platforms so thediff is much smaller. (adjtime() is common and the interface is simple,but if/when it starts compensating for systematic clock skew then thatwill introduce a significant amount of platform-specific code, howevermost of that can be hidden in the compatibility layer.)

How would you even begin to port something like OpenSSH to a non-Unix
system like Windows?

I would say you would need a POSIX-like compatibility layer (eg externallike Cygwin or implement your own), otherwise you would probably have todo a one-way port. In that case you could keep the protocol code as is,but you would probably need to replace large chunks of the rest.

Does the chosen language (C, C++, Java, etc) make a differencein difficulty?

I don't use C++ or Java so I won't comment on them, but IMO the languagecan a make significant difference, but only for some things.

It is possible to write portable C. In the case of OpenSSH, most of theplatform-dependant code is there because the operating system requiresit, not the language, and unless the language provides an abstractionfor exactly what you're trying to do then the choice of language makeslittle difference.

For example (and I'm oversimplifying here), validating a user'spassword: on a traditional Unix system you encrypt the password andcompare against what getpwnam() returns. If your system has shadowpasswords, though, you need to use getspnam() since the encryptedpassword isn't returned by getpwnam() on those. Oh, and some really oldsystems based on SecureWare have another function (getprpwnam). (Thisis ignoring platform-specific functions and the attempts to standardizethis in libraries such as BSD auth or PAM, which have varying degrees ofsuccess.)

If a language provided a "authenticate_user(user, password)" then thatwould help in this case, but most don't seem to (and there are manyother examples of platform-dependant things you would need to do besidesthis one).


Anyway, some guidelines to avoid common traps for portable C:

1) write for correctness and clarity.

2) try to stick to standard functions (eg POSIX).

3) be prepared to implement your own replacements for non-standardfunctions you use.


3b) or standard functions that aren't available on you target platform.

3c) or standard functions that are broken on your target platform.

4) if possible put all the system #includes in a single header andinclude that from all of your source files. Headers vary quite a bitand it's better if you only have to deal with all that variance once.

5) use the datatypes you're supposed to. eg if you need to store a 32bit value, use "u_int32_t" not "unsigned int", if you're using it in asignal handler use "sig_atomic_t" not "int". Check and typedef ityourself if your platform doesn't.

6) Turn on all the compiler warnings and fix what it warns about. Manyare potential portability problems, even if they haven't bitten you yet.

[This list isn't exhaustive, I'm sure other folks will be able to add toit.]

When I've built from ports, I can see make files doingOS detection, but from there (not being a very good coder), I can'treally make out how it changes the code based on that. Anyrecommendations for "casual programmer" books would be cool...


A place to start would be this diff:
http://www.zip.com.au/~dtucker/openntpd/patches/ntpd-vs-openbsd.diff

It shows the (small) changes to the OpenBSD ntpd code (3.6.1) to make it"Portable". The remainder of the changes are in files that are only inPortable (take a look in openbsd-compat/ in the OpenNTPD portabledistribution).

After that, I suggest downloading the OpenBSD-specific ntpd and theequivalent Portable one and comparing them (diff -ru is your friend).It's small enough that it ought to be understandable but it's a realapplication that runs on 9 platforms (so far :-).


--
Darren Tucker (dtucker at zip.com.au)
GPG key 8FF4FA69 / D9A3 86E9 7EEE AF4B B2D4  37C9 C982 80C7 8FF4 FA69
    Good judgement comes with experience. Unfortunately, the experience
usually comes from bad judgement.

Re: OT: Programming portability

Reply via email to