On Tue, 19 Apr 2005 22:26:23 +0300
Eran Tromer <[EMAIL PROTECTED]> wrote:

> Hi,
> 
> On 19/04/05 21:13, Gilad Ben-Yossef wrote:
> > MOSIX/OpenMOSIX is a great
> > academic excersize - a working academic excersize, but not something I
> > would use except for very specific and narrow taks in controled conditions.
> 
> That's consistent with my experience. Here at the Weizmann Institute,
> the IT department built a MOSIX-based cluster out of a dozen high-end
> machines. It failed miserably. AFAIK, the main problem was that
> migration just never happened for most user processes (even after fixing
> the default setup which disallows migration of anything invoked via ssh,
> which wasn't documented anywhere). To start with, anything that used
> shared memory and (IIRC) threads couldn't migrate. Also, anything that
> did noticable amounts of I/O got locked to its home node, even though
> everything was running on an NFS-mounted filesystem anyway [1]. Since
> all processes on the cluster had the same home node (i.e., the formal
> gateway to the cluster which everybody sshed to), they ended up having
> one overloaded node and 11 nearly idle machines.
> 

both mosix and openmosix don't migrate (posix) threads. They both didn't
migrate processes using shared memory. openmosix has a patch that allows
migrating processes using shared memory (migshm) which I believe has been
incorporated into the mainstream code by now.

I don't have experience with mosix but for openmosix io intensive processes
stay on the home node by default. You can override the decision though and
tell openmosix to treat the program as processor intensive and not io intensive.
You can also tell it whether the activity checking should be averaged over
short or long term. BTW there are some system calls that cause problems also as
they have to be executed on the home node (timeofday mainly)

>   Eran
> 
> [1] In theory it might have been possible to work around that using the
> distributed FS that comes with MOSIX/OpenMOSIX, but I wouldn't bet on
> it. I wildly guess it would require a major migration and have some
> funny stuff non-Unix semantics, and my general impression was that the
> FS is half-baked.
> 

The specialized file system was removed from openmosix (forgot its name) as it
gave some trouble. They now recommend using another dedicated shared file
system not developed by the same team (sorry, forgot its name but the mailing
list has some references to it).

NFS is not good for file sharing using openmosix, IIRC there is some problem
with syncing file changes between system which can cause file corruptions.

Another option is kerrighed, developed at INRIA. The downside is that it only
handled up to 32 CPUs last time I checked and can't add and remove nodes
dynamically, but it does support migrating posix threads and shared memory and
at least according to a benchmark done by someone affiliated with them, network
traffic and performance is better on migrated process. with openmosix (and
another option that I don't recall at the moment) pipe and socket performance
dropped dramatically even after migrating back.

BTW there is a way to allow matlab to migrate over openmosix (and
probably mosix) although you need to disable the gui interface (JAVA) using the
- nojvm flag (it can still open image windows) and disable the licence manager
heartbeat (something like TWM_HEARTBEAT_INTERVAL=-1 matlab -nojvm). It doesn't
migrate by default since the gui uses java which in turn uses threads.

> =================================================================
> To unsubscribe, send mail to [EMAIL PROTECTED] with
> the word "unsubscribe" in the message body, e.g., run the command
> echo unsubscribe | mail [EMAIL PROTECTED]
> 
>  
>  +++++++++++++++++++++++++++++++++++++++++++
>  This Mail Was Scanned By Mail-seCure System
>  at the Tel-Aviv University CC.
> 

 
 +++++++++++++++++++++++++++++++++++++++++++
 This Mail Was Scanned By Mail-seCure System
 at the Tel-Aviv University CC.

=================================================================
To unsubscribe, send mail to [EMAIL PROTECTED] with
the word "unsubscribe" in the message body, e.g., run the command
echo unsubscribe | mail [EMAIL PROTECTED]

Reply via email to