Stephan Uphoff wrote:
On Thu, 2004-12-02 at 09:41, Andre Oppermann wrote:

Scott Long wrote:

5.  Clustered FS support.  SANs are all the rage these days, and
clustered filesystems that allow data to be distributed across many
storage enpoints and accessed concurrently through the SAN are very
powerful.  RedHat recently bought Sistina and re-opened the GFS source
code, so exploring this would be very interesting.

There are certain steps that can be be taken one at a time. For example
it should be relatively easy to mount snapshots (ro) from more than one
machine. Next step would be to mount a full 'rw' filesystem as 'ro' on
other boxes. This would require cache and sector invalidation broadcasting
from the 'rw' box to the 'ro' mounts.


Mhhh .. if you plan to invalidate at the disk block cache layer then you
will run into race conditions with UFS/FFS (Especially with remove
operations)
I was once called in to evaluate such a multiple reader/single writer
system based on an UFS like file system and block layer invalidation and
had to convince management to kill it. (It appeared to work and actually
made it though internal and customer acceptance testing before failing
horrible in the field).

If you send me more details on your proposed cache and sector
invalidation/cluster design I will be happy to review it.



The holy grail of course is to mount
the same filesystem 'rw' on more than one box, preferrably more than two.
This requires some more involved synchronization and locking on top of the
cache invalidation.  And make sure that the multi-'rw' cluster stays alive
if one of the participants freezes and doesn't respond anymore.

Scrolling through the UFS/FFS code I think the first one is 2-3 days of
work.  The second 2-4 weeks and the third 2-3 month to get it right.
If someone would throw up the money...



Although I don't know the specifics of your experience, I can easily imagine how hard it would be to make this work on UFS. Common operations like walking a file path to the root are nearly impossible to do reliably without an overbearing amount of synchronization. Then you have all of the problems synchronizing buffered data and metadata. Softupdates would be a nightmare, if not impossible.

Scott
_______________________________________________
[EMAIL PROTECTED] mailing list
http://lists.freebsd.org/mailman/listinfo/freebsd-hackers
To unsubscribe, send any mail to "[EMAIL PROTECTED]"

Reply via email to