> You are managing it in the task struct. No need to. libnuma can handle it.
No - as noted, not all mempolicy system calls go via libnuma. > No current version of libcpuset is available. Wrong. It has not received wide publication yet, but it has been provided to various others under LGPL license. > Those exist? Never seen that. I've seen such go by. David R - is your use of the mbind and *_mempolicy system calls via libnuma, or direct system calls? I've fielded questions from others on how to use these system calls (directly, not via libnuma) for several years now, both from within and without of SGI. The glibc folks have spent some effort on refining the system call wrappers they provide for these calls, which are not used by libnuma (libnuma doesn't use the glibc wrappers for these calls.) I presume that glibc work has at least some actual users. Andi has encouraged everyone to use libnuma, but there is no serious reason why others have to. It is not like it takes thousands of lines of elaborate code to perform basic operations using these calls. A search of some old SGI release software sitting on an internal server just now suggests that products with names including histx, gru, libmpi and pcp might be directly invoking these system calls ... I didn't actually examine the source to determine whether they really use these direct calls -- just got a grep hit. I would not be surprised if some of the big batch schedulers directly invoke these system calls, but don't know for sure. In any event, we must assume that some use these system calls directly. They have been available in Linux for several years. > > With the mode bit as in my patch, there are fewer places in the user > > code that have to be gotten just right. With your way, each and > > every mbind and *_mempolicy call has to be hacked with the new flag > > if one is going to use the new nodemask bit numbering. Some of these > > Yes and that makes sure it is thought through and done right. Maybe for you. Not for the rest of us error prone mere mortals. Forcing coders to specify the same detail in multiple places, when there is no way to validate their consistency, doesn't force them to think or do it right. It increases the error rate due to inconsistencies. If you have to select a binary mode once, you've got a 50-50 chance of getting it correct with just a random try, and a half-way decent chance of noticing an error, since it would affect all such calls. If you have to say it ten times scattered about your code and the code of others you are calling and working with, that is being maintained by various people coming and going over many years, and all must be consistent, and there is -no- feedback on errors short of subtle performance issues that only show up on certain non-trivial configurations doing the particular sequence of calls that hit the necessary miscoded places, then you've got about one chance in 2**10 of getting them all the same and correct, given random poking, and close to zero chance of noticing that error in a timely fashion when it does occur. Admittedly, humans aren't random; but we're no model of perfection either. -- I won't rest till it's the best ... Programmer, Linux Scalability Paul Jackson <[EMAIL PROTECTED]> 1.925.600.0401 - To unsubscribe from this list: send the line "unsubscribe linux-kernel" in the body of a message to [EMAIL PROTECTED] More majordomo info at http://vger.kernel.org/majordomo-info.html Please read the FAQ at http://www.tux.org/lkml/