Hi,
On 2024-12-26 20:57, Michael Stone wrote:
> As I suggested: you need two tools or one new tool because what you're
> looking for is the min of ncpus and (available_mem / process_size). The
> result of that calculation is not the "number of cpus", it is the
> number of processes you want to run.
This is definitely true. "nproc" could potentially be repurposed to mean
"number of processes" though.
> Here's the problem: the definition of "available memory" is very vague.
> `free -hwv` output from a random machine:
>
>                total        used        free      shared     buffers       cache   available
> Mem:            30Gi       6.7Gi       2.4Gi       560Mi       594Mi        21Gi        23Gi
> Swap:           11Gi       2.5Mi        11Gi
> Comm:           27Gi        22Gi       4.3Gi
>
> Is the amount of available memory 2.4Gi, 23Gi, maybe 23+11Gi? Or 4.3Gi?
> IMO, there is no good answer to that question.
I would rather argue that there is no perfect answer to that question,
but that the 23GiB in the "available" column are good enough for most
use cases, including building stuff, IF (and only if) you take into
account that you can't have all of it committed by processes: you still
need a decent amount of cache and buffers (how much? very good
question, thank you) for the build to run smoothly and efficiently.
Swap should be ignored for all practical purposes here.
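To illustrate what I mean, here is a minimal sketch of the figure I
would start from: MemAvailable minus some headroom. The 4 GiB headroom
is an arbitrary placeholder of mine, not a recommendation, and would
really need to be tunable:

    #!/bin/sh
    # Sketch: start from MemAvailable and keep some headroom for cache
    # and buffers. The 4 GiB value is a placeholder, not a recommendation.
    avail_kb=$(awk '/^MemAvailable:/ {print $2}' /proc/meminfo)
    headroom_kb=$((4 * 1024 * 1024))
    usable_kb=$((avail_kb - headroom_kb))
    [ "$usable_kb" -gt 0 ] || usable_kb=0
    echo "usable memory: $((usable_kb / 1024 / 1024)) GiB"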
> (or else, what's wrong with using /proc/meminfo directly?)
I haven't looked at how packages currently try to compute potential
parallelism from /proc/meminfo data, but my own experience with Java
stuff, and with otherwise perfectly competent, highly qualified
engineers getting the available-RAM computation wrong, makes me not too
optimistic about the overall accuracy of these guesses.

E.g. a few hours ago:

> I fear your rebuild is ooming workers (...) it seems that some package
> is reducing its parallelism to two c++ compilers and that still
> exceeds 20G
Providing a simple tool that standardizes the calculation, along with
documented examples and guidelines, is certainly going to help here. It
would also move the logic that collects the data, parses it and
computes the result into a single place, reducing duplication and
maintenance burden across packages.
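As a rough sketch of what such a tool could boil down to (the name and
interface below are made up for illustration, nothing like it exists as
far as I know): take a per-process memory estimate as input, read
MemAvailable, and return min(ncpus, available / per_process):

    #!/bin/sh
    # Hypothetical usage: guess-parallel <per-process-MiB>
    per_proc_mib=${1:?usage: $0 <per-process-MiB>}
    ncpus=$(nproc)
    avail_mib=$(awk '/^MemAvailable:/ {print int($2 / 1024)}' /proc/meminfo)
    by_mem=$((avail_mib / per_proc_mib))
    [ "$by_mem" -ge 1 ] || by_mem=1      # never go below one process
    # The result is a number of processes, not a number of cpus.
    if [ "$by_mem" -lt "$ncpus" ]; then echo "$by_mem"; else echo "$ncpus"; fi

Build rules that know roughly how much RAM one of their compiler or
linker processes needs could then call something like that instead of
plain nproc.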
> You'd need to somehow get people to define policies, what would that
> look like?
I would suggest making it possible to input the marginal RAM
requirement per parallelized process, that is, the amount of additional
"available RAM" needed for every additional process. As that value is
very probably going to be larger for the first few processes, and as
this matters more in constrained environments (e.g. containers, busy CI
runners etc.), making it possible to define a curve (e.g. 8 GiB - 5 GiB
- 2 GiB - 2 GiB ... => 7 workers with 23 GiB of available RAM) would
allow a closer match to the constraints of these environments.
In addition, it would be nice to have an option to limit the computed
result to the number of actual cpu cores (not vcpus/threads), and
another one to set an arbitrary upper limit on the number of processes
beyond which no gains are expected.
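To make the idea concrete, here is how the curve and the two caps could
combine. Again just a sketch: the 8/5/2 curve, the 32-process cap and
the names are only examples, and nproc counts hardware threads rather
than physical cores, so a real tool would need something better there:

    #!/bin/sh
    # Declining marginal RAM cost per additional process, in GiB; the last
    # value repeats for every further process (here: 8, 5, then 2 GiB each).
    curve="8 5 2"
    max_procs=32              # arbitrary cap beyond which no gains are expected
    cores=$(nproc)            # caveat: hardware threads, not physical cores

    awk -v curve="$curve" -v cores="$cores" -v maxp="$max_procs" '
        /^MemAvailable:/ {
            budget = $2 / 1024 / 1024            # GiB
            n = split(curve, cost, " ")
            procs = 0
            while (procs < cores && procs < maxp) {
                c = (procs < n) ? cost[procs + 1] : cost[n]
                if (budget < c) break
                budget -= c
                procs++
            }
            if (procs < 1) procs = 1             # always allow one process
            print procs
        }' /proc/meminfo

With exactly 23 GiB available and enough cores, this gives
8 + 5 + 5 x 2 = 23 GiB, i.e. 7 workers, as in the example above.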
Cheers,
--
Julien Plissonneau Duquène