Hi,
a dedicated/fast stand-alone workstation will be in most cases a better
choice for running single jobs than running on multiple machines as the
network capabilities are a limiting factor once all nodes try to access
the disk with the data (unless locally stored on each node on an
ultra-fast M.2 SSD). Obviously, if a central computing service is
provided to you (for free), the situation may be a different one if time
does not play that big a role or, you plan to run multiple jobs in parallel.
You are correct to note that I/O operations are very important and often
one badly chosen/implemented "element(s)" slows down everything. A
well-balanced I/O speed between DDR memory, GPU (and its memory), CPU
and fast SSD will be needed for a stand-alone machine to be
useful/compatible.
My thoughts / two cents / feedback:
CPU: 1 CPU with 32 cores
GPU: 2 GPUs with 16-24 GB GDDR each
DDR: 256 GB (minimally 128 GB)
SSD: 1 TB M.2 PCIe 3.0 x 4 (for the dataset being processed)
2nd SSD: 1 TB standard SSD (not PCIe 3.0 x 4) for the linux/software etc.
Raid: 2 x 12TB HDD Disks (for general dataset storage)
Motherboard: maximum (all possible) number of PCIe lanes implemented and
with 10GB ethernet (for example to connect 2 dedicated machines with one
another)
- 32 core CPU will do fine unless budget plays no role or, you want to
outperform the GPUs: The 3990X will out-render an RTX 2060 Super. Extra
money is better spent on professional GPUs and 256 GB DDR memory
(instead of the minimum 128 GB)
- 2 GPUs with 16-24 GB GDDR each in order to optimize data shuffling: 2
GPU cards can be each run at PCIe 3.0 x16 speed instead of more cards at
x8 speed. More memory on the card is more efficient/important than the
total number of cards / cuda-cores. With 2 cards you will have less
problems with heat production in the case and no problem with the supply
of power. Many do not notice that because of heat, CPUs and especially
GPUs are throttled severely: Consumer RTX cards are not designed for the
purpose of running 4 of them on one motherboard / in one case! The hot
air is not "exported" out of the case by consumer RTX cards. Finally,
Nvidia still generously allows to install the cuda drivers on consumer
RTX cards but actually not in the environment of a central university
facility (read the small print)! Further, the cuda software profits from
having professional quadro cards installed such as the RTX 5000 or 6000,
which in addition do not get that hot!
- 256 GB for big datasets, minimally 128 GB but with the ability for
later upgrade -> octa-channel mainboard
- 1 TB M.2 PCIe 3.0 x 4 ultra-fast SSD disk for placing the dataset
under process as it will be read more than once during processing: Many
mainboards allow to install only one ultra-fast M.2 disk, for all other
software including Linux and Relion you will thus need a normal SSD. Buy
a Samsung 980 Pro for placing the data to be processed as these will
last longer but be prepared to replace it after some time because of the
heavy I/O burden....
- Motherboard with maximum possible number of PCIe lanes -> Threadripper
motherboard: No consumer motherboard supplies the maximum possible
number of lanes! X299 motherboards (Intel) and several proposedly
high-end Intel CPUs suffer from the same "problem". Beware, PCIe lanes
or other "ports" may become disabled once you plug-in further M.2 SSD
disks etc. Make really sure to study/scrutinize the small print in the
manual of the motherboard, it will pay off.
Cheers,
Jeroen
Am 25.03.21 um 13:29 schrieb Pierre-Damien Coureux:
Hi everyone
we are thinking in buying new nodes for our data processing with
relion. We still have technical questions that need to be addressed
and I couldn't find recent benchmarks for the answers:
- what's the best choice between buying 2 nodes with 4xGPU or 1 node
with 8xGPU with RTX 3080/3090 cards. In terms of speed calculation, is
1 node with 8xGPU still equivalent to 3 nodes with 4xGPU ?
- Comparing RTX 3080 (8704 cores, 10 Go GDDR, 2-Slot height) and RTX
3090 cards (10496 cores, 24 Go GDDR, 3-Slot height), would 4x3090 GPU
cards be more advantageous in one node if you can run relion with 2
mpi/card (because of the 24 Go RAM/card) instead of having one node
with 8x3080 GPU cards and run 1 mpi/card?
- for a node with 8xGPU with RTX 30XX cards, because of the speed of
the cards, would the RAID SSD drive Bus speed (1 Go/s) be enough to
"feed" all the GPU without slowing down the system ? Is extra
PCI-Express card with SSD at 3 Go/s a preferred choice for this kind
of node ?
Any feedback is welcome
Pierre-Damien Coureux
--
*Dr.math. et dis. nat.Jeroen R. Mesters*
Deputy, Lecturer, Program Coordinator /Infection Biology
/
<http://www.uni-luebeck.de/studium/studiengaenge/infection-biology/introduction.html>Visiting
Professorship (South Bohemian University) in Biophysics
Logo Uni Lübeck
*University of Lübeck*
Center for Structural and Cell Biology in Medicine
*Institute of Biochemistry*
Tel +49 451 3101 3105 (secretariate 3101)
Fax +49 451 3101 3104
jeroen.mest...@uni-luebeck.de <mailto:jeroen.mest...@uni-luebeck.de>
www.biochem.uni-luebeck.de <http://www.biochem.uni-luebeck.de>
*Ratzeburger Allee 160
23538 Lübeck, Schleswig-Holstein
Germany*
########################################################################
To unsubscribe from the CCP4BB list, click the following link:
https://www.jiscmail.ac.uk/cgi-bin/WA-JISC.exe?SUBED1=CCP4BB&A=1
This message was issued to members of www.jiscmail.ac.uk/CCP4BB, a mailing list
hosted by www.jiscmail.ac.uk, terms & conditions are available at
https://www.jiscmail.ac.uk/policyandsecurity/