Good morning,

GPUs are SIMD processors. The hundreds of cores are great for highly parallel calculation.

In GLSL/HLSL I can write a program that is executed for a very small set of pixels (usually a 2x2 quad or a single pixel). So if you have a resolution of 10x10, the program is basically run 5x5=25 or 10x10=100 times in parallel. Boost the resolution to more realistic values like 1080p and you see how the many cores benefit the whole calculation.
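To make that concrete, here is a minimal GLSL fragment shader sketch. The main() below is launched once per pixel, with thousands of invocations in flight at the same time (the 1920x1080 resolution is hard-coded just for illustration; normally it would be passed in as a uniform):

```glsl
#version 330 core
// Minimal per-pixel program: this main() runs once for every pixel.
out vec4 fragColor;

void main() {
    // gl_FragCoord tells each invocation which pixel it is working on.
    // A simple gradient: the color depends only on this pixel's own
    // position, so no invocation has to wait for any other.
    fragColor = vec4(gl_FragCoord.x / 1920.0,
                     gl_FragCoord.y / 1080.0,
                     0.5, 1.0);
}
```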

This high degree of parallelization is only really possible because most of the work is independent. For example, when raytracing, each ray is (almost) independent of the other rays.

Still, the pixels within a 2x2 quad are often calculated dependently, because for texturing (and mip mapping) a fragment shader needs the "distance" of a value between two neighboring pixels. This is why (for texturing) you may end up with slower programs and some waiting time between threads: sometimes you need the value of the neighbor thread and have to wait until it's calculated.
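A sketch of where that neighbor dependency surfaces in GLSL: dFdx/dFdy return the difference of a value between a pixel and its neighbor in the 2x2 quad, and exactly those derivatives of the texture coordinate drive the mip level selection (the uniform/varying names here are made up for illustration):

```glsl
#version 330 core
// dFdx/dFdy compare a value against the neighboring pixel in the
// same 2x2 quad -- this is the cross-thread dependency.
uniform sampler2D tex;
in vec2 uv;
out vec4 fragColor;

void main() {
    vec2 ddx = dFdx(uv);   // change of uv toward the quad neighbor to the right
    vec2 ddy = dFdy(uv);   // change of uv toward the quad neighbor below
    // Equivalent to a plain texture(tex, uv), but with the derivatives
    // (and therefore the mip level selection) spelled out explicitly:
    fragColor = textureGrad(tex, uv, ddx, ddy);
}
```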

Well, these are very language-specific details that matter for graphics, but they apply similarly to other use cases. I can imagine that for neural networks you can just write the code for one node and execute it 500 times for 500 nodes in parallel. Imagine having this beast on a CPU with just 4 cores...
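A hypothetical GLSL compute-shader sketch of that idea, with one invocation computing one node (the buffer names, bindings, and the 500-input layer size are all made up for illustration):

```glsl
#version 430
// One invocation per "node" of a layer; dispatch ceil(nodes/64) groups.
layout(local_size_x = 64) in;

layout(std430, binding = 0) readonly  buffer Inputs  { float inputs[];  };
layout(std430, binding = 1) readonly  buffer Weights { float weights[]; };
layout(std430, binding = 2) writeonly buffer Outputs { float outputs[]; };

const uint N_INPUTS = 500u;  // illustrative layer width

void main() {
    uint node = gl_GlobalInvocationID.x;  // which node this invocation computes
    float sum = 0.0;
    for (uint i = 0u; i < N_INPUTS; i++)
        sum += weights[node * N_INPUTS + i] * inputs[i];
    outputs[node] = max(sum, 0.0);  // ReLU; all nodes computed in parallel
}
```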

I hope this helps you understand how GPU cores ("shaders") work.

Vulkan would indeed be interesting. Since we are only interested in the compute part, it might even make our programs really small; the "hello world" part of drawing triangles would be on the "client" side (writing a rasterizer, raymarcher, tracer, whatever). It could still be a lot of lines of code, but maybe we still benefit from the 10% speedup.

I still have to understand how all this "shader compilation" stuff works. In WebGL it's like: "here's my code, make a shader from it, and I'm telling you it's a fragment shader". Shader compilation happens automatically. In UE shader compilation takes a long time, and I believe in Blender, too, shaders are stored as precompiled binaries.

sirjofri

23.08.2021 06:13:53 Bakul Shah <ba...@iitbombay.org>:

Don't high-end GPUs have thousands of "cores"? Even high-end CPUs have no more than a few dozen cores, up to 128 or so. While the two kinds of cores are very different, it seems to me the GPU/CPU paths have diverged for good. Or we need some massive shift in programming languages + compilers; I lack the imagination to see how. Still, the thought of CPUs gaining the complexity of the graphics engine scares me!

-- Bakul

On Aug 22, 2021, at 12:09 PM, Paul Lalonde <paul.a.lalo...@gmail.com> wrote:

I'm pretty sure we're still re-inventing, though it's the CPU's turn to gain some of the complexity of the graphics engine.

Paul

On Sun, Aug 22, 2021, 12:05 PM Bakul Shah <ba...@iitbombay.org> wrote:
Thanks. Looks like Sutherland's "Wheel of Reincarnation"[https://www2.cs.arizona.edu/~cscheid/reading/myer-sutherland-design-of-display-processors.pdf] has not only stopped but exploded :-) Or stopped being applicable.

-- Bakul

On Aug 22, 2021, at 9:23 AM, Paul Lalonde <paul.a.lalo...@gmail.com> wrote:

It got complicated because there's no stable interface or ISA.  The hardware evolved from fixed-function to programmable in a commercial environment where the only meaningful measure was raw performance per dollar at many price points.  Every year the hardware spins and becomes more performant, usually faster than Moore's law.  With 3D APIs hiding the hardware details there is no pressure to make the hardware interface uniform, pretty, or neat.  And with the need for performance there are dozens of fixed function units that effectively need their own sub-drivers while coordinating at high performance with the other units.  The system diagrams for GPUs look complex, but they are radical simplifications of what's really on the inside.

Intel really pioneered the open driver stacks, but performance generally wasn't there.  That might be changing now, but I don't know if their recently announced discrete product line will be driver-compatible.

Paul


On Sun, Aug 22, 2021 at 8:48 AM Bakul Shah <ba...@iitbombay.org> wrote:
The FreeBSD amdgpu.ko is over 3Mbytes of compiled code. Not counting the "firmware" that gets loaded on the GPU board. drm/amd/amdgpu has 200K+ lines of source code. drm/amd over 2M lines of code. Intel's i915 seems to be about 1/10th the amd size. AIUI, this is linux GPU driver code, more or less unchanged (FreeBSD has shim code to use it). How did the interface to an SIMD processor get so complicated?

…



-- Bakul




------------------------------------------
9fans: 9fans
Permalink: 
https://9fans.topicbox.com/groups/9fans/Tad29bfc223dc4fbe-M40ea45711a1551fd53807b84
Delivery options: https://9fans.topicbox.com/groups/9fans/subscription
