Lew wrote:
Scott David Daniels wrote:
the nub of the problem is not in the benchmarks. There is something
to be said for the good old days when you looked up the instruction
timings that you used in a little document for your machine, and could
know the cost of any loop. We are faster now, but part of the cost of
that speed is that timing is a black art.
Those good old days never existed. Those manuals never accounted for
things that affected timing even then, like memory latency or refresh
time. SRAM cache made things worse, since the published timings never
mentioned cache-miss delays. Though memory cache might seem a recent
innovation, it's been around a while. It would be challenging to find
any published timing since the commercialization of computers that
would actually tell the cost of any loop.
Things got worse when chips like the '86 family acquired multiple
instructions for doing loops, still worse when pre-fetch pipelines
became deeper and wider, absolutely Dark Art due to multi-level memory
caches becoming universal, and
throw-your-hands-up-and-leave-for-the-corner-bar with multiprocessor
NUMA systems. OSes and high-level languages complicate the matter -
you never know how large a time slice you'll get or how your source got
compiled or optimized by the runtime.
So the good old days are a matter of degree and self-deception - it
was easier to fool ourselves then that we could at least guess timings
proportionately if not absolutely, but things have definitely grown
more unpredictable as the hardware has evolved.
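(As a rough illustration of that point, not from the original posts: the
sketch below times the identical Python loop several times with timeit,
and on any modern multitasking machine the figures come back different
on every run.)

    import timeit

    def busy_loop(n=100_000):
        # The same pure-Python loop every time; only the machine state differs.
        total = 0
        for i in range(n):
            total += i
        return total

    # Five back-to-back measurements of identical code. Caches, the OS
    # scheduler, and the interpreter all get a vote, so the numbers vary.
    for run in range(5):
        print(f"run {run}: {timeit.timeit(busy_loop, number=100):.4f} s")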
Nonsense. The 6502 with static memory was precisely predictable, and
many programmers (working in machine language, naturally) counted on
it. Similarly the Novix 4000, when programmed in its native Forth.
And previous to that, I worked on several machines (in fact, I wrote the
assembler and debugger for two of them) where the only variable was the
delay every two milliseconds for dynamic memory refresh. Separate
control memory and data memory, and every instruction precisely
clocked. No instruction prefetch, no cache memory. What you see is
what you get.
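(A rough sketch, not from the original posts, of the kind of cycle
counting being described, assuming the standard published 6502 timings -
LDX #imm takes 2 cycles, DEX takes 2, and BNE takes 3 when taken on the
same page, 2 when it falls through:)

    # Classic 6502 delay loop:   LDX #count
    #                     loop:  DEX
    #                            BNE loop
    def delay_loop_cycles(count):
        cycles = 2                       # LDX #count
        cycles += (count - 1) * (2 + 3)  # DEX + taken BNE, count-1 times
        cycles += 2 + 2                  # final DEX + fall-through BNE
        return cycles

    # At 1 MHz with static RAM, one cycle is exactly one microsecond.
    print(delay_loop_cycles(10))   # 51 cycles, i.e. 51 microseconds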
Would I want to go back there? No. Sub-megahertz clocks with much less
happening on each clock mean we were operating at well under 0.01% of
present-day speeds.