Scott David Daniels <scott.dani...@acm.org> wrote: >The opcode cannot simply talk to its cache, it must either go directly >to off-chip memory or communicate to other processors that it (and it >alone) owns the increment target.
In fact all it does simply talk to its cache. From the "Intel 64 and IA-32 Architectures Software Developer's Manual, Volume 3A: System Programming Guide, Part 1": For the P6 and more recent processor families, if the area of memory being locked during a LOCK operation is cached in the processor that is performing the LOCK operation as write-back memory and is completely contained in a cache line, the processor may not assert the LOCK# signal on the bus. Instead, it will modify the memory location internally and allow it's cache coherency mechanism to insure that the operation is carried out atomically. This operation is called "cache locking." The cache coherency mechanism automatically prevents two or more processors that have cached the same area of memory from simultaneously modifying data in that area. The same cache coherency mechanism that prevents ordinary "unlocked" instructions from simulanteously modifying the same cache line on two different processors also provides the guarantee with "locked" instructions. There's no additional hardware locks involved, and no additional communication required. Ross Ridge -- l/ // Ross Ridge -- The Great HTMU [oo][oo] rri...@csclub.uwaterloo.ca -()-/()/ http://www.csclub.uwaterloo.ca/~rridge/ db // -- http://mail.python.org/mailman/listinfo/python-list