On Wed, 22 Jun 2011 11:13:09 +0800, "Zou, Nanhai" <nanhai....@intel.com> wrote:

>       If I upload input buffer with movnti or movntdq (bypass cache) +
>       sfence(clear write combine buffer) in the end, clflush should
>       not be needed.

Alas, neither of these will flush existing cached data, so you must
still use clflush to ensure that the data makes it out to memory. All
that they do is avoid consuming additional cache lines.

You want to use a write combining mapping, which should give you full
bandwidth access to memory without hitting any caches. You can use the GTT
mapping as the aperture is configured for write combining access, or we
can figure out how to make PAT work.

>       Since it is CPU read only surface, clflush in not needed at all.

You'd still have to invalidate cache lines using clflush to avoid using
stale data in the CPU cache.

-- 
keith.pack...@intel.com

Attachment: pgp9XcWfQAQPl.pgp
Description: PGP signature

_______________________________________________
Intel-gfx mailing list
Intel-gfx@lists.freedesktop.org
http://lists.freedesktop.org/mailman/listinfo/intel-gfx

Reply via email to