[PATCH] drm: micro optimise cache flushing

2012-09-20 Thread Jani Nikula
On Wed, 19 Sep 2012, Chris Wilson wrote: > On Wed, 19 Sep 2012 11:12:41 +1000, Dave Airlie wrote: >> From: Dave Airlie >> >> We hit this a lot with i915 and although we'd like to engineer things to hit >> it a lot less, this commit at least makes it consume a few less cycles. >> >> from someth

Re: [PATCH] drm: micro optimise cache flushing

2012-09-20 Thread Jani Nikula
On Wed, 19 Sep 2012, Chris Wilson wrote: > On Wed, 19 Sep 2012 11:12:41 +1000, Dave Airlie wrote: >> From: Dave Airlie >> >> We hit this a lot with i915 and although we'd like to engineer things to hit >> it a lot less, this commit at least makes it consume a few less cycles. >> >> from someth

[PATCH] drm: micro optimise cache flushing

2012-09-19 Thread Dave Airlie
From: Dave Airlie We hit this a lot with i915 and although we'd like to engineer things to hit it a lot less, this commit at least makes it consume a few less cycles. from something containing movzwl 0x0(%rip),%r10d to add%r8,%rdx I only noticed it while using perf to profile something else

[PATCH] drm: micro optimise cache flushing

2012-09-19 Thread Chris Wilson
On Wed, 19 Sep 2012 11:12:41 +1000, Dave Airlie wrote: > From: Dave Airlie > > We hit this a lot with i915 and although we'd like to engineer things to hit > it a lot less, this commit at least makes it consume a few less cycles. > > from something containing > movzwl 0x0(%rip),%r10d > to > add

Re: [PATCH] drm: micro optimise cache flushing

2012-09-19 Thread Chris Wilson
On Wed, 19 Sep 2012 11:12:41 +1000, Dave Airlie wrote: > From: Dave Airlie > > We hit this a lot with i915 and although we'd like to engineer things to hit > it a lot less, this commit at least makes it consume a few less cycles. > > from something containing > movzwl 0x0(%rip),%r10d > to > add

[PATCH] drm: micro optimise cache flushing

2012-09-18 Thread Dave Airlie
From: Dave Airlie We hit this a lot with i915 and although we'd like to engineer things to hit it a lot less, this commit at least makes it consume a few less cycles. from something containing movzwl 0x0(%rip),%r10d to add%r8,%rdx I only noticed it while using perf to profile something else