Re: [Mesa-dev] KWin and Mesa

2011-04-22 Thread Ian Pilcher
On 04/20/2011 01:00 PM, Kenneth Graunke wrote:
> Please do let us know about any issues you come across. (If you can work
> around them, you can report them...)  Even just a ping saying "Someone
> told me KWin was broken on " is helpful.  We have the
> hardware and can always try to reproduce the problem, even if you aren't
> able to.

https://bugs.freedesktop.org/show_bug.cgi?id=32534
https://bugs.kde.org/show_bug.cgi?id=252817
https://bugzilla.redhat.com/show_bug.cgi?id=691084

-- 
============
Ian Pilcher arequip...@gmail.com
"If you're going to shift my paradigm ... at least buy me dinner first."


___
mesa-dev mailing list
mesa-dev@lists.freedesktop.org
http://lists.freedesktop.org/mailman/listinfo/mesa-dev


Re: [Mesa-dev] KWin and Mesa

2011-04-23 Thread Ian Pilcher
On 04/22/2011 11:11 PM, Mario Kleiner wrote:
> At least the visual corruption with compositing that is described in
> these reports matches this bug very well:
> 
> <https://bugs.freedesktop.org/show_bug.cgi?id=35452>
> 
> The bug is in the DRI2 implementation of the x-server and should affect
> any compositor when kms page flipping is enabled. x-servers 1.8, 1.9,
> (afaik) 1.10 x-server's would be affected. If you follow this thread
> <http://lists.x.org/archives/xorg-devel/2011-March/020716.html> it
> points to a proposed patch by Michel Daenzer, and a refined version by
> Keith Packard. I don't know if a version has landed in master already,
> but testing it would help to get it there and maybe to get it merged to
> 1.9 and 1.10.

Results of initial testing are positive.  Thank you!

Rather disappointing to read the discussion on xorg-devel.  (Talk about
"making the perfect the enemy of the good!")  I have posted my results
there; perhaps that will get things moving.

Thanks!

-- 
============
Ian Pilcher arequip...@gmail.com
"If you're going to shift my paradigm ... at least buy me dinner first."


___
mesa-dev mailing list
mesa-dev@lists.freedesktop.org
http://lists.freedesktop.org/mailman/listinfo/mesa-dev


Re: [Mesa-dev] KWin and Mesa

2011-04-25 Thread Ian Pilcher
On 04/25/2011 08:38 AM, Solerman Kaplon wrote:
> Em 23-04-2011 01:11, Mario Kleiner escreveu:
>> At least the visual corruption with compositing that is described in
>> these reports matches this bug very well:
>>
>> <https://bugs.freedesktop.org/show_bug.cgi?id=35452>
> 
> This affects other driver's too, right? Shouldn't the component gets
> changed in the bug report? (still re-reading the whole thread)

It definitely affects at least intel and radeon, and the fix is in the
X server, so that makes sense.

-- 
========
Ian Pilcher arequip...@gmail.com
"If you're going to shift my paradigm ... at least buy me dinner first."


___
mesa-dev mailing list
mesa-dev@lists.freedesktop.org
http://lists.freedesktop.org/mailman/listinfo/mesa-dev


[Mesa-dev] NULL in dri_drawable.textures borking glDrawArrays

2011-02-26 Thread Ian Pilcher
I spent some time today looking into a KWin crash I've been experiencing
when I unlock an OpenGL screensaver.  I was able to track it down to
this bug:

https://bugs.freedesktop.org/show_bug.cgi?id=33036

Sure enough, applying the patch in that bug got rid of the crash.
Unfortunately, I'm now seeing really problematic behavior when I test an
OpenGL screen saver in the KDE "Display and Monitor" applet.

Based on the stack trace from the crash, I've tried to figure out what's
going on, but I'm unable to connect all the dots.  Here's the relevant
subset of the stack from the (now fixed) crash:

#6  radeon_r300_winsys_buffer_from_handle (rws=,
whandle=0x7fff73559780, stride=0x7fff73559698, size=0x7fff7355969c) at
radeon_r300.c:122
#7  0x7fb1e84f93d6 in r300_texture_from_handle (screen=0x10336a0,
base=0x7fff73559740, whandle=) at r300_texture.c:828
#8  0x7fb1e84e94f3 in dri2_drawable_process_buffers (count=1,
buffers=0x104f748, drawable=0x104d820) at dri2.c:252
#9  dri2_allocate_textures (drawable=0x104d820, statts=,
count=1934989244) at dri2.c:276
#10 0x7fb1e84ea187 in dri_st_framebuffer_validate (stfbi=, statts=0x12e8650, count=1, out=0x7fff73559860) at dri_drawable.c:73
#11 0x7fb1e8528617 in st_framebuffer_validate (stfb=0x12e8200,
st=0x1112c70) at state_tracker/st_manager.c:156
#12 0x7fb1e8529afe in st_manager_validate_framebuffers
(st=0x1112c70) at state_tracker/st_manager.c:859
#13 0x7fb1e865a848 in st_validate_state (st=0x1112c70) at
state_tracker/st_atom.c:143
#14 0x7fb1e85bde98 in st_draw_vbo (ctx=0x10e47c0, arrays=0x11164a0,
prims=0x7fff73559f90, nr_prims=1, ib=0x0, index_bounds_valid=, min_index=0, max_index=11) at state_tracker/st_draw.c:649
#15 0x7fb1e85babf3 in vbo_draw_arrays (ctx=0x10e47c0, mode=7,
start=0, count=, numInstances=1) at vbo/vbo_exec_array.c:588
#16 0x00397581cffe in KWin::renderGLGeometry (region=..., count=12,
vertices=0x130ffa0, texture=0x1318f40, color=0x0, dim=2, stride=0) at
/usr/src/debug/kdebase-workspace-4.6.0/kwin/lib/kwinglutils.cpp:185

What I've been able to figure out is that KWin is calling glDrawArrays,
which is really vbo_draw_arrays.  (I assume that glDrawArrays is really
a macro or inline function.)  This makes it all the way "down" to
radeon_r300_winsys_buffer_from_handle, which calls
radeon_drm_bufmgr_create_buffer_from_handle.

radeon_drm_bufmgr_create_buffer_from_handle returns NULL.  This used to
cause a crash, but the NULL is now returned by
radeon_r300_winsys_buffer_from_handle.  In turn,
r300_texture_from_handle returns NULL to
dri2_drawable_process_buffers, which puts that NULL into the textures
array of its drawable parameter.

As I said above, I'm unable to trace the impact of this NULL value back
up the call stack to vbo_draw_arrays.  Maybe it's OK for the NULL to be
there and maybe it's not.  (The fact that glDrawArrays doesn't appear to
have a way to report this sort of situation tends to make me believe
that it should be handled somehow.)

Any pointers are appreciated.

Thanks!

-- 
========
Ian Pilcher arequip...@gmail.com


___
mesa-dev mailing list
mesa-dev@lists.freedesktop.org
http://lists.freedesktop.org/mailman/listinfo/mesa-dev


[Mesa-dev] Sandy Bridge heisenbug

2011-03-31 Thread Ian Pilcher
With latest batch of Fedora 15 test updates, I get GPU hangs very
shortly after logging in (with KDE desktop effects enabled).
Unfortunately, the hangs do not occur if I boot with drm.debug=14.

Any ideas on how I can gather more information?

Thanks!

-- 

Ian Pilcher arequip...@gmail.com
"If you're going to shift my paradigm ... at least buy me dinner first."


___
mesa-dev mailing list
mesa-dev@lists.freedesktop.org
http://lists.freedesktop.org/mailman/listinfo/mesa-dev


Re: [Mesa-dev] Sandy Bridge heisenbug

2011-03-31 Thread Ian Pilcher
On 03/31/2011 02:53 PM, Chris Wilson wrote:
> https://bugs.freedesktop.org/show_bug.cgi?id=35820 perhaps?

I don't think so.  My system doesn't hang, and I don't get any X crash,
just messages in the syslog (see below).

I'm not expecting a quick fix.  Just hoping for some ideas on gathering
the necessary information to debug the problem.

Thanks!

Mar 30 20:47:16 ian kernel: [  233.620717] [drm:i915_hangcheck_elapsed]
*ERROR*
Hangcheck timer elapsed... GPU hung
Mar 30 20:47:16 ian kernel: [  233.620730] [drm:kick_ring] *ERROR* Kicking
stuck semaphore on blt ring
Mar 30 20:47:18 ian kernel: [  235.121821] [drm:i915_hangcheck_elapsed]
*ERROR*
Hangcheck timer elapsed... GPU hung
Mar 30 20:47:18 ian kernel: [  235.121834] [drm:kick_ring] *ERROR* Kicking
stuck semaphore on blt ring
Mar 30 20:47:19 ian kernel: [  236.622925] [drm:i915_hangcheck_elapsed]
*ERROR*
Hangcheck timer elapsed... GPU hung
Mar 30 20:47:19 ian kernel: [  236.622937] [drm:kick_ring] *ERROR* Kicking
stuck semaphore on blt ring
Mar 30 20:47:21 ian kernel: [  238.124022] [drm:i915_hangcheck_elapsed]
*ERROR*
Hangcheck timer elapsed... GPU hung
Mar 30 20:47:21 ian kernel: [  238.124034] [drm:kick_ring] *ERROR* Kicking
stuck semaphore on blt ring
Mar 30 20:47:22 ian kernel: [  239.625122] [drm:i915_hangcheck_elapsed]
*ERROR*
Hangcheck timer elapsed... GPU hung
Mar 30 20:47:22 ian kernel: [  239.625133] [drm:kick_ring] *ERROR* Kicking
stuck semaphore on blt ring
Mar 30 20:47:24 ian kernel: [  241.126214] [drm:i915_hangcheck_elapsed]
*ERROR*
Hangcheck timer elapsed... GPU hung
Mar 30 20:47:24 ian kernel: [  241.126226] [drm:kick_ring] *ERROR* Kicking
stuck semaphore on blt ring
Mar 30 20:47:25 ian kernel: [  242.627316] [drm:i915_hangcheck_elapsed]
*ERROR*
Hangcheck timer elapsed... GPU hung
Mar 30 20:47:25 ian kernel: [  242.627328] [drm:kick_ring] *ERROR* Kicking
stuck semaphore on blt ring
Mar 30 20:47:27 ian kernel: [  244.128421] [drm:i915_hangcheck_elapsed]
*ERROR*
Hangcheck timer elapsed... GPU hung
Mar 30 20:47:27 ian kernel: [  244.128433] [drm:kick_ring] *ERROR* Kicking
stuck semaphore on blt ring

(Bugzilla'ed at https://bugzilla.redhat.com/show_bug.cgi?id=692334.)

-- 
========
Ian Pilcher arequip...@gmail.com
"If you're going to shift my paradigm ... at least buy me dinner first."


___
mesa-dev mailing list
mesa-dev@lists.freedesktop.org
http://lists.freedesktop.org/mailman/listinfo/mesa-dev


Re: [Mesa-dev] Sandy Bridge heisenbug

2011-03-31 Thread Ian Pilcher
On 03/31/2011 04:05 PM, Chris Wilson wrote:
> On Thu, 31 Mar 2011 15:01:53 -0500, Ian Pilcher  wrote:
>>
>> Mar 30 20:47:16 ian kernel: [  233.620717] [drm:i915_hangcheck_elapsed]
>> *ERROR*
>> Hangcheck timer elapsed... GPU hung
> 
> Ah. You will want
> 
> kernel commit 91355834646328e7edc6bd25176ae44bcd7386c7

Unfortunately, that patch appears to already be in the latest Fedora
kernel, kernel-2.6.38.2-9.fc15.x86_64, and I still get the hangs with
the latest Mesa packages.  (No hangs with 7.10, though.)

-- 
============
Ian Pilcher arequip...@gmail.com
"If you're going to shift my paradigm ... at least buy me dinner first."


___
mesa-dev mailing list
mesa-dev@lists.freedesktop.org
http://lists.freedesktop.org/mailman/listinfo/mesa-dev


Re: [Mesa-dev] Sandy Bridge heisenbug

2011-03-31 Thread Ian Pilcher
On 03/31/2011 05:20 PM, Chris Wilson wrote:
> However, if you are reasonably confident that a change in mesa provokes
> the issue, you can bisect the commit in question. That may give another
> clue as to where to look next.

I'm willing to take a shot.  Got a link to a good HOWTO?  (My experience
with git is very limited.)

Also, with the latest Fedora kernel I now see the problem even with
drm.debug=14.  dmesg is at
https://bugzilla.redhat.com/show_bug.cgi?id=692334.

Thanks!

-- 
============
Ian Pilcher arequip...@gmail.com
"If you're going to shift my paradigm ... at least buy me dinner first."


___
mesa-dev mailing list
mesa-dev@lists.freedesktop.org
http://lists.freedesktop.org/mailman/listinfo/mesa-dev


Re: [Mesa-dev] Sandy Bridge heisenbug

2011-04-01 Thread Ian Pilcher
On 04/01/2011 02:09 AM, Dave Airlie wrote:
> 
> try the latest mesa in koji,
> 
> I' ve pushed some reverts in,
> 
> https://bugs.freedesktop.org/show_bug.cgi?id=35853
> 
> is upstream bug.
> 

That appears to have made things worse.

With mesa-dri-drivers-7.11-0.3.20110330.0.fc15.x86_64 I get:

Apr  1 11:38:18 ian kernel: [   77.890422] [drm:i915_hangcheck_elapsed]
*ERROR* Hangcheck timer elapsed... GPU hung
Apr  1 11:38:18 ian kernel: [   77.892175] [drm:i915_do_wait_request]
*ERROR* i915_do_wait_request returns -11 (awaiting 4284 at 4281, next 4285)
Apr  1 11:38:25 ian kernel: [   84.254124] [drm:i915_hangcheck_elapsed]
*ERROR* Hangcheck timer elapsed... GPU hung
Apr  1 11:38:25 ian kernel: [   84.254161] [drm:i915_do_wait_request]
*ERROR* i915_do_wait_request returns -11 (awaiting 4529 at 4525, next 4530)
Apr  1 11:38:31 ian kernel: [   90.585892] [drm:i915_hangcheck_elapsed]
*ERROR* Hangcheck timer elapsed... GPU hung
Apr  1 11:38:31 ian kernel: [   90.585925] [drm:i915_do_wait_request]
*ERROR* i915_do_wait_request returns -11 (awaiting 4664 at 4661, next 4666)

And then KWin disables desktop effects.

With mesa-dri-drivers-7.11-0.5.20110401.0.fc15.x86_64 I get fewer error
messages:

Apr  1 11:33:52 ian kernel: [  272.178080] [drm:i915_hangcheck_elapsed]
*ERROR* Hangcheck timer elapsed... GPU hung
Apr  1 11:33:52 ian kernel: [  272.179796] [drm:i915_do_wait_request]
*ERROR* i915_do_wait_request returns -11 (awaiting 10986 at 10979, next
11027)
Apr  1 11:33:59 ian kernel: [  278.725439] [drm:i915_hangcheck_elapsed]
*ERROR* Hangcheck timer elapsed... GPU hung
Apr  1 11:33:59 ian kernel: [  278.725481] [drm:i915_do_wait_request]
*ERROR* i915_do_wait_request returns -11 (awaiting 11895 at 11862, next
11896)

But KWin never recovers.  I can see gkrellm chugging along, but I'm
unable do actually do anything in X.

(Note, however, that the system is still OK.  I can ssh in, downgrade
the mesa packages, restart X, and be up and running.)

Looks like I will be bisecting ...

-- 
========
Ian Pilcher arequip...@gmail.com
"If you're going to shift my paradigm ... at least buy me dinner first."


___
mesa-dev mailing list
mesa-dev@lists.freedesktop.org
http://lists.freedesktop.org/mailman/listinfo/mesa-dev


Re: [Mesa-dev] Sandy Bridge heisenbug

2011-04-01 Thread Ian Pilcher
Well, I've had to give up on bisecting.  It turns out that I really
don't have a clear definition of working/broken that can be reliably
tested.

With some snapshots, I get problems as soon as I log in.  With others,
a few clicks does it.  In other cases, the GPU only hangs after I've
done a bunch of "desktop cube" switches.  Etc., etc.

Even with 7.10, which I had believed to be solid, I can easily produce
a GPU hang by moving a glxgears window.

:-(

-- 
========
Ian Pilcher arequip...@gmail.com
"If you're going to shift my paradigm ... at least buy me dinner first."


___
mesa-dev mailing list
mesa-dev@lists.freedesktop.org
http://lists.freedesktop.org/mailman/listinfo/mesa-dev


[Mesa-dev] GEM object creation ioctl?

2011-04-18 Thread Ian Pilcher
I am continuing to dig into my GLMatrix-induced X crash.  It appears to
be a GEM object lifecycle problem (DRM_IOCTL_GEM_OPEN returning
ENOENT).  I've added a bunch of logging to drmIoctl to try figure out
what is going on, but I haven't been able to figure out how objects are
created.

http://lwn.net/Articles/283798/ mentions a DRM_IOCTL_GEM_CREATE ioctl,
but it doesn't appear to actually exist on my system.  Because of this,
I'm currently logging DRM_IOCTL_GEM_{CLOSE,FLINK,OPEN}, but I'm unable
to actually see objects being created.

What should I be looking for?

Thanks!

-- 
========
Ian Pilcher arequip...@gmail.com
"If you're going to shift my paradigm ... at least buy me dinner first."


___
mesa-dev mailing list
mesa-dev@lists.freedesktop.org
http://lists.freedesktop.org/mailman/listinfo/mesa-dev


Re: [Mesa-dev] GEM object creation ioctl?

2011-04-18 Thread Ian Pilcher
On 04/18/2011 02:09 PM, Ian Pilcher wrote:
> http://lwn.net/Articles/283798/ mentions a DRM_IOCTL_GEM_CREATE ioctl,
> but it doesn't appear to actually exist on my system.  Because of this,
> I'm currently logging DRM_IOCTL_GEM_{CLOSE,FLINK,OPEN}, but I'm unable
> to actually see objects being created.

Amazing how writing a question concentrates the mind.  It looks like
DRM_IOCTL_I915_GEM_CREATE is what I'm looking for.  Are there any other
ioctls that I should be logging?

Thanks!

-- 
========
Ian Pilcher arequip...@gmail.com
"If you're going to shift my paradigm ... at least buy me dinner first."


___
mesa-dev mailing list
mesa-dev@lists.freedesktop.org
http://lists.freedesktop.org/mailman/listinfo/mesa-dev


[Mesa-dev] How report error to application?

2011-04-18 Thread Ian Pilcher
Following up on my earlier GEM object lifecycle question, it looks my
problem is probably a combination of a race condition in KDE and the
fact that Mesa is currently swallowing memory allocation and GEM ioctl
errors (creating region-less intel_renderbuffers which ultimately cause
segfaults).

The obvious next step is to patch Mesa to report the error back to the
application.  How is this done?

Thanks!

-- 

Ian Pilcher arequip...@gmail.com
"If you're going to shift my paradigm ... at least buy me dinner first."


___
mesa-dev mailing list
mesa-dev@lists.freedesktop.org
http://lists.freedesktop.org/mailman/listinfo/mesa-dev