Understood -- I thought you might not want to take this patch, but I
went ahead and sent it out because Christian requested it, and it
seems like he doesn't think VRAM bos should ever evict back to VRAM at
all?
No, I've requested reverting the patch for now because it causes an
obviously and rather severe problem. If you guys can quickly find how to
fix it feel free to use that instead.
Is my understanding of the original commit correct in that it tries to
rewrite the eviction placements of CPU accessible bos so that they are
either size zero (fpfn and lpfn = start of inaccessible VRAM) or they
are in inaccessible VRAM (fpfn = start of inaccessible VRAM and lpfn = 0)?
That for example could work as well, but see below.
if these sorts of evictions are desirable, would it make more sense to
treat CPU inaccessible/accessible VRAM as distinct entities with their
own lrus?
Actually I'm pretty sure that it isn't desirable. See the evict function
doesn't know if we try to evict BOs because we need CPU accessible VRAM
or if we just run out of VRAM.
This code only makes sense when we need to move different BOs into the
CPU accessible part round robin because they are accessed by the CPU,
but then it is actually better to move them to GTT sooner or later.
Regards,
Christian.
Am 23.03.2017 um 16:31 schrieb Zachary Michaels:
Was userspace maybe performing concurrent CPU access to the BOs in
question?
As far as I know Julien has demonstrated that this is not the case.
I hope we can find a better solution.
Understood -- I thought you might not want to take this patch, but I
went ahead and sent it out because Christian requested it, and it
seems like he doesn't think VRAM bos should ever evict back to VRAM at
all?
Is my understanding of the original commit correct in that it tries to
rewrite the eviction placements of CPU accessible bos so that they are
either size zero (fpfn and lpfn = start of inaccessible VRAM) or they
are in inaccessible VRAM (fpfn = start of inaccessible VRAM and lpfn = 0)?
In this case, to me it seems that the simplest fix would be to iterate
using i to rewrite all the VRAM placements instead of just the first
one (rbo->placements[i] instead of rbo->placements[0]). In the case
where RADEON_GEM_NO_CPU_ACCESS is set, the second placement will be in
CPU accessible VRAM, and that doesn't seem correct to me as there is
no longer any sort of ordering for evictions. (Unfortunately I'm not
currently in a position to test whether this fixes our issue.) Sorry,
I meant to make a note of this originally.
Also, I don't claim to understand this code well enough, but I wonder:
if these sorts of evictions are desirable, would it make more sense to
treat CPU inaccessible/accessible VRAM as distinct entities with their
own lrus?
I should also note that we are experiencing another issue where the
kernel locks up in similar circumstances. As Julien noted, we get no
output, and the watchdogs don't seem to work. It may be the case that
Xorg and our process are calling ttm_bo_mem_force_space concurrently,
but I don't think we have enough information yet to say for
sure. Reverting this commit does not fix that issue. I have some small
amount of evidence indicating that bos flagged for CPU access are
getting placed in CPU inaccessible memory. Could that cause this sort
of kernel lockup?
Thanks for your help.
_______________________________________________
amd-gfx mailing list
amd-...@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/amd-gfx
_______________________________________________
dri-devel mailing list
dri-devel@lists.freedesktop.org
https://lists.freedesktop.org/mailman/listinfo/dri-devel