Hi,

Please help review this patch to reduce stack frame size per GPU thread. The 
default allocation size per thread (1024 bytes) is excessive and can be reduced 
to 128 bytes based on nvidia cuda kernel compilation statistics. This should 
help with reducing video memory usage per cuda context.

Thanks,
Ben

-----------------------------------------------------------------------------------
This email message is for the sole use of the intended recipient(s) and may 
contain
confidential information.  Any unauthorized review, use, disclosure or 
distribution
is prohibited.  If you are not the intended recipient, please contact the 
sender by
reply email and destroy all copies of the original message.
-----------------------------------------------------------------------------------

Attachment: 0001-Reduce-cuda-context-s-stack-frame-size-limit-through.patch
Description: 0001-Reduce-cuda-context-s-stack-frame-size-limit-through.patch

_______________________________________________
ffmpeg-devel mailing list
ffmpeg-devel@ffmpeg.org
http://ffmpeg.org/mailman/listinfo/ffmpeg-devel

Reply via email to