Re: [FFmpeg-devel] Development of a CUDA accelerated variant of the libav vf_tonemap

Felix LeClair Tue, 12 Jan 2021 13:13:27 -0800

That's great! Any way for me to pull that branch or otherwisecontribute?

Have been using FFmpeg for a few years now, so hopping to be able togive back.


On Tue, Jan 12, 2021 at 5:55 am, Lynne <d...@lynne.ee> wrote:

Jan 11, 2021, 23:27 by felix.leclair...@hotmail.com<mailto:felix.leclair...@hotmail.com>:
Hi guys and gals, first post on this mailing list, apologies forany formatting/stylistic snafus
TLDR; we currently have tone mapping filters (typically used to mapcontent from a 10bit HDR source to an 8bit SDR output) that are doneon CPU with Zscale from Zlib, or hardware implementations usingVAAPI or OpenCL. Having a version implemented in CUDA would roundout the main HWaccels types.
 Context:
I'm a computer engineering student up in Canada with an interestin high efficiency distributed processing. As a personal project I'mtrying to build a cluster of Nvidia Jetson Nano's to be able tohandle a few dozen streams (mix of SD, HD, FHD, UHD, 4kHDR) at oncewhile drawing south of 100W at peak. These little devices can doanywhere from 1 to 9 streams of content at a time depending onresolution/framerate in hardware in any mix of HEVC or H.264, so 3of them should get me most of the way to where I want to go (thiswould be a 30W package capable of ~12 2160p30@10 bit -> 1080p30 8bitstreams).
The issue is that, 4 little arm64 cores are just not going to beable to tonemap using Zscale in real time, even with the encoder anddecoders sharing memory with the CPU (so no PCIe memcopy penalty).On the other hand, the built in GPU and the relative simplicity ofmost tone mapping algorithms (say hable) should make quick work ofthis. Unfortunately (or fortunately for me to learn with?) thereisn't a CUDA version of the filter.
 Question/guidance:
I've read through the doc on how to write filters, as well aslooking at the other cuda filters currently in the source and have ageneral idea of where I'm going, but haven't been able to fully naildown how to access frames from hwupload_cuda passed tovf_tonemap_cuda.c which in turn passes that frame tovf_tonemap_cuda.cu for processing. I have a repo with everythingI've been pulling together for my project, but the piece of interestis under */cuda_filter/ in the source tree.<<https://github.com/Camofelix/Jetson_ffmpeg_trancode_cluster/>>
 Would anyone mind helping me out with how to architect this?
The tonemap filter is just a (very old by now) copy of libplacebo'stonemapping.
No one has bothered to keep it in sync.
I'm working on a libplacebo wrapper currently, so once that's mergedthere
will be up to date hardware tonemapping.
_______________________________________________
ffmpeg-devel mailing list
ffmpeg-devel@ffmpeg.org <mailto:ffmpeg-devel@ffmpeg.org>
<https://ffmpeg.org/mailman/listinfo/ffmpeg-devel>

To unsubscribe, visit link above, or email
ffmpeg-devel-requ...@ffmpeg.org<mailto:ffmpeg-devel-requ...@ffmpeg.org> with subject "unsubscribe".


_______________________________________________
ffmpeg-devel mailing list
ffmpeg-devel@ffmpeg.org
https://ffmpeg.org/mailman/listinfo/ffmpeg-devel

To unsubscribe, visit link above, or email
ffmpeg-devel-requ...@ffmpeg.org with subject "unsubscribe".

Re: [FFmpeg-devel] Development of a CUDA accelerated variant of the libav vf_tonemap

Reply via email to