Hi,

Yesterday, news made the rounds that ffmpeg 8 is going to be released soon and that it will include whisper, an AI tool that can recognize speech and create subtitles from it.

The project's GitHub page, https://github.com/ggml-org/whisper.cpp, lists a handful of models:

Model   Disk     Mem
tiny    75 MiB   ~273 MB
base    142 MiB  ~388 MB
small   466 MiB  ~852 MB
medium  1.5 GiB  ~2.1 GB
large   2.9 GiB  ~3.9 GB

How does this work? Will these models be compiled into the ffmpeg binary?
_______________________________________________
ffmpeg-user mailing list
ffmpeg-user@ffmpeg.org
https://ffmpeg.org/mailman/listinfo/ffmpeg-user

To unsubscribe, visit link above, or email
ffmpeg-user-requ...@ffmpeg.org with subject "unsubscribe".
