https://bugs.kde.org/show_bug.cgi?id=467172
Bug ID: 467172 Summary: Add ability to use Whisper models for speech recognition Classification: Applications Product: kdenlive Version: 22.12.3 Platform: Archlinux OS: Linux Status: REPORTED Severity: wishlist Priority: NOR Component: Effects & Transitions Assignee: j...@kdenlive.org Reporter: calibre...@gmail.com Target Milestone: --- The VOSK API compatible models for the speech recognition subtitles are not bad, but the Whisper models are better and allow for automatic detection of punctuation - VOSK not detecting these creates unreadable sentences where you do not know where they began and end. A temporary workaround is to use the recasepunc model separately, but it puts a full stop at the end of every line of text, rather than where the sentence actually ends 4 subtitles later, requiring every subtitle to be edited to remove the excess full stops. I hope to see Whisper support soon. -- You are receiving this mail because: You are watching all bug changes.