https://bugs.kde.org/show_bug.cgi?id=467172

            Bug ID: 467172
           Summary: Add ability to use Whisper models for speech
                    recognition
    Classification: Applications
           Product: kdenlive
           Version: 22.12.3
          Platform: Archlinux
                OS: Linux
            Status: REPORTED
          Severity: wishlist
          Priority: NOR
         Component: Effects & Transitions
          Assignee: j...@kdenlive.org
          Reporter: calibre...@gmail.com
  Target Milestone: ---

The VOSK API-compatible models used for the speech recognition subtitles are
not bad, but the Whisper models are better and detect punctuation
automatically. Because VOSK does not detect punctuation, it produces unreadable
text where you cannot tell where sentences begin and end.

A temporary workaround is to run the recasepunc model separately, but it puts a
full stop at the end of every line of text rather than where the sentence
actually ends (for example, 4 subtitles later), so every subtitle has to be
edited by hand to remove the excess full stops.
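
The manual cleanup described above could in principle be scripted. What follows
is only a minimal Python sketch, not part of Kdenlive, VOSK, or recasepunc. It
assumes the subtitles have been exported as SRT text, and the function name
strip_trailing_full_stops is hypothetical. It removes every single trailing
full stop, including correct ones, so genuine sentence ends would still need to
be restored by hand.

```python
import re


def strip_trailing_full_stops(srt_text: str) -> str:
    """Remove one trailing full stop from each subtitle text line.

    Assumption: the input is SRT-formatted text. Cue index lines and
    timing lines (those containing "-->") are left untouched.
    """
    cleaned = []
    for line in srt_text.splitlines():
        is_index = line.strip().isdigit()
        is_timing = "-->" in line
        if not is_index and not is_timing:
            # Drop a single final "." but leave an ellipsis ("...") intact.
            line = re.sub(r"(?<!\.)\.\s*$", "", line)
        cleaned.append(line)
    return "\n".join(cleaned)
```

For example, a cue whose text is "hello world." becomes "hello world", while
"and so on..." is left unchanged.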

I hope to see Whisper support soon.

