You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
A useful notebook (or addition to the existing whisper transcription notebook) would be one that enabled users to run whisper over audio for transcription, and then access produce word- or sentence-level timestamps (not part of whisper's functionality).
That's very cool @belisards - thank you! word level timestamps probably isn't important for most research, so those sentence-level ones look really good!
Another feature that might be useful for open-source research is speech diarization. There is is a great video covering many Whisper variants and features like this: https://www.youtube.com/watch?v=Thc0vtnWYOo
A useful notebook (or addition to the existing whisper transcription notebook) would be one that enabled users to run whisper over audio for transcription, and then access produce word- or sentence-level timestamps (not part of whisper's functionality).
A package like https://github.com/linto-ai/whisper-timestamped might make this easy.
The text was updated successfully, but these errors were encountered: