YouTube Video Transcriber

A Python script that automatically downloads YouTube videos and generates text transcriptions using OpenAI's Whisper model.

Features

Downloads audio from YouTube videos using yt-dlp
Transcribes audio using OpenAI's Whisper model
Saves transcriptions as formatted text files
Handles temporary file management
Supports various audio formats through FFmpeg
Processes transcriptions into readable sentences

Prerequisites

Before running this script, make sure you have the following installed:

pip install -r requirements.txt

Required packages:

yt-dlp
whisper
librosa
ffmpeg (system dependency)

Installation

Clone this repository:

git clone https://github.com/Khalid1G/Whisper-YouTube-Transcription.git
cd Whisper-YouTube-Transcription

Install the required packages:

pip install -r requirements.txt

Install FFmpeg (if not already installed):

On Ubuntu/Debian: sudo apt-get install ffmpeg
On macOS: brew install ffmpeg
On Windows: Download from FFmpeg official website

Usage

Run the script:

python transcriber.py

Enter the YouTube video ID when prompted. The video ID is the part after v= in the YouTube URL. For example, for https://www.youtube.com/watch?v=dQw4w9WgXcQ, the ID is dQw4w9WgXcQ
The script will:
- Download the audio from the YouTube video
- Process the audio using Whisper
- Save the transcription to Youtube/[video_id]_transcription.txt

Output

The transcription will be saved in the Youtube directory with the filename pattern: [video_id]_transcription.txt

The output text is formatted with one sentence per line for better readability.

Project Structure

.
├── transcriber.py          # Main script
├── yt_transcription.ipynb  # Notebook script
├── requirements.txt        # Python dependencies
├── Youtube/                # Directory for transcription outputs
└── README.md               # This file

Performance

Transcription time varies based on:
- Video length
- Computer specifications
- Chosen Whisper model ("base" in this implementation)

Limitations

Requires stable internet connection for YouTube downloads
Processing time depends on video length and hardware capabilities
Accuracy depends on audio quality and Whisper model used

Contributing

Feel free to open issues or submit pull requests with improvements.

License

MIT License

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

YouTube Video Transcriber

Features

Prerequisites

Installation

Usage

Output

Project Structure

Performance

Limitations

Contributing

License

Acknowledgments

About

Releases 1

Packages

Languages

Name		Name	Last commit message	Last commit date
Latest commit History 4 Commits
Youtube		Youtube
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
requirements.txt		requirements.txt
transcriber.py		transcriber.py
yt_transcription.ipynb		yt_transcription.ipynb

License

Khalid1G/Whisper-YouTube-Transcription

Folders and files

Latest commit

History

Repository files navigation

YouTube Video Transcriber

Features

Prerequisites

Installation

Usage

Output

Project Structure

Performance

Limitations

Contributing

License

Acknowledgments

About

Resources

License

Stars

Watchers

Forks

Releases 1

Packages 0

Languages

Packages