Table of Contents

Using Whisper to Transcribe Podcasts

Using Whisper to Transcribe Podcasts

swyx 2023-07-01

Prerequisites

these are onetime setup things

brew install ffmpeg # takes a while!
git clone https://github.com/ggerganov/whisper.cpp
cd whisper.cpp
make
cd models
./download-ggml-model.sh base.en or ./download-ggml-model.sh medium.en # see full model list here https://github.com/ggerganov/whisper.cpp/tree/master/models

Steps

convert your audio file to a 16khz .wav file: ffmpeg -i SOURCE_FILE.wav -ar 16000 output.wav
THEN you can do ./main -m models/ggml-medium.en.bin -f output.wav >> output.txt inside of the whisper.cpp directory, which pipes the transcription into output.txt

at a rate of about 3 minutes of input: 2 minutes to transcribe (for the medium - 769M param model)
or at a rate of about 12 minutes of input: 1 minute to transcribe (for the base - 74M param model)

Leave a reaction if you liked this post! 🧡

Loading comments...

Webmentions

loading counts

Latest Posts

📓 Becoming a High Taste Tester 2025-07-25
🎧 AIE on MonkCast 2025-07-23
📓 Gemini Nano in Chrome 137: notes for AI Engineers 2025-07-08
📓 notes from Naval (2025) 2025-07-06
📓 Elixir/Phoenix Liveview was a mistake 2025-05-23
📓 2025 Advice to my old selves 2025-05-15
🎧 ThisDot Leadership 2025-04-25
📓 Supa Pecha Kucha 2025-03-11
📓 How to download YouTube Videos quickly 2025-01-25
🎧 WHCR Community & Technology 2025-01-21

Search and see all content