All Guides
OpenAI Whisper Setup (Speech-to-Text)
Install the OpenAI Whisper model and convert audio files to text.
Intermediate25 min
Setup Steps
1. Install ffmpeg (required for audio processing):
sudo apt install ffmpeg2. Install Whisper via pip:
pip install openai-whisper3. Command line usage:
whisper audio_file.mp3 --language English --model medium4. Available models (smallest to largest): tiny, base, small, medium, large-v3
5. Python usage:
python
import whisper
model = whisper.load_model("medium")
result = model.transcribe("audio_file.mp3", language="en")
print(result["text"])6. Subtitle format output:
whisper audio.mp3 --language en --output_format srt7. GPU accelerated usage:
python
model = whisper.load_model("large-v3", device="cuda")8. Faster Whisper alternative (faster):
pip install faster-whisperpython
from faster_whisper import WhisperModel
model = WhisperModel("large-v3", device="cuda")
segments, info = model.transcribe("audio.mp3", language="en")
for segment in segments:
print(f"[{segment.start:.2f}s -> {segment.end:.2f}s] {segment.text}")Related Guides
Claude Code Setup
Install Anthropic Claude Code CLI on your server or PC. API key configuration and basic commands.
ChatGPT API Setup
Integrate OpenAI ChatGPT API into your project. Getting API keys and sending your first request.
Google Gemini API Setup
Set up Google Gemini (formerly Bard) API and use it in your projects.
Midjourney Usage Guide
Create AI-powered images with Midjourney. Step-by-step guide for Discord usage.