Voice Transcription
Speech to Text is experimental. Expect issues and changes as it matures.
Use voice input in prompt fields instead of typing. Transcription uses your Kilo account through Kilo Gateway.
Get ready
Voice input needs FFmpeg plus access to the Kilo provider.
Install FFmpeg
FFmpeg is required for audio capture and processing. Install it for your platform:
macOS:
brew install ffmpeg
Linux (Ubuntu/Debian):
sudo apt update
sudo apt install ffmpeg
Windows: Download FFmpeg from ffmpeg.org/download.html and add the folder that contains ffmpeg.exe to your system PATH.
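To confirm the install worked, a quick check along these lines can help. This is a minimal sketch that assumes a POSIX shell (macOS, Linux, or a shell such as Git Bash or WSL on Windows); `have_cmd` is a hypothetical helper name used for illustration, not part of Kilo.

```shell
# Hypothetical helper: succeeds when the named command is reachable through PATH.
have_cmd() {
  command -v "$1" >/dev/null 2>&1
}

if have_cmd ffmpeg; then
  # Print only the first line of the version banner.
  ffmpeg -version | head -n 1
else
  echo "ffmpeg not found on PATH"
fi
```

If the second message appears right after installing, open a new terminal (and restart your editor) so the updated PATH is picked up.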
Sign in
Enable and sign in to the Kilo provider to use voice input in prompt fields. Requests use your Kilo account through Kilo Gateway, so no separate OpenAI provider profile or API key is needed.
Enable input
Voice input is experimental and must be enabled:
- Open Kilo Code settings
- Open Experimental settings
- Enable the Speech to Text experiment
Kilo stores this toggle in your global Kilo CLI config (~/.config/kilo/kilo.jsonc), not VS Code user settings:
{
  "experimental": {
    "speech_to_text": true
  }
}
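If the toggle does not seem to take effect, you can inspect the config file directly. The sketch below assumes a POSIX shell and the config path shown above; `stt_enabled` is a hypothetical helper name. It is a plain text match, so it would also match a commented-out line in the .jsonc file.

```shell
# Hypothetical helper: succeeds if the file contains "speech_to_text": true.
stt_enabled() {
  grep -Eq '"speech_to_text"[[:space:]]*:[[:space:]]*true' "$1"
}

config="${HOME}/.config/kilo/kilo.jsonc"
if [ -f "$config" ] && stt_enabled "$config"; then
  echo "Speech to Text is enabled in $config"
else
  echo "Speech to Text flag not found in $config"
fi
```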
Record prompts
Once enabled, a microphone button appears in prompt fields:
- Click the microphone button to start recording
- Speak your message clearly
- Click again to stop recording
- Your speech is transcribed into text
While recording, the feature shows a real-time audio level visualization and uses voice activity detection to recognize when you're speaking.
Review details
- Audio processing: Uses FFmpeg for system audio capture
- Transcription: Sends audio through Kilo Gateway with the selected transcription model
Fix issues
Microphone button not appearing:
- Ensure the Speech to Text experiment is enabled
- Verify FFmpeg is installed and on your PATH
- Enable and sign in to the Kilo provider
Transcription errors:
- Confirm the Kilo provider remains enabled and signed in
- Check your internet connection
- Try speaking more clearly or adjusting your microphone settings
Know limits
Speech to Text is experimental and has the following known limitations:
- Requires an active internet connection
- Requires Kilo Gateway access through your Kilo account
- Transcription accuracy depends on audio quality and speech clarity