Voice interface for AI coding assistants. Talk to your CLI agent and hear it respond, powered by Azure Speech Services.
Record from microphone and transcribe to text via Azure STT or local Whisper.
Convert text to natural speech using Azure HD voices with streaming playback.
Speak text, then immediately listen for a reply: a TTS + STT round trip in one call.
Continuous voice loop — listen, let the AI respond, listen again.
Multiple voices in one call — parallel TTS requests, sequential playback.
View or change runtime settings: voice, quality, timeouts, chimes, and more.
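
The "parallel TTS requests, sequential playback" behavior of the multi-voice call can be sketched roughly as follows. This is a minimal illustration of the pattern, not the project's actual implementation: `synthesize`, `speak_all`, and the voice names are hypothetical, and the synthesis step is a stub that returns fake audio bytes instead of calling Azure Speech Services.

```python
import asyncio


async def synthesize(voice: str, text: str) -> bytes:
    """Hypothetical stand-in for one TTS request; returns fake audio bytes."""
    await asyncio.sleep(0.01)  # simulate network latency
    return f"[{voice}] {text}".encode("utf-8")


async def speak_all(segments, play) -> None:
    """Synthesize all (voice, text) segments concurrently, play them in order."""
    # Start every request up front so the syntheses overlap in time.
    tasks = [asyncio.create_task(synthesize(voice, text)) for voice, text in segments]
    # Await the tasks in their original order so playback stays sequential,
    # even if a later segment finishes synthesizing first.
    for task in tasks:
        play(await task)


def demo() -> list:
    """Run the sketch with a list standing in for the audio output device."""
    played = []
    segments = [("voice-a", "Hello."), ("voice-b", "Hi there.")]
    asyncio.run(speak_all(segments, played.append))
    return played
```

Calling `demo()` returns the fake audio chunks in the order the segments were given. With this structure, total synthesis wait time is bounded by the slowest request rather than the sum of all requests, while listeners still hear the voices one after another.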