Skip to content

Voice Chat

Voice chat lets you speak to Snippbot agents and hear their responses using speech-to-text (STT) input and text-to-speech (TTS) output.

  • A microphone and speaker (or headphones)
  • A modern browser (Chrome or Firefox recommended) for microphone access
  • For cloud providers: an API key for your chosen TTS/STT provider (OpenAI, ElevenLabs, or Hume AI). Local and Browser providers work without API keys.
ProviderKey settingNotes
Local (piper-tts)(none)Offline local TTS, no API key needed
OpenAI TTSopenai_api_keyHigh quality cloud TTS
ElevenLabselevenlabs_api_keyHighest quality, many voices
Hume AIhume_api_keyEmotionally expressive voice
Browser (Web Speech)(none)Built-in browser TTS, no API key needed
ProviderNotes
Local (faster-whisper)Offline local STT, no API key needed
OpenAI WhisperHigher accuracy, requires OpenAI API key
ElevenLabs ScribeHigh-accuracy transcription via ElevenLabs
  1. Create an account at elevenlabs.io and get an API key

  2. Add the key in Snippbot: Settings → Integrations → ElevenLabs API Key

  3. Enable in chat — click the TTS icon (speaker) in the chat toolbar and select ElevenLabs

  4. Choose a voice from the voice selector dropdown

  1. Create an account at hume.ai and get an API key

  2. Add the key in Snippbot: Settings → Integrations → Hume API Key

  3. Enable in chat — click the TTS icon in the chat toolbar and select Hume AI

No API key needed. In the Chat UI: TTS icon → select Browser.

Browser TTS quality varies by OS and browser. System voices are used.

  1. Open a chat with any agent

  2. Click the microphone icon in the message input area — the browser will request microphone access

  3. Speak your message — recording is active while the icon is animated

  4. Click the microphone again (or press Space) to stop recording

  5. The speech is transcribed and sent as a chat message

  6. When TTS is enabled, the agent’s response is read aloud automatically

ShortcutAction
Space (when focused)Start/stop recording
EscapeCancel recording
SToggle TTS on/off

Enable Continuous mode to keep the microphone active after each agent response:

  1. Open voice settings — in the Chat UI toolbar, click Settings → Voice → Continuous mode: On

  2. Start talking — after each response, recording automatically restarts

  3. Exit — press Escape to leave continuous mode

While recording, you can use voice commands:

Voice commandAction
”Stop recording”Sends the message
”Cancel”Discards the recording
”New conversation”Starts a fresh chat session

Configure TTS behavior in Settings → Voice:

SettingOptionsDescription
ProviderLocal (piper-tts) / OpenAI TTS / ElevenLabs / Hume AI / Browser (Web Speech)TTS engine
Voice(provider-specific)Voice selection
Speed0.5 – 2.0Playback speed
Auto-playOn / OffAuto-speak agent responses
Skip code blocksOn (default)Don’t read code aloud
SettingOptionsDescription
ProviderLocal (faster-whisper) / OpenAI Whisper / ElevenLabs ScribeSTT engine
Language(locale)Recognition language
Auto-sendOn / OffSend immediately after recording stops
  • Microphone access is browser-gated — you grant permission per site
  • Audio is processed locally (browser STT) or sent to the selected provider (ElevenLabs, Hume, Whisper)
  • Snippbot does not record or store your voice
  • Review the privacy policy of your chosen TTS/STT provider
  1. Check browser microphone permission: chrome://settings/content/microphone
  2. Ensure no other app has exclusive microphone access
  3. Try refreshing the page and granting permission again
  1. Check the selected voice in Settings → Voice
  2. Reduce playback speed if the voice is too fast
  3. Verify the API key is valid (test at the provider’s website)
  1. Speak clearly and close to the microphone
  2. Switch to Whisper for better accuracy (requires OpenAI key)
  3. Set the correct language in STT settings