Add Speech-to-Text (stt) to elevenlabs (#40063)

ehendrix23 · web-flow · commit 98b68d8fdb2c · 2025-10-10T17:01:33.000+02:00
diff --git a/source/_integrations/elevenlabs.markdown b/source/_integrations/elevenlabs.markdown
@@ -7,29 +7,43 @@ ha_release: 2024.8
 ha_iot_class: Cloud Polling
 ha_domain: elevenlabs
 ha_platforms:
+  - stt
   - tts
 ha_config_flow: true
 ha_integration_type: service
 ha_codeowners:
   - '@sorgfresser'
 ---
 
-The **ElevenLabs** text-to-speech {% term integrations %} adds support for [ElevenLabs](https://elevenlabs.io/) as text-to-speech to read a text with natural sounding voices.
+The **ElevenLabs** {% term integrations %} adds support for [ElevenLabs](https://elevenlabs.io/) as text-to-speech to read a text with natural sounding voices and speech-to-text to convert speech into text.
 
 ## Prerequisites
 
 - You need an ElevenLabs account to use this integration. A free account suffices for basic usage.
 - For custom voices or more quota you need a subscription.
 - You need your API key from the ElevenLabs website.
+- Your API key requires the following permissions:
+  - Text to Speech
+  - Speech to Text
+  - Voices (Read only)
+  - Models (Read only)
 
 {% include integrations/config_flow.md %}
 
+### Text-to-speech
+
 For an overview of which languages can be used, check the [complete list of supported languages](https://elevenlabs.io/languages).
 
 For a description of which voices are available for use, check your VoiceLab voices.
 
 To see the available models and their benefits, check the [models documentation](https://elevenlabs.io/docs/speech-synthesis/models).
 
+### Speech-to-text
+
+An overview of supported languages is available at [complete list of supported languages](https://elevenlabs.io/docs/capabilities/speech-to-text#supported-languages).
+
+List of models and their capabilities can be determined at [models documentation](https://elevenlabs.io/docs/capabilities/speech-to-text#models).
+
 ## Action speak
 
 The `tts.speak` action is the modern way to use TTS. Add the `speak` action, select the entity for your ElevenLabs TTS (it's named ElevenLabs by default), select the media player entity or group to send the TTS audio to, and enter the message to speak.
@@ -61,14 +75,25 @@ voice:
   required: false
   type: string
 model:
-  description: "Model ID for the model to use. Will override the default model of the entity!"
+  description: "Model ID for the text-to-speech model to use. Will override the default model of the entity!"
   required: false
   type: string
+Speech-to-text model:
+  description: "Model ID for the speech-to-text model to use. Will override the default model of the entity!"
+  required: false
+  type: string
+Auto-detect language:
+  description: "Should speech-to-text auto-detect the language spoken, overrides the language selected in voice assistant!"
+  required: false
+  type: boolean
+  default: false
 
 {% endconfiguration %}
 
 For more information about using text-to-speech with Home Assistant and more details on all the options it provides, see the [TTS documentation](/integrations/tts/).
 
+For more information about using speech-to-text with Home Assistant and more details on all the options it provides, see the [STT documentation](/integrations/stt/).
+
 ## Removing the integration
 
 This integration follows standard integration removal. No extra steps are required.