Skip to content

Commit 98b68d8

Browse files
authored
Add Speech-to-Text (stt) to elevenlabs (#40063)
1 parent 8e3d0bb commit 98b68d8

File tree

1 file changed

+27
-2
lines changed

1 file changed

+27
-2
lines changed

source/_integrations/elevenlabs.markdown

Lines changed: 27 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -7,29 +7,43 @@ ha_release: 2024.8
77
ha_iot_class: Cloud Polling
88
ha_domain: elevenlabs
99
ha_platforms:
10+
- stt
1011
- tts
1112
ha_config_flow: true
1213
ha_integration_type: service
1314
ha_codeowners:
1415
- '@sorgfresser'
1516
---
1617

17-
The **ElevenLabs** text-to-speech {% term integrations %} adds support for [ElevenLabs](https://elevenlabs.io/) as text-to-speech to read a text with natural sounding voices.
18+
The **ElevenLabs** {% term integrations %} adds support for [ElevenLabs](https://elevenlabs.io/) as text-to-speech to read a text with natural sounding voices and speech-to-text to convert speech into text.
1819

1920
## Prerequisites
2021

2122
- You need an ElevenLabs account to use this integration. A free account suffices for basic usage.
2223
- For custom voices or more quota you need a subscription.
2324
- You need your API key from the ElevenLabs website.
25+
- Your API key requires the following permissions:
26+
- Text to Speech
27+
- Speech to Text
28+
- Voices (Read only)
29+
- Models (Read only)
2430

2531
{% include integrations/config_flow.md %}
2632

33+
### Text-to-speech
34+
2735
For an overview of which languages can be used, check the [complete list of supported languages](https://elevenlabs.io/languages).
2836

2937
For a description of which voices are available for use, check your VoiceLab voices.
3038

3139
To see the available models and their benefits, check the [models documentation](https://elevenlabs.io/docs/speech-synthesis/models).
3240

41+
### Speech-to-text
42+
43+
An overview of supported languages is available at [complete list of supported languages](https://elevenlabs.io/docs/capabilities/speech-to-text#supported-languages).
44+
45+
List of models and their capabilities can be determined at [models documentation](https://elevenlabs.io/docs/capabilities/speech-to-text#models).
46+
3347
## Action speak
3448

3549
The `tts.speak` action is the modern way to use TTS. Add the `speak` action, select the entity for your ElevenLabs TTS (it's named ElevenLabs by default), select the media player entity or group to send the TTS audio to, and enter the message to speak.
@@ -61,14 +75,25 @@ voice:
6175
required: false
6276
type: string
6377
model:
64-
description: "Model ID for the model to use. Will override the default model of the entity!"
78+
description: "Model ID for the text-to-speech model to use. Will override the default model of the entity!"
6579
required: false
6680
type: string
81+
Speech-to-text model:
82+
description: "Model ID for the speech-to-text model to use. Will override the default model of the entity!"
83+
required: false
84+
type: string
85+
Auto-detect language:
86+
description: "Should speech-to-text auto-detect the language spoken, overrides the language selected in voice assistant!"
87+
required: false
88+
type: boolean
89+
default: false
6790

6891
{% endconfiguration %}
6992

7093
For more information about using text-to-speech with Home Assistant and more details on all the options it provides, see the [TTS documentation](/integrations/tts/).
7194

95+
For more information about using speech-to-text with Home Assistant and more details on all the options it provides, see the [STT documentation](/integrations/stt/).
96+
7297
## Removing the integration
7398

7499
This integration follows standard integration removal. No extra steps are required.

0 commit comments

Comments
 (0)