ElevenLabs Provider for Dify
A Dify plugin for integrating ElevenLabs Text-to-Speech and Speech-to-Text services.
Features
- Text-to-Speech (TTS): Convert text to high-quality, natural-sounding speech using ElevenLabs' advanced voice synthesis technology.
- Speech-to-Text (STT): Transcribe audio to text with accurate speech recognition.
Setup
- Install the plugin in your Dify instance
- Configure the plugin with your ElevenLabs API key
- Use the ElevenLabs models in your Dify applications
Requirements
Models
Text-to-Speech
- Model:
- Voices: Aria, Roger, Sarah, Laura, Charlie, George, Callum, River, Liam, Charlotte, Alice, Matilda, Will, Jessica, Eric, Chris, Brian, Daniel, Lily, Bill, Koby
- Default Voice: Sarah
Speech-to-Text
- Model:
- Mode: transcription
- Supported File Extensions: mp3, mp4, mpeg, mpga, m4a, wav, webm
- File Upload Limit: 25MB
License
This plugin is provided under the MIT license.