Skip to content

ElevenLabs

ElevenLabs is an AI-powered voice synthesis company specializing in ultra-realistic text-to-speech and voice cloning technology.

Categories: artificial-intelligence

Type: elevenLabs/v1


Connections

Version: 1

API Key

Properties

NameLabelTypeDescriptionRequired
keyKeySTRINGtrue
valueAPI KeySTRINGtrue

Actions

Create Sound Effect

Name: createSoundEffect

Turn text into sound effects for your videos, voice-overs or video games using the most advanced sound effects model in the world.

Properties

NameLabelTypeDescriptionRequired
textTextSTRINGThe text that will get converted into a sound effect.true

Example JSON Structure

{
"label" : "Create Sound Effect",
"name" : "createSoundEffect",
"parameters" : {
"text" : ""
},
"type" : "elevenLabs/v1/createSoundEffect"
}

Output

The output for this action is dynamic and may vary depending on the input parameters. To determine the exact structure of the output, you need to execute the action.

Create Speech

Name: createSpeech

Converts text into speech using a voice of your choice and returns audio.

Properties

NameLabelTypeDescriptionRequired
voiceIdVoiceSTRINGVoice you want to use for converting the text into speech.true
textTextSTRINGText you want to convert into speech.true

Example JSON Structure

{
"label" : "Create Speech",
"name" : "createSpeech",
"parameters" : {
"voiceId" : "",
"text" : ""
},
"type" : "elevenLabs/v1/createSpeech"
}

Output

The output for this action is dynamic and may vary depending on the input parameters. To determine the exact structure of the output, you need to execute the action.

Create Speech With Timing

Name: createSpeechWithTiming

Generate speech from text with precise character-level timing information for audio-text synchronization.

Properties

NameLabelTypeDescriptionRequired
voiceIdVoiceSTRINGVoice you want to use for converting the text into speech.true
textTextSTRINGText you want to convert into speech.true

Example JSON Structure

{
"label" : "Create Speech With Timing",
"name" : "createSpeechWithTiming",
"parameters" : {
"voiceId" : "",
"text" : ""
},
"type" : "elevenLabs/v1/createSpeechWithTiming"
}

Output

Type: OBJECT

Properties

NameTypeDescription
audio_base64STRINGBase64 encoded audio data
alignmentOBJECT
Properties {[STRING](characters), [NUMBER](character_start_times_seconds), [NUMBER](character_end_times_seconds)}
normalized_alignmentOBJECT
Properties {[STRING](characters), [NUMBER](character_start_times_seconds), [NUMBER](character_end_times_seconds)}

Output Example

{
"audio_base64" : "",
"alignment" : {
"characters" : [ "" ],
"character_start_times_seconds" : [ 0.0 ],
"character_end_times_seconds" : [ 0.0 ]
},
"normalized_alignment" : {
"characters" : [ "" ],
"character_start_times_seconds" : [ 0.0 ],
"character_end_times_seconds" : [ 0.0 ]
}
}

Create Transcript

Name: createTranscript

Transcribe an audio or video file.

Properties

NameLabelTypeDescriptionRequired
model_idModelSTRINGThe ID of the model to use for transcription, currently only ‘scribe_v1’ is available.true
fileFileFILE_ENTRYThe file to transcribe. All major audio and video formats are supported. The file size must be less than 1GB.true

Example JSON Structure

{
"label" : "Create Transcript",
"name" : "createTranscript",
"parameters" : {
"model_id" : "",
"file" : {
"extension" : "",
"mimeType" : "",
"name" : "",
"url" : ""
}
},
"type" : "elevenLabs/v1/createTranscript"
}

Output

Type: OBJECT

Properties

NameTypeDescription
language_codeSTRINGThe detected language code (e.g. ‘eng’ for English).
language_probabilityNUMBERThe confidence score of the language detection (0 to 1).
textSTRINGThe raw text of the transcription.
wordsARRAY
Items [{STRING(text), NUMBER(start), NUMBER(end), STRING(type)}]
List of words with their timing information.

Output Example

{
"language_code" : "",
"language_probability" : 0.0,
"text" : "",
"words" : [ {
"text" : "",
"start" : 0.0,
"end" : 0.0,
"type" : ""
} ]
}

Additional instructions