ByteChef LogoByteChef
Components

ElevenLabs

ElevenLabs is an AI-powered voice synthesis company specializing in ultra-realistic text-to-speech and voice cloning technology.

Categories: Artificial Intelligence

Type: elevenLabs/v1


Connections

Version: 1

API Key

Properties

NameLabelTypeDescriptionRequired
keyKeySTRINGtrue
valueAPI KeySTRINGtrue

Actions

Create Sound Effect

Name: createSoundEffect

Turn text into sound effects for your videos, voice-overs or video games using the most advanced sound effects model in the world.

Properties

NameLabelTypeDescriptionRequired
textTextSTRINGThe text that will get converted into a sound effect.true

Example JSON Structure

{
  "label" : "Create Sound Effect",
  "name" : "createSoundEffect",
  "parameters" : {
    "text" : ""
  },
  "type" : "elevenLabs/v1/createSoundEffect"
}

Output

Type: FILE_ENTRY

Properties

NameTypeDescription
extensionSTRING
mimeTypeSTRING
nameSTRING
urlSTRING

Output Example

{
  "extension" : "",
  "mimeType" : "",
  "name" : "",
  "url" : ""
}

Create Speech

Name: createSpeech

Converts text into speech using a voice of your choice and returns audio.

Properties

NameLabelTypeDescriptionRequired
voiceIdVoiceSTRINGVoice you want to use for converting the text into speech.true
textTextSTRINGText you want to convert into speech.true

Example JSON Structure

{
  "label" : "Create Speech",
  "name" : "createSpeech",
  "parameters" : {
    "voiceId" : "",
    "text" : ""
  },
  "type" : "elevenLabs/v1/createSpeech"
}

Output

Type: FILE_ENTRY

Properties

NameTypeDescription
extensionSTRING
mimeTypeSTRING
nameSTRING
urlSTRING

Output Example

{
  "extension" : "",
  "mimeType" : "",
  "name" : "",
  "url" : ""
}

Create Speech With Timing

Name: createSpeechWithTiming

Generate speech from text with precise character-level timing information for audio-text synchronization.

Properties

NameLabelTypeDescriptionRequired
voiceIdVoiceSTRINGVoice you want to use for converting the text into speech.true
textTextSTRINGText you want to convert into speech.true

Example JSON Structure

{
  "label" : "Create Speech With Timing",
  "name" : "createSpeechWithTiming",
  "parameters" : {
    "voiceId" : "",
    "text" : ""
  },
  "type" : "elevenLabs/v1/createSpeechWithTiming"
}

Output

Type: OBJECT

Properties

NameTypeDescription
audio_base64STRINGBase64 encoded audio data
alignmentOBJECT
Properties {[STRING](characters), [NUMBER](character_start_times_seconds), [NUMBER](character_end_times_seconds)}
normalized_alignmentOBJECT
Properties {[STRING](characters), [NUMBER](character_start_times_seconds), [NUMBER](character_end_times_seconds)}

Output Example

{
  "audio_base64" : "",
  "alignment" : {
    "characters" : [ "" ],
    "character_start_times_seconds" : [ 0.0 ],
    "character_end_times_seconds" : [ 0.0 ]
  },
  "normalized_alignment" : {
    "characters" : [ "" ],
    "character_start_times_seconds" : [ 0.0 ],
    "character_end_times_seconds" : [ 0.0 ]
  }
}

Create Transcript

Name: createTranscript

Transcribe an audio or video file.

Properties

NameLabelTypeDescriptionRequired
model_idModelSTRINGThe ID of the model to use for transcription, currently only ‘scribe_v1’ is available.true
fileFileFILE_ENTRYThe file to transcribe. All major audio and video formats are supported. The file size must be less than 1GB.true

Example JSON Structure

{
  "label" : "Create Transcript",
  "name" : "createTranscript",
  "parameters" : {
    "model_id" : "",
    "file" : {
      "extension" : "",
      "mimeType" : "",
      "name" : "",
      "url" : ""
    }
  },
  "type" : "elevenLabs/v1/createTranscript"
}

Output

Type: OBJECT

Properties

NameTypeDescription
language_codeSTRINGThe detected language code (e.g. ‘eng’ for English).
language_probabilityNUMBERThe confidence score of the language detection (0 to 1).
textSTRINGThe raw text of the transcription.
wordsARRAY
Items [{STRING(text), NUMBER(start), NUMBER(end), STRING(type)}]
List of words with their timing information.

Output Example

{
  "language_code" : "",
  "language_probability" : 0.0,
  "text" : "",
  "words" : [ {
    "text" : "",
    "start" : 0.0,
    "end" : 0.0,
    "type" : ""
  } ]
}