Synthome Docs
Models

ElevenLabs Turbo

High-quality text-to-speech with voice cloning

ElevenLabs Turbo v2.5

Fast, high-quality text-to-speech from ElevenLabs with extensive voice options.

PropertyValue
Model IDelevenlabs/turbo-v2.5
ProvidersElevenLabs, Replicate
TypeText-to-speech

Basic Usage

import { compose, generateAudio, audioModel } from "@synthome/sdk";

const execution = await compose(
  generateAudio({
    model: audioModel("elevenlabs/turbo-v2.5", "elevenlabs"),
    text: "Hello! Welcome to our video tutorial.",
    voiceId: "21m00Tcm4TlvDq8ikWAM", // Rachel voice
  }),
).execute();

Voice Selection

ElevenLabs provides pre-made voices and custom voice cloning.

VoiceIDDescription
Rachel21m00Tcm4TlvDq8ikWAMCalm, professional female
AdampNInz6obpgDQGcFmaJgBDeep, authoritative male
JoshTxGEqnHWrfWFTfGW9XjXYoung, energetic male
BellaEXAVITQu4vr4xnSDxMaLWarm, friendly female
generateAudio({
  model: audioModel("elevenlabs/turbo-v2.5", "elevenlabs"),
  text: "Your narration text here.",
  voiceId: "pNInz6obpgDQGcFmaJgB", // Adam
});

Options

OptionTypeDefaultDescription
textstringrequiredText to convert (max 40,000 chars)
voiceIdstringrequiredElevenLabs voice ID
modelIdstringeleven_turbo_v2_5ElevenLabs model variant
stabilitynumber0.5Voice consistency (0-1)
similarityBoostnumber0.75Voice similarity (0-1)
stylenumber0Style exaggeration (0-1)
useSpeakerBoostbooleantrueEnhanced audio quality
outputFormatstringmp3_44100_128Audio format
languageCodestringautoLanguage code

Voice Settings

Stability

Controls consistency vs expressiveness:

generateAudio({
  model: audioModel("elevenlabs/turbo-v2.5", "elevenlabs"),
  text: "This is a test.",
  voiceId: "21m00Tcm4TlvDq8ikWAM",
  stability: 0.7, // More consistent, less expressive
});
  • Low (0.0-0.3): More expressive, variable delivery
  • Medium (0.4-0.6): Balanced
  • High (0.7-1.0): Very consistent, robotic at extremes

Similarity Boost

How closely to match the original voice:

generateAudio({
  model: audioModel("elevenlabs/turbo-v2.5", "elevenlabs"),
  text: "Match the voice closely.",
  voiceId: "21m00Tcm4TlvDq8ikWAM",
  similarityBoost: 0.9, // Very close to original
});

Style

Exaggerate speaking style (v2 models):

generateAudio({
  model: audioModel("elevenlabs/turbo-v2.5", "elevenlabs"),
  text: "An exciting announcement!",
  voiceId: "21m00Tcm4TlvDq8ikWAM",
  style: 0.5, // More expressive delivery
});

Multi-Language

Supports 32 languages:

// Spanish
generateAudio({
  model: audioModel("elevenlabs/turbo-v2.5", "elevenlabs"),
  text: "Hola, bienvenidos a nuestro canal.",
  voiceId: "21m00Tcm4TlvDq8ikWAM",
  languageCode: "es",
});

// French
generateAudio({
  model: audioModel("elevenlabs/turbo-v2.5", "elevenlabs"),
  text: "Bonjour, bienvenue sur notre chaîne.",
  voiceId: "21m00Tcm4TlvDq8ikWAM",
  languageCode: "fr",
});

Output Formats

FormatDescription
mp3_44100_128MP3 128kbps (default)
mp3_44100_192MP3 192kbps (higher quality)
pcm_16000PCM 16kHz
pcm_22050PCM 22kHz
pcm_24000PCM 24kHz
pcm_44100PCM 44.1kHz

Context for Better Pronunciation

Provide surrounding text for better prosody:

generateAudio({
  model: audioModel("elevenlabs/turbo-v2.5", "elevenlabs"),
  text: "This is the middle part.",
  voiceId: "21m00Tcm4TlvDq8ikWAM",
  previousText: "Here's the introduction.",
  nextText: "And here's the conclusion.",
});

With Video Generation

Create narrated videos:

import {
  compose,
  generateVideo,
  generateAudio,
  videoModel,
  audioModel,
} from "@synthome/sdk";

const execution = await compose(
  generateVideo({
    model: videoModel("veed/fabric-1.0", "fal"),
    image: "https://example.com/speaker.jpg",
    audio: generateAudio({
      model: audioModel("elevenlabs/turbo-v2.5", "elevenlabs"),
      text: "Welcome to today's tutorial on AI video generation.",
      voiceId: "21m00Tcm4TlvDq8ikWAM",
    }),
  }),
).execute();

Model Variants

Model IDSpeedQualityLanguages
eleven_turbo_v2_5FastestHigh32
eleven_multilingual_v2FastHighest29
eleven_flash_v2_5Very fastGood32
generateAudio({
  model: audioModel("elevenlabs/turbo-v2.5", "elevenlabs"),
  text: "Using the multilingual model for highest quality.",
  voiceId: "21m00Tcm4TlvDq8ikWAM",
  modelId: "eleven_multilingual_v2",
});

How is this guide?