Eleven v3 (Flash)

ElevenLabsText-to-SpeechVoice CloningGenerally AvailableProprietaryvm-el-002

About

Most expressive text-to-speech model from ElevenLabs, capable of generating speech with sighing, whispering, laughing, and other non-verbal vocalizations. Delivers unprecedented emotional range and natural delivery.

Capabilities (5)

Sighing & whispering

Laughter generation

Emotional expression

Non-verbal vocalizations

Fine-grained prosody

161 chars

Speed1.0x

Pitch1.0

0:00.00

Key Highlights

First commercial TTS to generate natural laughter and sighs

Emotional range spans from whispering to excited exclamation

Paired with Instant Voice Cloning for expressive custom voices

Use Cases

Audiobook Narration

Generate natural-sounding narration for long-form content with consistent voice quality.

Notification Systems

Deliver voice alerts and notifications with expressive, human-like speech synthesis.

Multilingual Content

Produce audio content in multiple languages from a single text source.

Real-Time Voice Chat

Power low-latency voice responses in interactive applications and games.

Code Example

// Eleven v3 (Flash) — Text-to-Speech
import { synthesize } from "@arkitekton/voice";

const audio = await synthesize({
  model: "vm-el-002",
  vendor: "elevenlabs",
  input: "Hello, welcome to Arkitekton.",
  voice: "alloy",
  response_format: "mp3",
  speed: 1.0,
});

// Play the audio
const blob = new Blob([audio], { type: "audio/mp3" });
const url = URL.createObjectURL(blob);
const player = new Audio(url);
player.play();

Related Models

PersonaPlex 7B

NVIDIA

NeMo TTS

NVIDIA

Riva

NVIDIA

ACE (Avatar Cloud Engine)

NVIDIA

OpenAI TTS

OpenAI

WaveNet

Google

Quick Stats

Languages32 supported

LicenseProprietary

PricingFrom $0.18 / 1K characters

StatusGenerally Available

Vendor

ElevenLabs

Ultra-realistic voice synthesis and conversational AI platform

View all ElevenLabs models

Documentation

View on ElevenLabs Site

Audiobook Narration

Generate natural-sounding narration for long-form content with consistent voice quality.

Notification Systems

Deliver voice alerts and notifications with expressive, human-like speech synthesis.

Multilingual Content

Produce audio content in multiple languages from a single text source.

Real-Time Voice Chat

Power low-latency voice responses in interactive applications and games.

Code Example

// Eleven v3 (Flash) — Text-to-Speech
import { synthesize } from "@arkitekton/voice";

const audio = await synthesize({
  model: "vm-el-002",
  vendor: "elevenlabs",
  input: "Hello, welcome to Arkitekton.",
  voice: "alloy",
  response_format: "mp3",
  speed: 1.0,
});

// Play the audio
const blob = new Blob([audio], { type: "audio/mp3" });
const url = URL.createObjectURL(blob);
const player = new Audio(url);
player.play();