ElevenLabs

INFRA · APIS
Velocity 6.3

AI voice generation platform for creating realistic text-to-speech and voice cloning.

ElevenLabs is turning voice agents into versioned, multi-model infrastructure.

voice-ai · agents · model-releases · telephony · versioning · api
Current state
ElevenLabs is building two layers at once: a flagship model line (Music v2, Speech Engine) and the developer plumbing around agents, including branch merge/rebase previews, version metadata, and new telephony providers. The changelog reads like a platform maturing past single-call TTS into managed agent infrastructure. The scheduled removal of the v1 TTS models and Scribe v1 signals a deliberate cleanup of the older surface.
Where it's heading
The direction is agents-as-software: branches, rebases, previews, and version parents borrow Git's model for managing agent configuration, while telephony (Exotel alongside Twilio and SIP) and Speech Engine widen where that voice runs. Model releases and lifecycle removals are being run on a schedule. Expect the agent-versioning surface and provider integrations to keep expanding.
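The Git-style rebase idea can be illustrated with a toy model. This is only a sketch under stated assumptions: agent configurations are treated as flat dictionaries, and the names `diff` and `rebase_preview` are hypothetical helpers invented for illustration. It does not call or reproduce the ElevenLabs API, and it ignores conflict handling.

```python
from typing import Any

Config = dict[str, Any]  # assumption: an agent config modeled as a flat dict

_MISSING = object()  # sentinel marking a key that is absent on one side


def diff(parent: Config, branch: Config) -> dict[str, Any]:
    """Collect the keys the branch added, changed, or removed relative to its parent."""
    changes: dict[str, Any] = {}
    for key in parent.keys() | branch.keys():
        old = parent.get(key, _MISSING)
        new = branch.get(key, _MISSING)
        if old != new:
            changes[key] = new
    return changes


def rebase_preview(parent: Config, branch: Config, latest_main: Config) -> Config:
    """Replay the branch's changes on top of latest main without mutating any input."""
    result = dict(latest_main)
    for key, value in diff(parent, branch).items():
        if value is _MISSING:
            result.pop(key, None)  # the branch deleted this key
        else:
            result[key] = value  # the branch added or changed this key
    return result


# Hypothetical example: the branch changed the voice while main moved on.
parent = {"voice": "a", "prompt": "hi"}
branch = {"voice": "b", "prompt": "hi"}
latest_main = {"voice": "a", "prompt": "hello", "lang": "en"}
preview = rebase_preview(parent, branch, latest_main)
# preview keeps the branch's voice change and main's newer prompt and lang
```

Because the preview is computed as a new dictionary, it can be inspected before anything is committed, which is the property the preview endpoints are described as providing.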
Prediction
Next likely: broader availability of Speech Engine, more telephony and provider integrations, and completion of the July 9 removal of the v1 TTS models and Scribe v1, which pushes users onto the v2 successors.

Recent moves

  1. 4d ago

    ElevenAgents

    Branch merge and rebase previews let developers inspect a merged or rebased agent configuration before committing, alongside new branch usage metrics. Continues the Git-like model for managing agent configs.

  2. 11d ago

    ElevenAgents

    A branch rebase endpoint lands as the counterpart to the preview work, letting a branch move onto the latest main while keeping its changes; conversation lists gain product-type filtering. Steady buildout of agent versioning.

  3. 18d ago

    Introducing Music v2

    ⚡ SPARK

    Music v2 arrives as a new model with chunk-based composition plans, a step up in structural control from the prompt-only v1 flow. It extends ElevenLabs' model line beyond speech into more controllable music generation.

  4. 25d ago

    Text to Speech

    ElevenLabs set July 9 removal dates for its v1 monolingual and multilingual TTS models and Scribe v1, pushing users onto v2 successors. A routine but breaking lifecycle move that forces migration.

  5. 1mo ago

    ElevenAgents

    Exotel joins Twilio and SIP as a first-class telephony provider with its own outbound-call and phone-number endpoints. Broadens where voice agents can place calls, notably for India-centric deployments.

  6. 1mo ago

    Introducing Speech Engine

    ⚡ SPARK

    Speech Engine lets developers add real-time voice to their own chat agent or LLM, with ElevenLabs handling STT, turn-taking, TTS, and playback while the customer's server owns the logic. It decouples voice I/O from ElevenLabs' hosted agent runtime.
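
The split of responsibilities in the Speech Engine item can be sketched as follows. This is a minimal illustration, assuming the voice layer hands the customer's server a finished user turn as text and expects reply text back. `ChatLogic` and `handle_turn` are hypothetical names; the real message shapes and transport are not described in the source, and no ElevenLabs API is used here.

```python
from dataclasses import dataclass, field


@dataclass
class ChatLogic:
    """Customer-owned logic: keeps the conversation history and decides what to say."""

    history: list[tuple[str, str]] = field(default_factory=list)

    def reply(self, user_text: str) -> str:
        self.history.append(("user", user_text))
        # Placeholder for a call to the customer's own chat agent or LLM.
        answer = f"You said: {user_text}"
        self.history.append(("assistant", answer))
        return answer


def handle_turn(logic: ChatLogic, transcript: str) -> str:
    """Stand-in for one round trip: transcribed text in, text to be spoken out.

    In the described design, the voice layer would perform STT before this
    call and TTS plus playback after it; none of that is modeled here.
    """
    return logic.reply(transcript)


logic = ChatLogic()
spoken = handle_turn(logic, "hello")
```

The point of the sketch is that the server only ever sees and returns text, so the same logic could sit behind a text chat or a voice session.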