Avaturn.Live renders a photo-realistic avatar driven by a real-time conversation engine. Mint a session on your backend, pass a short-lived token to the frontend, and the Web SDK streams the avatar over WebRTC.

How it works

1. Create a session: POST /api/v1/sessions with a conversation engine config. Returns session_id (backend) and token (frontend).
2. Connect: Pass token to the Web SDK. The avatar joins over WebRTC and starts conversing.
3. Terminate: DELETE /api/v1/sessions/{id}, or let it expire on idle.
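
The backend half of these steps could be sketched roughly as follows. This is a minimal illustration, not the documented client: the base URL, the bearer-token header, and the `conversation_engine` body field are all assumptions made for the example, and only the two paths and the `session_id` / `token` response fields come from the steps above. The requests are built but not sent, so the shape can be inspected without network access.

```python
import json
import urllib.parse
import urllib.request

# Assumption: placeholder host. Substitute the real API base URL from the API Reference.
API_BASE = "https://api.example.invalid"


def build_create_session_request(api_key: str, engine_config: dict) -> urllib.request.Request:
    """Build (without sending) the POST /api/v1/sessions request."""
    # Assumption: the body field name "conversation_engine" is hypothetical.
    body = json.dumps({"conversation_engine": engine_config}).encode("utf-8")
    return urllib.request.Request(
        url=f"{API_BASE}/api/v1/sessions",
        data=body,
        method="POST",
        headers={
            # Assumption: bearer-token auth is a guess, not confirmed by this page.
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
    )


def build_terminate_session_request(api_key: str, session_id: str) -> urllib.request.Request:
    """Build (without sending) the DELETE /api/v1/sessions/{id} request."""
    safe_id = urllib.parse.quote(session_id, safe="")
    return urllib.request.Request(
        url=f"{API_BASE}/api/v1/sessions/{safe_id}",
        method="DELETE",
        headers={"Authorization": f"Bearer {api_key}"},
    )


def split_session_response(payload: dict) -> tuple:
    """Separate what stays on the backend from what is handed to the browser."""
    backend_only = {"session_id": payload["session_id"]}
    for_frontend = {"token": payload["token"]}
    return backend_only, for_frontend
```

In a real backend, each request would be sent with `urllib.request.urlopen` (or an HTTP client of your choice), and only the `for_frontend` part would be returned to the browser, so that `session_id` stays server-side for the later DELETE call.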

Conversation engines

| Engine | When to use | Configured |
| --- | --- | --- |
| OpenAI Realtime | Low-latency voice-to-voice with inline prompts | Per session, ephemeral client secret |
| Cartesia | Voice-to-voice agents deployed on Cartesia Line | Per agent, in your Cartesia deployment |

New integrations should use OpenAI Realtime: it’s the fastest path to a working voice-to-voice avatar. Cartesia is the right pick if your agent already lives on Cartesia Line.

A legacy text-echo flow (backend pushes text, avatar speaks it) is preserved under Legacy for existing integrations.
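
To make the difference in the "Configured" column concrete, here is a hedged sketch of how a backend might choose between the two engines when assembling a session config. Every field name and type identifier below (`type`, `instructions`, `agent_id`, `"openai_realtime"`, `"cartesia"`) is hypothetical and chosen only for illustration. The real schema is in the API Reference, and the ephemeral client secret mentioned for OpenAI Realtime is not modeled here.

```python
from dataclasses import asdict, dataclass
from typing import Optional


@dataclass
class OpenAIRealtimeEngine:
    # Configured per session: the prompt travels inline with the session request.
    instructions: str
    type: str = "openai_realtime"  # hypothetical identifier


@dataclass
class CartesiaEngine:
    # Configured per agent: the session only points at an agent that already
    # lives in your Cartesia deployment.
    agent_id: str
    type: str = "cartesia"  # hypothetical identifier


def choose_engine(
    cartesia_agent_id: Optional[str] = None,
    instructions: str = "You are a helpful assistant.",
) -> dict:
    """Follow the guidance above: Cartesia only if an agent already exists there."""
    if cartesia_agent_id:
        return asdict(CartesiaEngine(agent_id=cartesia_agent_id))
    return asdict(OpenAIRealtimeEngine(instructions=instructions))
```

Calling `choose_engine()` with no arguments yields the OpenAI Realtime variant, matching the recommendation for new integrations. Passing an existing agent ID switches to the Cartesia variant.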

Start here

Quickstart: Voice-to-voice avatar in 5 minutes.

Web SDK: Render, devices, events, lifecycle.

REST API: Session lifecycle from your backend.

API Reference: Endpoints, schemas, errors.