Understanding LiveKit Cloud architecture for per-user agents

LiveKit-Community · January 21, 2026, 1:41pm

This question originally came up in our Slack community and the thread has been consolidated here for long-term reference.

I need a clearer explanation of how LiveKit Cloud works.

Our MVP is four Python processes per user: one real-time agent that creates the room and talks to the user, plus three data-only agents that join the room and guide the real-time agent.

We’ve been running this locally, and now we need to scale and integrate it into an application. What’s the recommended LiveKit Cloud approach for deploying and running these agents per user?

LiveKit-Community · January 21, 2026, 1:41pm

Agents are generally per-user. You run one or more of your agent processes and dispatch users and agents to a room. LiveKit Cloud will load-balance across the available agent processes you have running.

It should not be much different than what you’re doing locally, except your agent process runs in the cloud.

See these docs for more details:

Topic		Replies	Views
How Does LiveKit Route Agent Jobs Across Multiple EC2 Instances and Support Autoscaling? Getting Started agent-deployment , livekit-cloud	1	44	June 3, 2026
How to run a single LiveKit worker for multiple clients (multi-tenancy) Agents agent-deployment	1	42	January 21, 2026
LiveKit Cloud billing and concurrency limits for agents Cloud Dashboard agent-deployment	1	72	January 21, 2026
How are workers distributed across multiple LiveKit server nodes? Agents agent-deployment	1	33	January 21, 2026
LiveKit Cloud Agent Draining Cloud Dashboard python , agent-deployment	1	105	February 25, 2026

Understanding LiveKit Cloud architecture for per-user agents

Related topics