Hi everyone,
I currently have a LiveKit Agent running on a single AWS EC2 instance. I’m planning to perform load testing to understand how the system behaves when a large number of users connect simultaneously.
My goal is to horizontally scale the agent service by running multiple EC2 instances (for example, 3–4 instances) with the same LiveKit configuration and credentials.
I have a few questions:
-
If multiple agent instances are running, how are incoming requests or agent jobs distributed among them?
-
Does LiveKit automatically route jobs to an available/free agent instance, or is additional configuration required?
-
What is the recommended architecture for autoscaling LiveKit Agents on AWS based on load (CPU, memory, active sessions, etc.)?
Any guidance or best practices for load testing and autoscaling LiveKit Agents would be greatly appreciated.
Thanks!