Error: 429 Too Many Requests on agent-gateway.livekit.cloud

o.shafique · April 19, 2026, 8:11am

Hi — Inference gateway returns HTTP 429 for my Cloud project.

Project name: Production Grade Voice Bot
Plan: Build

What I run: LiveKit Agents Python, agent.py console on Windows.
Error: 429 Too Many Requests on agent-gateway.livekit.cloud

TTS prewarm: wss://agent-gateway.livekit.cloud/v1/tts?model=cartesia/sonic-2
STT: livekit.agents.inference.stt.STT also returns 429

When: 2026-04-19 around 11:00 am (my local timezone: Saudi Arabia)
Billing page shows Inference STT/TTS/LLM usage — not at $0 because of “no usage”, but gateway still 429 immediately.

Question: Is there a burst/minute/regional cap or account flag on Inference gateway for Build? Can you check my project?

I am NOT sharing API keys here.

Isaac_Huntsman · April 19, 2026, 3:51pm

I’ve experienced this issue too, I ultimately just stopped using livekit inference.stt. I was getting weird behavior where apparently on connect, the client was sending 4 bursts of stt connection requests, immediately 429’ing.

Does this crash your session?

I suspect this is a bug; the previous post I made about this wasn’t solved as I just switched to a direct STT connection.

One question, though: do you do await ctx.connect() explicitly? If so, is it right above your session.start()?

darryncampbell · April 20, 2026, 9:52am

There are concurrency limits on Inference STT / TTS. Please see this section of the pricing page: Pricing | LiveKit

If you look at your plan quotas: Sign in | LiveKit Cloud, you have a peak usage of 2 STT and 3 TTS. The total of 5 is equal to the concurrency limit on build, so I assume you tried to create a 6th connection, which triggered the 429

Topic		Replies	Views
Failed to synthesize speech: Invalid response status (429 Too Many Requests) Agents tts	1	34	March 26, 2026
429 too many requests when trying to preemptively connect user audio Agents agent-development	1	40	April 10, 2026
Trying to understand how cloud limit on STT and TTS reset Getting Started	11	50	May 27, 2026
TTS/STT Inference fails due to APIConnectionError with no clear error message Agents stt , tts , node-js	4	90	March 8, 2026
Inference STT WebSocket fails (APIConnectionError) while room connection works Agents agent-development	1	34	March 26, 2026

Error: 429 Too Many Requests on agent-gateway.livekit.cloud

Related topics