Hi — Inference gateway returns HTTP 429 for my Cloud project.
Project name: Production Grade Voice Bot
Plan: Build
What I run: LiveKit Agents Python, agent.py console on Windows.
Error: 429 Too Many Requests on agent-gateway.livekit.cloud
- TTS prewarm: wss://agent-gateway.livekit.cloud/v1/tts?model=cartesia/sonic-2
- STT: livekit.agents.inference.stt.STT also returns 429
When: 2026-04-19 around 11:00 am (my local timezone: Saudi Arabia)
Billing page shows Inference STT/TTS/LLM usage — not at $0 because of “no usage”, but gateway still 429 immediately.
Question: Is there a burst/minute/regional cap or account flag on Inference gateway for Build? Can you check my project?
I am NOT sharing API keys here.