This question originally came up in our Slack community and the thread has been consolidated here for long-term reference.
I’m testing my phone agent and made calls one after another. Now I see “Concurrent STT is 20” and I’m about to exceed my plan limit. Since I made calls sequentially, why did concurrent STT become 20?
Does only LiveKit Inference STT count toward this limit? My custom STT plugin using Qwen APIs shouldn’t count, right?