Hey guys I have been using gpt 4o mini as the model; sarvam v3 as TTS and Deepgram STT Nova 3 and am getting abnormal latency. Is it because of livekit or something with my code. And when I am using tool calling thats taking like 10-10 sec. The voice quality also isnt very good , it keeps on breaking. What should I do/ optimise??
How do you deploy it? is it self-hosted? also where are you calling from vs where are the different components located?
I would start by looking at Agent insights and usage metrics. That should help you to start isolating the issue. If the voice is not good try a different model and see if it fits your needs more.
This doc may help: