Just wanted to confirm few things we are using deepgram stt whose trancripttion latency is 200 ms and we’re using Livekit’s multilingual turn detection model here we’ve setup min_delay as 400 ms and max_delay as 2 secs with this config my eou_delay is around ~ 900 ms so is it because of turn detection is taking more time then required or something else as i was under the assumption that turn detector take around 50-180 ms not this much time can shed some light on it
900ms is longer than I would expect, I usually see about half that.
As for what is ‘typical’, many developers compare the experience to speaking with the LiveKit agent hosted at https://livekit.com/ (select ‘Talk to LiveKit Agent’), and that will give you metrics as you speak.