Handling response latency: Playing fallback/filler audio if no response arrives within a timeout

Aman · April 11, 2026, 5:41pm

I’m building a real-time voice interaction system using the LiveKit SDK, where responses from my backend (LLM + TTS) can sometimes take longer than expected.

To improve the user experience, I’m considering adding a fallback mechanism:

After sending a user query, I start a timer (e.g., 1.5 seconds)
If no response audio has started by then, I play a short filler message like “Let me check that for you…”
If the actual response arrives while the filler is playing, I want the filler to finish playing and then switch to the real response audio

However, since the STT → LLM → TTS pipeline is managed internally by LiveKit, I don’t have direct control over:

I wanted to ask:

What’s the recommended way in LiveKit to handle this kind of timed fallback behavior?
Is there a best practice for interrupting and replacing an ongoing audio track with a new one?
Should this be handled entirely at the application layer, or is there any built-in support/pattern in LiveKit for such cases?
If anyone has implemented something similar, I’d love to hear how you handled audio track switching and synchronization
Even a minimal sample showing how to inject and interrupt audio alongside the pipeline would be really helpful.

Any guidance or examples would be really helpful. Thanks!

Rajan_kumar · April 15, 2026, 7:25am

@darryncampbell Did you get a chance to review this? It’s important for us as we’re planning to launch the product soon.

What I want here is: if the LLM response arrives before the threshold timer is reached, we should cancel the filler message instead of sending it.

I checked this example: agents/examples/voice_agents/fast-preresponse.py at main · livekit/agents · GitHub. However, it plays a filler message on every turn, rather than only when the response takes longer than a time threshold.

darryncampbell · April 16, 2026, 9:31am

I can’t find an example which shows this, but what sounds most sensible to me is:

Start a timer in on_user_turn_completed() (see Pipeline nodes and hooks | LiveKit Documentation)
Cancel the timer in llm_node() when the agent responds (see Pipeline nodes and hooks | LiveKit Documentation)
If the timer fires, call generate_reply() to play the filler

We also have this page, which you have probably seen, for a similar use case: External data and RAG | LiveKit Documentation
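
The three steps in the reply above could be sketched roughly as follows. This is a library-agnostic illustration using only asyncio, not LiveKit code: FillerTimer, play_filler, wait_for_filler and demo are hypothetical names made up for this sketch. The assumption is that, in a real agent, start() would be called from on_user_turn_completed(), cancel() from llm_node() once the first response chunk is produced, and play_filler would wrap generate_reply() or whatever call you use to speak the filler. That wiring is untested here.

```python
import asyncio
from typing import Awaitable, Callable, Optional


class FillerTimer:
    """Hypothetical helper: plays a filler only if no response starts within `timeout`."""

    def __init__(self, timeout: float, play_filler: Callable[[], Awaitable[None]]) -> None:
        self._timeout = timeout
        self._play_filler = play_filler  # assumed to wrap the call that speaks the filler
        self._handle: Optional[asyncio.TimerHandle] = None
        self._filler_task: Optional[asyncio.Task] = None

    def start(self) -> None:
        """Step 1: call when the user's turn ends."""
        self.cancel()  # drop any timer left over from a previous turn
        loop = asyncio.get_running_loop()
        self._handle = loop.call_later(self._timeout, self._fire)

    def cancel(self) -> None:
        """Step 2: call when the response starts. Only the pending timer is
        cancelled; a filler that is already playing is left to finish."""
        if self._handle is not None:
            self._handle.cancel()
            self._handle = None

    def _fire(self) -> None:
        """Step 3: the timeout elapsed with no response, so start the filler."""
        self._handle = None
        self._filler_task = asyncio.create_task(self._play_filler())

    async def wait_for_filler(self) -> None:
        """Await a filler that was started, so the real response plays after it."""
        if self._filler_task is not None:
            await self._filler_task


async def demo(response_delay: float, timeout: float = 0.1) -> list[str]:
    """Simulates one turn and returns the order of events."""
    events: list[str] = []

    async def play_filler() -> None:
        events.append("filler start")
        await asyncio.sleep(0.1)  # stands in for the filler audio duration
        events.append("filler end")

    timer = FillerTimer(timeout, play_filler)
    timer.start()                        # user turn completed
    await asyncio.sleep(response_delay)  # simulated LLM + TTS latency
    timer.cancel()                       # first response chunk arrived
    await timer.wait_for_filler()        # let a started filler finish
    events.append("response")
    return events
```

Running asyncio.run(demo(0.0)) simulates a fast response, where the timer is cancelled before it fires; asyncio.run(demo(0.15)) simulates a slow one, where the filler starts and the response is held until it has finished. Using a timer handle for the wait and a separate task for the filler is a design choice that keeps cancel() from cutting off a filler mid-sentence.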
