Does LLM output stream directly to TTS or wait for complete response?

This question originally came up in our Slack community and the thread has been consolidated here for long-term reference.

Does LLM output get streamed directly to the TTS model, or does it wait for the entire response?

I’m using Inference and can’t find documentation on this. Do I have to create separate node functions?

Inference uses the Pipeline model, where LLM output flows into the TTS model as it is generated. You do not have to wait for the entire response.
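To illustrate the idea (not the framework's actual API — all names below are hypothetical), a streaming pipeline typically buffers LLM tokens only until a sentence boundary, then hands that chunk to TTS so speech can start while generation is still in progress. A minimal sketch:

```python
# Hypothetical sketch: stream LLM tokens into TTS at sentence boundaries
# instead of waiting for the complete response. Names are illustrative only.

def sentence_chunks(token_stream):
    """Yield sentence-sized chunks as soon as each boundary appears."""
    buffer = ""
    for token in token_stream:
        buffer += token
        # Flush on sentence-ending punctuation so TTS can begin speaking.
        if buffer.rstrip().endswith((".", "!", "?")):
            yield buffer.strip()
            buffer = ""
    if buffer.strip():  # flush any trailing partial sentence
        yield buffer.strip()

def fake_llm():
    # Simulated token stream standing in for a real LLM response.
    for token in ["Hello", " there", ".", " How", " can", " I", " help", "?"]:
        yield token

spoken = []
for chunk in sentence_chunks(fake_llm()):
    spoken.append(chunk)  # in a real pipeline: send chunk to the TTS model

print(spoken)  # → ['Hello there.', 'How can I help?']
```

The key point is that each chunk is dispatched as soon as its boundary arrives, so time-to-first-audio depends only on the first sentence, not the whole response.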

See the nodes documentation for more details.