How to set max tokens for OpenAI Realtime model

LiveKit-Community · January 21, 2026, 1:36pm

This question originally came up in our Slack community and the thread has been consolidated here for long-term reference.

How do I give max tokens in the OpenAI Realtime model?

LiveKit-Community · January 21, 2026, 1:36pm

Use update_options on the model:

llm_model = openai.realtime.RealtimeModel()
llm_model.update_options(max_response_output_tokens=500)

session = AgentSession(
    llm=llm_model,
    vad=silero.VAD.load()
)

Topic		Replies	Views
Gpt-realtime-1.5 leaks audio control tokens (<\|audio_text\|>, <\|caption_quality_N\|>) into text stream when run with modalities=["text"] Agents tts , realtime	1	15	April 20, 2026
Using livekit.agents.llm.RealtimeModel with liteLLM Agents llm , realtime , openai	2	39	March 2, 2026
Facing errors while calling update_chat_ctx when using azure open ai realtime llm Agents agent-development , python , realtime	2	32	March 17, 2026
Realtime model with Azure whisper STT Agents python , stt , realtime , openai , azure	17	106	February 26, 2026
Can I use session.say() with a realtime model? Agents realtime	1	15	January 21, 2026

How to set max tokens for OpenAI Realtime model

Related topics