Adaptive Turn Handling Ignores Single Word Answers

Ubani_Balogun · May 13, 2026, 4:21am

Hi there,

I’ve noticed that adaptive turn handling by default doesn’t accept single-word answers like “Yes”, “No” or fuller words like “Chicago”. It filters them out as noise. Is there a way to configure this so it doesn’t ignore one word answers? My use-case here is a questionnaire where I anticipate responders may answer with single-words.

I’m looking at the interruption options here and its not immediately clear which knob is the right one Turn handling options | LiveKit Documentation

darryncampbell · May 13, 2026, 9:36am

What you’re describing isn’t an interruption since the agent has finished asking its question. The default settings should serve you well:

turn_detection=MultilingualModel(),

Unless your environment is extremely noisy, I would expect the Agent to be able to pick out what you are saying: Noise & echo cancellation | LiveKit Documentation

I think this is more likely your STT settings, can you share your agent session configuration?

To answer your question however, if you wanted to test your hypothesis, you could disable adaptive turn handling as follows:

session = AgentSession(
    # ... stt, llm, tts, vad
    turn_handling=TurnHandlingOptions(
        turn_detection=MultilingualModel(),
        interruption={
            "mode": "vad", <-- This setting
        },
    ),
)

Muhammad_Usman_Bashir · May 13, 2026, 5:44pm

@Ubani_Balogun, Building on @darryncampbell’s STT hypothesis, two specific places single-word answers usually drop:

STT endpointing too short. A quick “Yes” lands before the final transcript commits; the framework sees no transcript. Check your STT plugin’s endpointing / utterance-finalization settings (Deepgram, Google, AssemblyAI all expose this).
Turn detector waiting for more. MultilingualModel is contextual; a bare “Yes” plus silence can read as “user paused mid-thought.” TurnHandlingOptions.min_endpointing_delay is the knob; if you bumped it for phone use, short answers feel swallowed.

Your AgentSession config will pin which one.

Ubani_Balogun · May 13, 2026, 11:11pm

@darryncampbell / @Muhammad_Usman_Bashir . Thanks for the comments! This ended up getting resolved by a mix of version updates (bumping to 1.4.1) and turn handling configurations. The config I ended up with is below. It seems to have stabilized now but open to any thoughts you have on my current config. Thanks!

turnHandling: {
      turnDetection: new livekit.turnDetector.MultilingualModel(),
      endpointing: { minDelay: 400, maxDelay: 1500 },
      interruption: { backchannelBoundary: null },
      preemptiveGeneration: { enabled: true },
    },

darryncampbell · May 14, 2026, 8:17am

Glad you resolved the issue but I’m confused by your turn handling options

Ubani Balogun:

turnHandling: {
      turnDetection: new livekit.turnDetector.MultilingualModel(),
      endpointing: { minDelay: 400, maxDelay: 1500 },
      interruption: { backchannelBoundary: null },
      preemptiveGeneration: { enabled: true },
    },

endpointing is defined here: Turn handling options | LiveKit Documentation but you need to specify the mode.

backchannelBoundary does not exist, it is not a supported property.

I assume your coding assistant is hallucinating, I strongly recommend you follow the instructions at Coding agent support and tools | LiveKit Documentation to install our MCP server and update your Agents.md file; the difference in how useful the coding assistant is after these changes is significant.

Ubani_Balogun · May 24, 2026, 2:58am

Hey @darryncampbell , Can we double confirm this please and clarify please? I’m looking in the livekit/agent-js repository and backchannelBoundary is documented in the codebase (but not in the livekit documentation). See here

github.com/livekit/agents-js

agents/src/voice/turn_config/interruption.ts

0f29f6bd3


      
             * pass through.
             *
             * Pass a single number to use the same value for both the start and end boundaries, or a
             * `[start, end]` tuple to configure them separately. The end value should be higher than the
             * start to account for STT transcript timestamp inaccuracy.
             *
             * `null` disables.
             *
             * @defaultValue [1000, 3500]
             */
            backchannelBoundary: number | [number, number] | null;
          }
          
          export const defaultInterruptionOptions = {
            enabled: true,
            mode: undefined,
            discardAudioIfUninterruptible: true,
            minDuration: 500,
            minWords: 0,
            falseInterruptionTimeout: 2000,
            resumeFalseInterruption: true,

Can we also get clarity on whether Dynamic endpointing is supported in the Agent-js library? The official documentation says its supported in Python only but I see the option available in the Agent-js library here

github.com/livekit/agents-js

agents/src/voice/turn_config/endpointing.ts

0f29f6bd3


      
          
          /**
           * Configuration for endpointing, which determines when the user's turn is complete.
           */
          export interface EndpointingOptions {
            /**
             * Endpointing mode. `"fixed"` uses a fixed delay, `"dynamic"` adjusts delay based on
             * end-of-utterance prediction.
             * @defaultValue "fixed"
             */
            mode: 'fixed' | 'dynamic';
            /**
             * Minimum time in milliseconds since the last detected speech before the agent declares the user's
             * turn complete. In VAD mode this effectively behaves like `max(VAD silence, minDelay)`;
             * in STT mode it is applied after the STT end-of-speech signal, so it can be additive with
             * the STT provider's endpointing delay.
             * @defaultValue 500
             */
            minDelay: number;
            /**
             * Maximum time in milliseconds the agent will wait before terminating the turn.

Muhammad_Usman_Bashir · May 24, 2026, 6:47am

@Ubani_Balogun, Looks reasonable. The endpointing tweaks (minDelay 400ms / maxDelay 1500ms) are tighter than the defaults (500ms / 3000ms) [Turn-taking tuning | LiveKit Documentation], which matches the questionnaire use case where short answers benefit from snappier turn closes. preemptiveGeneration: enabled matches the default explicitly [same page], so no behavior change there.

One thing for the future: backchannelBoundary: null isn’t documented on the turn-handling pages I can find, so the behavior you’re relying on is whatever the current Node implementation does. If the bump to 1.4.1 was specifically what unblocked the short-word case, that’s a signal the backchannel handling changed in that release. If your scenarios broaden to longer multi-clause answers, that’s the flag to revisit.

Glad it stabilized.

darryncampbell · May 26, 2026, 9:45am

@Ubani_Balogun thanks for flagging:

I owe you an apology!! I must have not pulled the latest code before my response a fortnight ago. BackchannelBoundary was added in Python here: feat(interruption): barge-in cooldown window for corrections by chenghao-mou · Pull Request #5269 · livekit/agents · GitHub and ported over to JS shortly after. There is a docs ticket in the system to document this externally.
There is a docs ticket in place to document dynamic endpointing mode in Agents-js. I can confirm the options are supported, we just haven’t documented them yet.

Topic		Replies	Views
Python Agents 1.5.0 Released Agents python	0	199	March 19, 2026
With the new adaptive interruption feature, im seeing the same agent interruption behaviour resume_false_interruption True or False Getting Started	1	30	April 6, 2026
Solving unwanted interruptions with Adaptive Interruption Handling Getting Started	6	101	March 30, 2026
Turn detection support for local languages (Hindi, Punjabi, Tamil) Agents agent-development	1	74	January 21, 2026
Please update Turn Detection Model Server SDKs turn-detection	3	78	March 9, 2026

Adaptive Turn Handling Ignores Single Word Answers

Related topics