Configurable Backchannel Responses for Natural Conversational Flow

I am currently working on a conversational voice agent and would like to explore whether LiveKit supports (or plans to support) configurable backchannel behavior during active user speech.

Specifically, I am looking to implement functionality where the conversational agent can speak configurable backchannel words (e.g., “mm-hmm”, “I see”, “right”, etc.) while the user is actively speaking. The goal is to make the interaction feel more natural and reassure the user that they are being heard — without interrupting or cutting off their speech.

The key requirements are:

  • The agent should emit backchannel words only from a predefined configurable list.

  • The agent must not interrupt or override the user’s speech.

  • The timing between backchannel utterances should be controllable (e.g., randomized within a defined interval range).

  • No words outside the configured list should be spoken.

Could you please let me know:

  1. Whether this type of functionality is currently supported?

  2. If not natively supported, whether there are recommended patterns using LiveKit agent to implement this behavior?

I would appreciate any guidance or documentation you can share.

Hi, this is not currently supported, but we are actively working on bringing this feature to LiveKit. Unfortunately I don’t have anything I can share on a public forum at this time but I hope to be able to give more information very soon.

Thanks again for your time and support. I look forward to hearing more soon.