Generate reply timeout for gemini-live-2.5-flash-native-audio realtime model

@livekit @LiveKit-Community @livekitteams @darryncampbell @CWilson @Muhammad_Usman_Bashir
Hi LiveKit Team,

We are experiencing intermittent timeouts while using the Gemini Live 2.5 Flash Native Audio realtime model via Vertex AI with LiveKit Agents.

Environment:

  • LiveKit Agents Version: 1.5.17
  • Model: gemini-live-2.5-flash-native-audio

Error observed:

failed to generate a reply: generate_reply timed out waiting for generation_created event

The issue occurs during response generation, where the agent waits for the generation_created event but eventually times out.

We would like to understand:

  1. Is this a known issue with Gemini Live 2.5 Flash Native Audio integration?

  2. Are there any recommended configuration changes, timeout settings, or code adjustments required when using this model with LiveKit Agents?

  3. Have there been any fixes or improvements in versions newer than 1.5.17 related to realtime Gemini models?

    Any guidance would be greatly appreciated.

There’s a known issue with Gemini Live 3.1 generate_reply, but no known issue with Gemini Live 2.5.

This issue appears to be a similar report:

But that fix was implemented months ago now.

How can this be reproduced? Also, was this working for you in a previous version, and this is a regression with the latest release?