Support for Live STT Partial Transcripts in Python SDK for OpenAI models

Akshay_Sharma · June 22, 2026, 9:51am

Hi LiveKit team,

I’m using the LiveKit Python Agents SDK with the OpenAI STT plugin and the gpt-4o-mini-transcribe model, and have a question regarding interim transcript streaming.

My use case is live captioning/transcript updates while the user is speaking. However, the current behavior I’m seeing is that no transcript events are emitted while the user is actively speaking. Both partial and final transcript events are only received after the user stops speaking and VAD determines the end of the utterance, at which point the transcript is delivered.

What I’m looking for is:

Continuous transcript updates while the user is actively speaking.
Interim/partial transcripts streamed in real time.
The ability to surface those updates immediately in the UI.

Is this expected behavior with the Python Agents SDK and the OpenAI STT plugin when using the gpt-4o-mini-transcribe model?

If realtime interim transcripts are supported for gpt-4o-mini-transcribe, is there any configuration or API that needs to be enabled? If not, what is the recommended approach for implementing live transcript streaming with the Python SDK?

For context, I’m using AgentSession with the OpenAI STT plugin and the gpt-4o-mini-transcribe model.

Thanks!

darryncampbell · June 22, 2026, 10:34am

Are you open to changing your STT provider? That model won’t reliably stream progressive updates.
For example, you could switch to:

    stt=inference.STT(model="deepgram/nova-3", language="multi"),

Then you’ll get partials as your user speaks:

    @session.on("user_input_transcribed")
    def _on_transcript(ev: UserInputTranscribedEvent):
        if ev.is_final:
            logger.info(f">>> FINAL TRANSCRIPT: {ev.transcript!r}")
        else:
            logger.info(f">>> PARTIAL TRANSCRIPT: {ev.transcript!r}")

Akshay_Sharma · June 22, 2026, 11:09am

Hi Darryn,

Currently we would want to use openai models since we have a partnership with them. In order to get this working do we need to then write a custom plugin here, since as per claude analysis it seems this feature is there supported for node based agents?

darryncampbell · June 22, 2026, 12:12pm

Does this work with Node.JS? I tried passing in use_realtime=True, which is what I believe you are referring to, and the STT doesn’t provide the updates you are looking for.

Akshay_Sharma · June 22, 2026, 2:16pm

Ok, so are we saying this is something related to open ai and there are reliability issues from their end ?

darryncampbell · June 22, 2026, 2:41pm

I wouldn’t say it was reliability, I didn’t think it was possible for gpt-4o-mini-transcribe to return partial transcripts continuously as the user is speaking, even if you configure it for realtime. See here, GPT-4o mini Transcribe Model | OpenAI API, it doesn’t offer realtime transcription.

Akshay_Sharma · June 23, 2026, 8:00am

Ok, tried with streaming model offerings from open ai (whisper real-time) and i get the required behavior.

So it seems it’s not there with the STT model offerings from open ai

Topic		Replies	Views
Realtime model with Azure whisper STT Agents python , stt , realtime , openai , azure	17	277	February 26, 2026
Gpt realtime transcription misses Getting Started	1	22	June 18, 2026
Gpt-realtime-1.5 leaks audio control tokens (<\|audio_text\|>, <\|caption_quality_N\|>) into text stream when run with modalities=["text"] Agents tts , realtime	1	35	April 20, 2026
Assistance Needed: Agents egress , stt , llm , tts , realtime	6	80	March 16, 2026
Improving accuracy Agents agent-development , python , plugin , realtime , openai	6	98	May 26, 2026

Support for Live STT Partial Transcripts in Python SDK for OpenAI models

Related topics