Gpt realtime transcription misses

AI_Arjun · June 18, 2026, 1:58pm

Hey, for the folks using the GPT Realtime API for their agents: do you ever face an issue where the text the model generates along with the audio goes missing at times?
That is, the transcript is not shown in the frontend, but the agent still speaks the audio out completely.

Pawel_Lach · June 18, 2026, 2:03pm

As far as I know, with GPT Realtime you cannot see the exact text that the model understands. The transcript rather comes from some sort of STT model running in parallel, like Whisper for example, and therefore there may be a discrepancy between what the model actually hears and what is transcribed.
