LiveKit Support for Transcript Normalization Before LLM Calls

sahil.dutta · June 11, 2026, 5:52am

Hi LiveKit Team,

We’re using LiveKit Agents (1.5.16) with Azure OpenAI (gpt-5.4 via openai.LLM.with_azure).

We’re seeing Azure Prompt Shield false positives when callers spell their names letter-by-letter (e.g. “S A H I L”). Azure rejects the request with:

{
  "error": {
    "code": "content_filter",
    "status": 400,
    "message": "The response was filtered due to the prompt triggering Azure OpenAI’s content management policy…",
    "innererror": {
      "code": "ResponsibleAIPolicyViolation",
      "content_filter_result": {
        "hate": { "filtered": false, "severity": "safe" },
        "self_harm": { "filtered": false, "severity": "safe" },
        "sexual": { "filtered": false, "severity": "safe" },
        "violence": { "filtered": false, "severity": "safe" },
        "jailbreak": { "detected": true, "filtered": true }
      }
    }
  }
}

Have you seen this issue before, and does LiveKit provide any built-in way to preprocess or normalize STT transcripts before they are sent to the LLM (for example, converting spelled-out letters into a single word)?

sahil.dutta · June 11, 2026, 5:54am

@darryncampbell @Muhammad_Usman_Bashir

darryncampbell · June 11, 2026, 9:51am

Interesting, I haven’t come across this before.

Anecdotally, I have heard that some customers have had success disabling the policy (as detailed here).

My immediate suggestion would be to preprocess the name in on_user_turn_completed, as documented here: Pipeline nodes and hooks | LiveKit Documentation.

Alternatively, you should also be able to use Pipeline nodes and hooks | LiveKit Documentation, but the former is likely more straightforward.
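
For illustration, here is a rough sketch of what the on_user_turn_completed preprocessing could look like. The helper name collapse_spelled_letters and its regex are my own assumptions, not something LiveKit ships, and the heuristic (three or more single letters in a row, separated by spaces, hyphens, or periods) will also collapse things like "plan A B C", so treat it as a starting point rather than a finished normalizer. The wiring into the agent is shown only as a comment; the new_message.text_content and new_message.content attributes are what I would expect in recent livekit-agents releases, but please verify them against the docs for your version.

```python
import re

# Heuristic (assumption): a run of three or more standalone letters separated
# by spaces, hyphens, or periods, e.g. "S A H I L" or "S-A-H-I-L".
_SPELLED_RUN = re.compile(r"\b[A-Za-z]\b(?:[ .\-]+[A-Za-z]\b){2,}")


def collapse_spelled_letters(text: str) -> str:
    """Join letter-by-letter spellings into a single token.

    Hypothetical helper for illustration; not part of LiveKit.
    """

    def _join(match: re.Match) -> str:
        # Drop the separators and keep only the letters, in order.
        return re.sub(r"[^A-Za-z]", "", match.group(0))

    return _SPELLED_RUN.sub(_join, text)


# Possible wiring inside an Agent subclass (not executed here; attribute names
# are assumptions to check against your livekit-agents version). Note that this
# replaces the whole content list, so any non-text content would be dropped:
#
# class MyAgent(Agent):
#     async def on_user_turn_completed(self, turn_ctx, new_message):
#         text = new_message.text_content or ""
#         new_message.content = [collapse_spelled_letters(text)]
```

With this, "My name is S A H I L" would reach the LLM as "My name is SAHIL", while ordinary sentences such as "I am fine" are left alone because the single letter "I" is not followed by further single letters.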
