Any missing plugins on LiveKit right now?

Laurent_Bo · April 27, 2026, 8:39pm

Hi everyone, Laurent from France

I’ve been digging into the LiveKit plugin ecosystem and want to contribute something. The TTS/STT/LLM lanes seem well-covered, but the surrounding layers (memory, eval, telephony, A/B testing, compliance) maybe not so much?

Would love to hear from people building agents, and also from the team at LiveKit.

What are the plugins you wish existed?
Where do you currently work around the gap, rolling your own, gluing two services together, or just living with it?

Will summarize back what I hear.

Thanks

_E_r_r_o_n_e_o_u_s_D_e_s_i_g_n_s · April 28, 2026, 1:20am

LiveKit is technically a module (large addon), so its plugins are integrations rather than true plugins. Yes, the core lanes (TTS/STT/LLM) are mostly solved. The real pain (and opportunity) is everything around the agent lifecycle. Below is what people end up building themselves over and over: what’s missing, the workarounds people usually use, and what I believe would be worth building if you’re looking for ideas. Just be aware that LiveKit is stubborn, so you’ll have your work cut out for you.

Persistent + Structured Memory (not just vector DB)

What’s missing

→ Session → user → long-term memory unification
→ Write policies (what gets remembered vs ignored)
→ Conflict resolution (old vs new facts)

What people hack together

→ Redis + Pinecone + custom schemas
→ Ad-hoc “memory summarizer” jobs

Plugin that should exist

Drop-in memory layer with:
→ short_term, session, long_term
→ automatic summarization + pruning
→ policy hooks (PII filtering, retention rules)
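
To make the tiers and policy hooks concrete, here is a minimal sketch in plain Python. Every name in it (TieredMemory, WritePolicy, drop_emails) is hypothetical and not a LiveKit API, and "newest fact wins" is only the simplest possible conflict rule.

```python
import re
from dataclasses import dataclass, field
from typing import Callable, Optional

# Hypothetical names throughout; this is not a LiveKit API.
# A policy gets (key, value) and returns the value to store, or None to drop it.
WritePolicy = Callable[[str, str], Optional[str]]

def drop_emails(key: str, value: str) -> Optional[str]:
    """Example policy hook: refuse to remember anything containing an email."""
    return None if re.search(r"[\w.+-]+@[\w-]+\.[\w.]+", value) else value

@dataclass
class TieredMemory:
    policies: list = field(default_factory=list)
    tiers: dict = field(
        default_factory=lambda: {"short_term": {}, "session": {}, "long_term": {}}
    )

    def write(self, tier: str, key: str, value: str) -> bool:
        for policy in self.policies:
            result = policy(key, value)
            if result is None:
                return False  # a policy vetoed the write
            value = result
        self.tiers[tier][key] = value  # naive conflict resolution: newest wins
        return True

    def read(self, key: str) -> Optional[str]:
        # The most specific tier takes precedence over older, broader ones.
        for tier in ("short_term", "session", "long_term"):
            if key in self.tiers[tier]:
                return self.tiers[tier][key]
        return None

memory = TieredMemory(policies=[drop_emails])
memory.write("long_term", "city", "Paris")
memory.write("session", "city", "Lyon")
memory.write("session", "contact", "reach me at a@b.com")
```

Summarization and pruning would sit behind the same write path; they are left out here to keep the sketch short.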

Real-time Evaluation / Guardrails (in-stream, not batch)

What’s missing

→ Evaluating agent responses while speaking
→ Interrupting bad outputs mid-stream

Current workaround

→ Post-hoc eval pipelines (too late)
→ Regex / keyword filters (primitive)

Plugin idea

Streaming evaluator:
→ toxicity / hallucination / policy checks
→ can pause, rewrite, or cut audio mid-sentence
→ integrates directly with LiveKit data/audio tracks
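
A rough sketch of the in-stream idea, in plain Python with no LiveKit calls: wrap the token stream, check the running text on each chunk, and cut over to a fallback as soon as a check fails. The keyword check is only a stand-in for a real toxicity or policy classifier, and guarded_stream is a hypothetical name.

```python
from typing import Callable, Iterable, Iterator

def guarded_stream(
    chunks: Iterable[str],
    is_violation: Callable[[str], bool],
    fallback: str = " Sorry, let me rephrase that.",
) -> Iterator[str]:
    """Yield chunks until the accumulated text fails the check, then cut off."""
    spoken = ""
    for chunk in chunks:
        if is_violation(spoken + chunk):
            yield fallback  # replace the rest of the utterance
            return
        spoken += chunk
        yield chunk

# Stand-in for a real classifier (assumption: a simple blocked-phrase list).
BLOCKED = ("guaranteed returns",)

def keyword_check(text: str) -> bool:
    return any(phrase in text.lower() for phrase in BLOCKED)

out = list(
    guarded_stream(
        ["This fund ", "offers guaran", "teed returns ", "every year."],
        keyword_check,
    )
)
```

Note that "offers guaran" has already been emitted by the time the violation is detected. A production version would likely hold back a small look-ahead buffer before handing text to TTS.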

Telephony Bridge (done right)

What’s missing

→ Clean abstraction over SIP/Twilio + LiveKit
→ Latency-optimized voice pipelines

What people do

→ Glue Twilio + SIP + WebRTC manually
→ Fight jitter, transcoding, and echo

Plugin idea

“Telephony adapter”:
→ inbound/outbound calls
→ DTMF → structured events
→ built-in jitter buffer tuning for voice agents
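
The "DTMF → structured events" part could look roughly like this. It is a plain-Python sketch; DtmfEvent, the two event kinds, and the "#" terminator convention are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Iterable, Iterator

VALID_TONES = set("0123456789*")

@dataclass(frozen=True)
class DtmfEvent:
    kind: str   # hypothetical kinds: "menu_choice" or "digits_entered"
    value: str

def dtmf_events(tones: Iterable[str], terminator: str = "#") -> Iterator[DtmfEvent]:
    """Group raw key presses into events, flushing on the terminator key."""
    buffer = ""
    for tone in tones:
        if tone == terminator:
            if buffer:
                kind = "menu_choice" if len(buffer) == 1 else "digits_entered"
                yield DtmfEvent(kind, buffer)
            buffer = ""
        elif tone in VALID_TONES:
            buffer += tone
        # anything else (noise, unknown symbols) is ignored

events = list(dtmf_events("1#4242#"))
```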

A/B Testing + Experimentation for Agents

What’s missing

→ Compare prompts, voices, models in production
→ Measure real conversational outcomes

Current workaround

→ Manual routing logic
→ Logging + spreadsheets

Plugin idea

Experiment layer:
→ split traffic across agents/prompts/models
→ metrics: latency, interruption rate, user sentiment
→ automatic winner selection
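
For the traffic split specifically, a common approach is deterministic hashing, so the same session always lands in the same variant. A minimal sketch follows; assign_variant and the variant names are hypothetical, and metrics and winner selection are out of scope here.

```python
import hashlib

def assign_variant(session_id: str, variants: dict) -> str:
    """Map a session id to a variant name, proportionally to the weights.

    `variants` maps name -> weight and must be non-empty with a positive total.
    """
    digest = hashlib.sha256(session_id.encode("utf-8")).digest()
    point = int.from_bytes(digest[:8], "big") / 2**64  # uniform in [0, 1)
    total = sum(variants.values())
    cumulative = 0.0
    for name, weight in variants.items():
        cumulative += weight / total
        if point < cumulative:
            return name
    return name  # guard against floating-point rounding on the last bucket

split = {"prompt_a": 0.5, "prompt_b": 0.5}
assignments = [assign_variant(f"session-{i}", split) for i in range(1000)]
```

Because the assignment depends only on the session id, it needs no shared state between workers.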

Conversation Replay + Debugging (this one hurts)

What’s missing

Full timeline replay:
→ audio
→ transcripts
→ LLM decisions
→ tool calls

What people do

→ Log fragments everywhere and pray

Plugin idea

“Session debugger”:
→ scrub timeline like a video editor
→ inspect each token / audio chunk
→ see why the agent said what it said
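
The core of such a debugger is a single time-ordered event log that every component writes to. A minimal sketch, assuming each event carries a timestamp in seconds since session start; the class names and event kinds are hypothetical.

```python
import bisect
from dataclasses import dataclass, field
from typing import Any

@dataclass(order=True)
class TimelineEvent:
    t: float                           # seconds since session start
    kind: str = field(compare=False)   # e.g. "audio", "transcript", "llm", "tool_call"
    payload: Any = field(compare=False)

class SessionTimeline:
    def __init__(self) -> None:
        self._events = []

    def record(self, t: float, kind: str, payload: Any) -> None:
        # Events may arrive out of order from different components;
        # insort keeps the list sorted by timestamp.
        bisect.insort(self._events, TimelineEvent(t, kind, payload))

    def window(self, start: float, end: float) -> list:
        """Return everything that happened between two points on the timeline."""
        return [e for e in self._events if start <= e.t <= end]

timeline = SessionTimeline()
timeline.record(2.4, "tool_call", {"name": "lookup_order"})
timeline.record(1.0, "transcript", "where is my order?")
timeline.record(2.1, "llm", "decided to call lookup_order")
timeline.record(3.0, "audio", "agent speech chunk")
```

Scrubbing is then just a call to window() around the playhead position.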

Compliance / PII / Redaction Layer

What’s missing

→ Real-time redaction before storage or TTS
→ Regional compliance (GDPR, HIPAA-style patterns)

Current workaround

→ Bolt-on filters after the fact

Plugin idea

Streaming compliance middleware:
→ detect + mask PII in audio + text
→ configurable per region
→ audit logs built-in
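
A text-only sketch of the masking step with per-region rules and an audit trail. The regexes are deliberately simple and will miss real-world cases, the region table is invented for illustration, and audio redaction is not covered.

```python
import re

# Simplified patterns for illustration only; real detectors need far more care.
PATTERNS = {
    "email": re.compile(r"[\w.+-]+@[\w-]+(?:\.[\w-]+)+"),
    "phone": re.compile(r"\+?\d[\d\s-]{7,}\d"),
}

# Hypothetical per-region configuration: which PII kinds to mask where.
REGION_RULES = {"eu": ("email", "phone"), "us": ("phone",)}

def redact(text: str, region: str, audit_log: list) -> str:
    """Mask configured PII kinds in a complete text segment and log what was hit."""
    for kind in REGION_RULES[region]:
        text, count = PATTERNS[kind].subn(f"[{kind.upper()}]", text)
        if count:
            # The audit log stores counts only, never the matched values.
            audit_log.append({"kind": kind, "count": count, "region": region})
    return text

log = []
masked = redact("Mail laurent@example.com or call +33 6 12 34 56 78", "eu", log)
```

This works on complete segments such as final transcripts. A truly streaming version also has to handle PII split across chunk boundaries.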

Tool Orchestration (for voice-first, not chat-first)

What’s missing

→ Latency-aware tool calling during speech
→ Partial responses while tools execute

Current workaround

→ Standard LLM tool calling (too slow for voice UX)

Plugin idea

Voice-native tool runner:
→ interruptible
→ progressive responses (“Let me check that…” → result)
→ timeout-aware fallbacks
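
The progressive-response pattern can be sketched with plain asyncio: start the tool, speak a filler if it has not returned quickly, and fall back if it exceeds the overall budget. Here `say` is a stand-in for whatever speaks to the user, and the function name and thresholds are assumptions.

```python
import asyncio
from typing import Awaitable, Callable, Optional

async def run_tool_with_filler(
    tool: Callable[[], Awaitable[str]],
    say: Callable[[str], Awaitable[None]],
    filler_after: float = 0.3,
    timeout: float = 2.0,
    filler: str = "Let me check that...",
    fallback: str = "That is taking too long, I will follow up.",
) -> Optional[str]:
    task = asyncio.ensure_future(tool())
    # Phase 1: give the tool a short head start before saying anything.
    done, _ = await asyncio.wait({task}, timeout=filler_after)
    if not done:
        await say(filler)
        # Phase 2: wait out the rest of the latency budget.
        done, _ = await asyncio.wait({task}, timeout=max(0.0, timeout - filler_after))
    if not done:
        task.cancel()
        await say(fallback)
        return None
    result = task.result()
    await say(result)
    return result

spoken = []

async def fake_say(text: str) -> None:
    spoken.append(text)

async def slow_weather() -> str:
    await asyncio.sleep(0.05)
    return "It is 18 degrees."

asyncio.run(run_tool_with_filler(slow_weather, fake_say, filler_after=0.01, timeout=1.0))
```

Handling user interruption would mean cancelling the task from outside, which this sketch does not show.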

Latency Optimizer / Prewarm Manager

(I literally touched on this problem previously)

What’s missing

Coordinated prewarm:
→ STT
→ TTS
→ LLM
→ audio pipeline

Plugin idea

“Warmup orchestrator”:
→ primes models + audio contexts
→ manages iOS autoplay unlock flows
→ reduces first-token + first-audio delay
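
On the server side, coordinated prewarm mostly means warming everything concurrently and recording what succeeded. A minimal asyncio sketch; the warmer callables are placeholders for real "send a tiny request" calls, and the iOS autoplay part is client-side and not shown.

```python
import asyncio
import time
from typing import Awaitable, Callable, Dict, Optional

async def prewarm_all(
    warmers: Dict[str, Callable[[], Awaitable[None]]],
    timeout: float = 5.0,
) -> Dict[str, Optional[float]]:
    """Run all warmers concurrently; map name -> seconds taken, or None on failure."""

    async def timed(name: str, warm: Callable[[], Awaitable[None]]):
        start = time.perf_counter()
        try:
            await asyncio.wait_for(warm(), timeout)
        except Exception:
            return name, None  # one failed warmup must not block the others
        return name, time.perf_counter() - start

    results = await asyncio.gather(*(timed(n, w) for n, w in warmers.items()))
    return dict(results)

# Placeholder warmers (assumptions): real ones would hit the STT/TTS/LLM providers.
async def warm_ok() -> None:
    await asyncio.sleep(0.01)

async def warm_broken() -> None:
    raise ConnectionError("provider unreachable")

report = asyncio.run(prewarm_all({"stt": warm_ok, "tts": warm_ok, "llm": warm_broken}))
```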

Multi-agent Coordination Layer

What’s missing

→ Multiple agents in one room (handoff, roles)

Current workaround

→ Hardcoded routing logic

Plugin idea

Agent router:
→ role-based agents (sales, support, verifier)
→ dynamic handoff mid-conversation
→ shared memory context

Synthetic Load / Voice Testing

What’s missing

→ Simulate 1000 concurrent voice users
→ Stress test audio + agent latency

Current workaround

→ None, or very hacky scripts

Plugin idea

Voice load tester:
→ generates synthetic conversations
→ measures end-to-end latency + dropouts
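
The harness side of a load tester is small. The hard part is the synthetic caller itself, which is stubbed out below. In this sketch, simulate_call is a placeholder for "join a room, play audio, wait for the agent's reply", and the percentile uses the nearest-rank method.

```python
import asyncio
import math
import time
from typing import Awaitable, Callable

def percentile(sorted_values: list, p: float) -> float:
    """Nearest-rank percentile of an already sorted, non-empty list."""
    index = max(0, math.ceil(p * len(sorted_values)) - 1)
    return sorted_values[index]

async def run_load(
    simulate_call: Callable[[int], Awaitable[None]],
    concurrency: int,
    calls: int,
) -> dict:
    sem = asyncio.Semaphore(concurrency)  # cap on simultaneous synthetic users
    latencies = []
    failures = 0

    async def one(i: int) -> None:
        nonlocal failures
        async with sem:
            start = time.perf_counter()
            try:
                await simulate_call(i)
            except Exception:
                failures += 1
                return
            latencies.append(time.perf_counter() - start)

    await asyncio.gather(*(one(i) for i in range(calls)))
    latencies.sort()
    stats = {"calls": calls, "failures": failures}
    if latencies:
        stats["p50"] = percentile(latencies, 0.50)
        stats["p95"] = percentile(latencies, 0.95)
    return stats

# Stub caller (assumption): every tenth call "drops".
async def fake_call(i: int) -> None:
    await asyncio.sleep(0.001)
    if i % 10 == 0:
        raise RuntimeError("dropped")

stats = asyncio.run(run_load(fake_call, concurrency=20, calls=50))
```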

The highest-impact plugins to build would be one of these:

→ (ref #5) (Replay/Debugger) → everyone needs it immediately
→ (ref #2) (Real-time guardrails) → hard + valuable
→ (ref #8) (Latency/prewarm) → directly improves UX (and dev’s sanity)

The meta gap (important)

What’s really missing is a standard plugin interface for:

→ intercepting audio streams
→ intercepting LLM tokens
→ injecting/modifying responses in real time

If you build your plugins around these concepts (middleware-style), it’ll be way more valuable than a single-purpose tool.
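
To illustrate what "middleware-style" means here, a sketch over a plain token stream: each interceptor takes a stream and returns a stream, so observers and rewriters compose in any order. This is generic Python, not an existing LiveKit interface, and all names are hypothetical.

```python
from typing import Callable, Iterator, List

# An interceptor wraps a token stream and returns a (possibly modified) one.
Interceptor = Callable[[Iterator[str]], Iterator[str]]

def compose(*interceptors: Interceptor) -> Interceptor:
    """Chain interceptors so the first one listed sees the tokens first."""
    def pipeline(stream: Iterator[str]) -> Iterator[str]:
        for interceptor in interceptors:
            stream = interceptor(stream)
        return stream
    return pipeline

def tap(log: List[str]) -> Interceptor:
    """Observe tokens without changing them (for replay or eval)."""
    def interceptor(stream: Iterator[str]) -> Iterator[str]:
        for token in stream:
            log.append(token)
            yield token
    return interceptor

def replace(old: str, new: str) -> Interceptor:
    """Rewrite tokens in flight (for guardrails or redaction)."""
    def interceptor(stream: Iterator[str]) -> Iterator[str]:
        for token in stream:
            yield token.replace(old, new)
    return interceptor

seen: List[str] = []
pipeline = compose(tap(seen), replace("damn", "darn"))
output = list(pipeline(iter(["well ", "damn ", "it"])))
```

The same shape applies to audio frames: only the element type of the stream changes.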

Hope this helps.

Laurent_Bo · April 28, 2026, 10:59am

Thanks a lot @_E_r_r_o_n_e_o_u_s_D_e_s_i_g_n_s for the very detailed answer! This is very helpful.

First, if I understand correctly, there is a missing middleware piece that would let you compose interception/observation around the existing plugins instead of replacing them? And all of those ideas would only work if this clean interface existed?

Also if you have one more minute I am curious to know:

On prewarm. Is it mostly cold start (first call after idle), per-session warmup of STT/TTS/LLM, or both?
On replay/debugger. It seems like a PostHog equivalent is missing. Did you also try to hack this one yourself? If yes, what were the hardest parts?
You mentioned LiveKit can be stubborn. What did you mean by this?

Thanks a lot

CWilson · April 28, 2026, 2:43pm

It is interesting to see that so many of the items above are already available; we may not be doing a good job of communicating these.

For the timeline replay, have you tried Agent Insights?

Benjamin_Lowe · May 11, 2026, 10:12pm

I would love a Fallback Adapter that works with realtime models (not cascaded models) and falls back when the provider is, e.g., unresponsive or rate-limited.

e.g. OpenAI gpt-realtime → Azure gpt-realtime model fallback, etc.

Muhammad_Usman_Bashir · May 11, 2026, 11:11pm

Extending @CWilson’s point: most of this list already exists in livekit-agents 1.5.x, just under-surfaced.

Replay / debugger. Agent Insights, plus conversation_item_added on the AgentSession event stream for self-host.
Real-time guardrails. Subclass Agent and override llm_node() or tts_node() (see “Pipeline nodes and hooks” in the LiveKit documentation). That’s the middleware interface in the meta-gap section.
Progressive tool responses. session.say("checking...") before a slow tool + preemptive_generation=True on AgentSession.
Multi-agent handoff. Shipped as Workflows.

@Benjamin_Lowe FallbackAdapter for realtime is tracked in “Feature Request: Extend Fallback Provider to support OpenAI Realtime API” (Issue #2342, livekit/agents on GitHub), closed 2026-03-17 without an implementation. Workaround: an on_error handler that watches for unrecoverable errors and re-instantiates against the fallback provider.
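
The control flow of that workaround, reduced to a generic sketch: try providers in order and build a fresh session for the next one when the current one fails unrecoverably. This deliberately avoids LiveKit types. UnrecoverableError, run_with_fallback, and the provider factories are hypothetical stand-ins for the real error event and session setup.

```python
import asyncio
from typing import Awaitable, Callable, Sequence, Tuple

class UnrecoverableError(Exception):
    """Hypothetical marker for errors that retrying the same provider won't fix."""

async def run_with_fallback(
    providers: Sequence[Tuple[str, Callable[[], object]]],
    run: Callable[[object], Awaitable[str]],
) -> Tuple[str, str]:
    """Return (provider name, result) from the first provider that succeeds."""
    last_error = None
    for name, make_session in providers:
        session = make_session()  # fresh instance per provider, no shared state
        try:
            return name, await run(session)
        except UnrecoverableError as exc:
            last_error = exc  # remember the cause and move on to the next provider
    raise RuntimeError("all realtime providers failed") from last_error

# Stub session runner (assumption): the primary provider is rate limited.
async def fake_run(session: object) -> str:
    if session == "primary":
        raise UnrecoverableError("rate limited")
    return f"answered via {session}"

used, reply = asyncio.run(
    run_with_fallback(
        [("openai", lambda: "primary"), ("azure", lambda: "secondary")],
        fake_run,
    )
)
```

In a live call the harder part is carrying conversation state over to the new session, which this sketch does not address.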

CWilson · May 11, 2026, 11:36pm

@Benjamin_Lowe As you know, we’ve discussed it in the past. I will raise it with the team again.

Benjamin_Lowe · May 12, 2026, 8:32am

Thanks, yes, we can implement this internally with on_error. I just mentioned it despite the closed issue to answer OP’s question:

Where do you currently work around the gap, rolling your own, gluing two services together, or just living with it?

That would be a gap where we need to roll our own glue.

Kamil · May 25, 2026, 8:12am

Hi, nice summary

We recently published a PII redaction plugin for transcripts. It seems to cover parts of your suggested scope: syntropicsignal-ai/livekit-plugins-piiguard on GitHub (PII redaction for LiveKit Agents. Pluggable local detectors (regex / Presidio) + end-of-call redacted-transcript hook).
