SIP participants transcripts

Tushar_Gupta · March 13, 2026, 12:48pm

Was not able to find a reasonable answer to this so posting here. What is the best possible way to produce transcripts for sip participants in a livekit room. I’m aware of the agent way, but it will be cpu intensive as I intend to use VAD for quality and there maye be any number of sip participants. For web participants I’m already trying out running vad in the client side itself and then piping the transcript through an api or ws. Is there a similar way for SIPs?

Feel free to ask any questions if I was not able to articulate the problem.

Saqlain_Ahmed_P · March 16, 2026, 6:24am

To get transcripts for SIP participants in a LiveKit room without burning too much CPU, try running VAD on the server side. Use a media server like Janus Gateway or FreeSWITCH to handle the audio streams, detect speech with a VAD library, and generate transcripts using a speech-to-text service. Send the transcripts to your app via an API or WebSocket. This way, you can handle multiple SIP participants efficiently.

Tushar_Gupta · March 16, 2026, 6:58am

What do you mean by “handle it n server side”. Afaik, using an agent to subscribe to audio tracks and then using vad is also basically a server side solution. Are you suggesting the same or something else?

Saqlain_Ahmed_P · March 16, 2026, 10:14am

Yes, i think it is what i am trying to convey.

Also, try out on device solutions making use of VAD directly on the client device using lightweight libraries like Vosk or WebRTC VAD.
Alternatively, You may try to Send the generated transcripts to your application via an API or WebSocket.

darryncampbell · March 16, 2026, 1:37pm

I am not aware of a better approach than using an agent, as you already explored. To save resource, you only need to enable the STT part of the pipeline, Text and transcriptions | LiveKit Documentation .

We do have an example of a multi user transcriber: agents/examples/other/transcription/multi-user-transcriber.py at main · livekit/agents · GitHub

Topic		Replies	Views
How to transcribe all participants in a room, not just the first one Agents agent-development	1	16	January 21, 2026
Audio Isolation/Volume control in LiveKit SIP Rooms Agents agent-development , python , sip-trunking	3	48	February 17, 2026
Best way to get conversation transcription between agent and caller Agents agent-development	1	12	January 21, 2026
Agents-playground not displaying text in Speech to text agent Agents agent-development , stt	2	16	March 11, 2026
Carrier-specific audio issues with SIP telephony callers (G.711 via Asterisk) — seeking advice on VAD/NC configuration Telephony sip-other-provider , turn-detection	0	37	March 23, 2026

SIP participants transcripts

Related topics