I want to play music while executing a tool; it should play in parallel with the execution of the API call.

from livekit import rtc
from livekit.agents import Agent, RunContext, function_tool

# Pre-synthesize a hold message once at startup
HOLD_FRAMES: list[rtc.AudioFrame] = []

async def preload_hold_message(tts) -> None:
    global HOLD_FRAMES
    async for event in tts.synthesize("Let me check that for you."):
        HOLD_FRAMES.append(event.frame)

class MyAgent(Agent):
    @function_tool()
    async def check_order_status(
        self,
        context: RunContext,
        order_id: str,
    ) -> str:
        """Check the status of an order.

        Args:
            order_id: The order ID to look up.
        """
        async def cached_audio():
            for frame in HOLD_FRAMES:
                yield frame

        # Play the hold message concurrently — don't await
        hold_handle = context.session.say(
            "Let me check that for you.",
            audio=cached_audio(),
            add_to_chat_ctx=False,
        )

        # Call the external API (runs while the hold message plays);
        # fetch_order_status is your own application logic
        result = await fetch_order_status(order_id)

        # If the API returned before the hold message finished, cancel it
        if not hold_handle.interrupted and not hold_handle.done():
            hold_handle.interrupt()

        return result

I found this code in this doc: https://docs.livekit.io/agents/multimodality/audio/#caching-tts
Can anyone confirm whether it's possible to play music from a file here?

@darryncampbell tagging you to check if this is possible

Yes, you can play music from a file in an agent.

Use BackgroundAudioPlayer and pass a local file path. Local files are supported and decoded automatically, as described in the Background audio guide.
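For reference, a minimal sketch of that approach, based on the Background audio guide. The file path is a placeholder, and `ctx.room` / `session` are assumed to come from your agent entrypoint; this won't run outside a live session:

```python
from livekit.agents import AudioConfig, BackgroundAudioPlayer, BuiltinAudioClip

# Created once per session; the ambient sound is an optional built-in clip
background_audio = BackgroundAudioPlayer(
    ambient_sound=AudioConfig(BuiltinAudioClip.OFFICE_AMBIENCE, volume=0.8),
)

# Inside your entrypoint, after the session has started:
await background_audio.start(room=ctx.room, agent_session=session)

# Play a local music file (decoded automatically); loop=True keeps it playing
play_handle = background_audio.play("/path/to/hold_music.mp3", loop=True)

# ... later, stop the music via the returned handle
play_handle.stop()
```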

Thanks for your insights @darryncampbell.
But I wanted to know specifically whether we are able to play music using session.say() or not.

As mentioned in this doc, we can play audio using session.say(). Does that mean this audio should be voice only, or can it be music as well?

In that case, the audio can be anything; it doesn't have to be speech, as documented here: Agent speech and audio | LiveKit Documentation

Playing audio through session.say is really designed for use cases where you always want the agent to say the same thing every time (such as reading some legalese): it saves TTS credits if you play a pre-recorded wave file rather than synthesizing the same thing every time. I don't see why it wouldn't work for your approach, but that is why I recommended BackgroundAudioPlayer.
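If you do go the session.say route with a file, you would need to decode the file into raw PCM chunks yourself and hand them to say(audio=...) as an async iterable of rtc.AudioFrame, just like the cached hold-message frames above. Here is a stdlib-only sketch of the decode-and-chunk step; the generated sine tone stands in for a real music file, and the rtc.AudioFrame wrapping is only shown in a comment:

```python
import io
import math
import struct
import wave


def wav_to_chunks(wav_bytes: bytes, frame_ms: int = 20):
    """Read mono 16-bit PCM WAV data and split it into fixed-duration chunks.

    Each chunk could then be wrapped like:
        rtc.AudioFrame(data=chunk, sample_rate=rate, num_channels=1,
                       samples_per_channel=len(chunk) // 2)
    and yielded from an async generator passed to session.say(audio=...).
    """
    with wave.open(io.BytesIO(wav_bytes)) as wf:
        rate = wf.getframerate()
        pcm = wf.readframes(wf.getnframes())
    samples_per_chunk = rate * frame_ms // 1000
    bytes_per_chunk = samples_per_chunk * 2  # 16-bit mono samples
    return rate, [
        pcm[i : i + bytes_per_chunk] for i in range(0, len(pcm), bytes_per_chunk)
    ]


def make_test_wav(seconds: float = 0.1, rate: int = 16000) -> bytes:
    """Generate a short 440 Hz sine tone as WAV bytes (stand-in for a music file)."""
    buf = io.BytesIO()
    with wave.open(buf, "wb") as wf:
        wf.setnchannels(1)
        wf.setsampwidth(2)
        wf.setframerate(rate)
        n = int(seconds * rate)
        wf.writeframes(
            b"".join(
                struct.pack(
                    "<h",
                    int(32767 * 0.3 * math.sin(2 * math.pi * 440 * t / rate)),
                )
                for t in range(n)
            )
        )
    return buf.getvalue()


rate, chunks = wav_to_chunks(make_test_wav())
```

That said, this is exactly the plumbing BackgroundAudioPlayer does for you (including non-WAV formats), which is another reason to prefer it for hold music.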