This question originally came up in our Slack community and the thread has been consolidated here for long-term reference.
Can someone recommend a good TTS provider? I was considering ElevenLabs, but it’s too expensive.
Here’s my current setup:
session = AgentSession(
    stt=deepgram.STT(**stt_config),
    llm=openai.realtime.RealtimeModel(
        voice="verse",
        modalities=["text", "audio"],
        turn_detection=TurnDetection(
            type="semantic_vad",
            eagerness="high",
            create_response=True,
            interrupt_response=True,
        ),
    ),
)
I’m trying to achieve a more human-like conversation with the agent.