Best STT Alternative to OpenAI whisper-1 for Japanese in LiveKit

lick · March 7, 2026, 8:55am

Hi,

I want to ask about best practices for TTS in LiveKit. We currently use OpenAI whisper-1 (realtime) as the STT model in our LiveKit agent to transcribe Japanese utterances, but we sometimes experience delays. Because of that, we’re planning to replace OpenAI whisper-1 with another model. What would be the best choice? Does anyone have experience with this?

Domen_Zajc1 · March 9, 2026, 7:20am

I don’t have experience with the Japanese language, but I would definitely try Soniox because it’s in my opinion the best STT out there for non-English languages.

Saqlain_Ahmed_P · March 9, 2026, 8:12am

You can Try making use of Deepgram Nova-3 and Nova-2. Also you can try to make use of Nvidia Riva.

Topic		Replies	Views
Real-time STT with auto language detection and code-switching support Agents stt	1	50	January 21, 2026
Lowest latency STT/TTS/LLM stack for German - what's your experience? Agents agent-development , stt , llm , tts	1	79	March 13, 2026
Realtime model with Azure whisper STT Agents python , stt , realtime , openai , azure	17	247	February 26, 2026
Add api ElevenLabs key to agents TTS Getting Started	4	69	March 26, 2026
Elevenlabs Voice ID outside of Defaults - Livekit Inference Agents livekit-inference , elevenlabs	3	39	April 12, 2026

Best STT Alternative to OpenAI whisper-1 for Japanese in LiveKit

Related topics