Python 1.5.5: Release livekit-agents@1.5.5 · livekit/agents · GitHub
Node.JS 1.2.8: Release @livekit/agents@1.2.8 · livekit/agents-js · GitHub
livekit-agents@1.5.5
tinalenguyen released this 9 hours ago
What’s Changed
-
feat(inference): STT diarization capabilities and speaker_id on TimedString, add xAI TTS support for inference by @russellmartin-livekit in #5438
-
[inworld] timed_string to no longer have trailing spaces by @ianbbqzy in #5470
-
fix(examples): update e2ee.py to use encryption kwarg and env var by @aryeila in #5469
-
chore(deps): update dependency pillow to v12.2.0 [security] by @renovate[bot] in #5440
-
fix(tests): update preemptive_generation mock to use dict by @longcw in #5468
-
fix(telemetry): bound OTel provider shutdown to avoid watchdog kills by @theomonnom in #5471
-
feat(assemblyai): log connection lifecycle, silence, and session correlators by @dlange-aai in #5476
-
fix: strip markdown emphasis adjacent to punctuation by @carschandler in #5481
-
(aws realtime): add expiry check for cached credentials by @tinalenguyen in #5485
-
(hedra): note deprecation in readme by @tinalenguyen in #5475
-
(deepgram sttv2): add flux-general-multi support by @tinalenguyen in #5486
-
(xai stt): expose endpointing param to user by @tinalenguyen in #5493
-
fix(room-io): ownership-aware FrameProcessor lifecycle management by @longcw in #5467
-
(openai responses): drop prompt_cache_retention in received responses by @tinalenguyen in #5502
-
feat(avatar): add AvatarSession base class, warn on sync mis-wire by @longcw in #5499
-
livekit-agents@1.5.5 by @github-actions[bot] in #5503
New Contributors
- @carschandler made their first contribution in #5481
Full Changelog: https://github.com/livekit/agents/compare/livekit-agents@1.5.4…livekit-agents@1.5.5
@livekit/agents@1.2.8
github-actions released this 12 hours ago
Patch Changes
-
Add Deepgram
flux-general-multiSTTv2 model support with multi-language detection. Introduces a newlanguageHintoption for biasing the model toward specific languages (only used byflux-general-multi), and adds a newsourceLanguagesfield onSpeechDatathat carries all detected languages sorted by prevalence. For multi-language detection, the dominant language is set onlanguagewhilesourceLanguagesretains the full list. - #1275 (@toubatbrian) -
fix(voice): don’t commit unplayed LLM response to chat context when interruption happens before any text is synchronized - #1270 (@u9g)
-
feat(stt): add diarization capabilities and speaker_id support - #1267 (@toubatbrian)
-
feat(voice): add PreemptiveGenerationOptions for fine-grained control - #1265 (@toubatbrian)