Slow draining of old versions

First of all, I love the platform. It's such a great experience to launch an agent here.

One thing that's a bit frustrating: when I deploy a new version of the agent, the previous version takes a long time to drain, and I don't fully understand why.

It usually takes around one hour. The draining state tag in LiveKit Cloud helps with understanding the state of the version rollout, but it gets a bit frustrating, especially when you're fixing an issue in production and the feedback loop is this long.

Btw, I’m not handling lots of calls (~6 calls an hour)

Happy to help with more details,
Thanks

CWilson wrote a really good blog post that explains the drain process: "Deployment reliability on LiveKit Cloud" on the LiveKit blog.

If your agents are consistently in the draining state for the full hour, it sounds like they are not exiting cleanly for some reason, since 1 hour is the upper bound.
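
To illustrate why a session that never exits pushes the drain to its upper bound, here is a rough sketch in plain asyncio. It does not use the LiveKit SDK or reflect how LiveKit Cloud implements draining; `drain`, `call_session`, and `demo` are hypothetical names made up for this example, and the timeouts are scaled down to fractions of a second in place of the 1 hour limit.

```python
import asyncio


async def drain(jobs, timeout):
    """Wait for in-flight jobs to finish, up to `timeout` seconds.

    Returns "clean" if every job exited on its own, or "timed_out" if the
    deadline was hit and the leftover jobs had to be cancelled.
    """
    if not jobs:
        return "clean"
    done, pending = await asyncio.wait(jobs, timeout=timeout)
    for task in pending:
        task.cancel()  # force-stop whatever is still running at the deadline
    if pending:
        await asyncio.gather(*pending, return_exceptions=True)
        return "timed_out"
    return "clean"


async def call_session(duration, exits_cleanly=True):
    """Hypothetical stand-in for one call handled by the agent."""
    await asyncio.sleep(duration)  # the call itself
    if not exits_cleanly:
        await asyncio.Event().wait()  # e.g. a leaked task that never returns


async def demo():
    # All sessions exit when their call ends: the drain finishes right away.
    good = [asyncio.create_task(call_session(0.01)) for _ in range(3)]
    clean = await drain(good, timeout=0.5)

    # One session never exits: the drain always runs to the full timeout.
    mixed = [
        asyncio.create_task(call_session(0.01)),
        asyncio.create_task(call_session(0.01, exits_cleanly=False)),
    ]
    stuck = await drain(mixed, timeout=0.1)
    return clean, stuck


if __name__ == "__main__":
    print(asyncio.run(demo()))
```

In this toy model, the second drain only ends because the deadline fires, no matter how short the calls were. That matches the symptom of always hitting the one hour mark with only ~6 calls an hour, so it is worth checking whether something in the agent keeps running after the call ends.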
