This question originally came up in our Slack community and the thread has been consolidated here for long-term reference.
We’re interested in developing a device-directed speech detection model and running it on our LiveKit Cloud server, possibly with similar performance requirements to the turn detection model. Is it possible to bring our own model to run in LiveKit Cloud?