Since the multimodal turn detection is based on Qwen2.5-0.5B, which supports Thai, would it be possible to include Thai as a supported language? Would this require fine-tuning, or just building a proper test set?
If finding native Thai resources is a challenge, I’d be happy to help with that.