The big labs are constantly releasing newer models, and new providers keep showing up.
It’s hard to keep up.
So, post your go-to model(s) and providers. If there are > 10 responses, I’ll maintain a summary of the results for everyone.
Currently it’s gpt-5-chat-latest, but I just saw OpenAI’s deprecation notice for it.
The model choice varies a lot depending on the use case and requirements, and as you mentioned, new models keep getting released. Usually there’s a mix of models in the same agent, each handling a different use case, to optimise for cost and latency.
Do you have examples, @Raghu_Udiyar? Like which models to use for which context?
Curious to better understand the logic behind it.
thx
That’s totally fair.
gpt-5-chat-latest has been good for conversations that run 2-10 minutes, with a good balance of instruction following vs. latency, using about 5-15K context.
I use gpt-4o-mini. Cheaper and faster.
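For anyone wondering what "a mix of models in the same agent" might look like in practice, here is a rough sketch, not anyone's actual setup. The task labels (`conversation`, `quick_lookup`) and the `pick_model` helper are made up for illustration; the model names are just the two mentioned in this thread, and which one suits which task is an assumption.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Route:
    model: str   # model name as mentioned in this thread
    reason: str  # why this model was picked for the task


# Hypothetical task labels; the task-to-model pairing is an assumption.
ROUTES = {
    "conversation": Route("gpt-5-chat-latest", "instruction following on longer context"),
    "quick_lookup": Route("gpt-4o-mini", "cheaper and faster"),
}
DEFAULT_TASK = "conversation"


def pick_model(task: str) -> str:
    """Return the model name for a task label, falling back to the default task."""
    return ROUTES.get(task, ROUTES[DEFAULT_TASK]).model
```

Keeping the mapping in one table means a deprecated model can be swapped in a single place, without touching the agent code that calls `pick_model`.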
RealtimeAPI 1.5 on Azure. It works with Japanese. Gemini Flash Live could be okay, but there are some gotchas with tool calling for now.
Interesting - both of those make sense.
Have the instruction following and tool calling been good enough that you haven’t needed to look at newer models?