What’s your current go-to LLM model?

The big labs are constantly releasing newer models and new providers keep showing up :slight_smile:

It’s hard to keep up.

So, post your go-to model(s) and providers. If there are more than 10 responses, I’ll maintain a summary of the results for everyone.

Currently mine is gpt-5-chat-latest, but I just saw OpenAI’s deprecation notice for it.

The model choice will vary a lot depending on the use case and requirements, and as you mentioned, new models keep getting released. Usually there’s a mix of models in the same agent, each handling a different use case, to optimise for cost and latency.
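As a rough illustration of mixing models in one agent, here is a minimal Python sketch of a per-task routing table. The task names, the `ModelChoice` type, the `pick_model` helper, and the task-to-model assignments are all hypothetical placeholders; the model names are just ones mentioned in this thread, not recommendations.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class ModelChoice:
    provider: str
    model: str


# Hypothetical routing table: task names and assignments are assumptions
# made for illustration, not a tested or recommended setup.
ROUTES = {
    "conversation": ModelChoice("openai", "gpt-5-chat-latest"),
    "classification": ModelChoice("openai", "gpt-4o-mini"),
}

# Fallback used when a task has no explicit entry (also an assumption).
DEFAULT = ModelChoice("openai", "gpt-4o-mini")


def pick_model(task: str) -> ModelChoice:
    """Return the model configured for a task, falling back to DEFAULT."""
    return ROUTES.get(task, DEFAULT)
```

Keeping the mapping in one table means a deprecated model can be swapped in a single place without touching the rest of the agent.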

Do you have examples, @Raghu_Udiyar? Like which models to use for which context?

Curious to better understand the logic behind it.

thx

That’s totally fair.

gpt-5-chat-latest has been good for conversations that run 2-10 minutes, with a good balance of instruction following vs. latency, using about 5-15K context.

I use gpt-4o-mini. It’s cheaper and faster.

RealtimeAPI 1.5 on Azure. It works with Japanese. Gemini Flash Live could be okay, but there are some gotchas with tool calling for now.

Interesting - both of those make sense.

Have the instruction following and tool calling been good enough that you haven’t needed to look at newer models?