Whats your current go-to LLM model?

Sameer_Vohra · April 23, 2026, 4:38pm

The big labs are constantly releasing newer models and new providers keep showing up

Its hard to keep up.

So, post your go-to model(s) and providers. If there are > 10 responses, I’ll maintain a summary of the results for everyone.

Sameer_Vohra · April 23, 2026, 4:39pm

Currently its gpt-5-chat-latest but I just saw OpenAI’s deprecation notice for it.

Raghu_Udiyar · April 29, 2026, 7:36am

The model choice will vary a lot depending on the use case and requirements - and as you mentioned, new models keep getting released. And usually there’s a mix of models in the same agent for different use cases to optimise on cost and latency.

Laurent_Bo · April 29, 2026, 12:53pm

do you have examples @Raghu_Udiyar ? like which models to use for which context?

curious to understand better the logic behind

thx

Sameer_Vohra · April 29, 2026, 1:27pm

Thats totally fair.

gpt-5-chat-latest has been good for conversations that are 2-10mins and a good balance of instruction following vs latency using about 5-15K context.

RabbaniF22 · April 30, 2026, 5:19am

I use gpt 4o-mini. cheaper and faster.

Takashi_Futada · April 30, 2026, 6:47am

RealtimeAPI 1.5 on Azure. it works with Japanese. Gemini Flash Live could be okay but some gotchas on tool calling for now.

Sameer_Vohra · April 30, 2026, 1:54pm

Interesting - both of those make sense.

Has the instruction following and tool calling been good enough that you haven’t needed to look at newer models ?

RabbaniF22 · May 5, 2026, 11:45am

I cannot say it does follow instructions all the time. I can say it works 90% of the time. But ours is a complex scenario.
but for simple to medium tasks the model should do just good.
Tool calling is good.

Topic		Replies	Views
Gpt-Realtime 2: Experience so far? Agents agent-development , llm , realtime , openai	2	156	May 9, 2026
Response.prompt_cache_retention Input should be ‘in-memory’ or ‘24h Agents agent-development , openai	2	55	April 21, 2026
Support persistent custom model options in openai.LLM.withTelnyx() / OpenAI-compatible LLMs Agents llm , node-js , telnyx	2	26	May 27, 2026
Gpt-realtime-2 set reasoning_effort to none or very low Agents agent-development , realtime , openai	1	230	May 8, 2026
Gemini 3.1 Flash Live Preview Getting Started	1	39	May 6, 2026

Whats your current go-to LLM model?

Related topics