Pro feature. Reasoning Effort is available on Pro plans.
Screenshot coming. Effort dropdown.
The four levels
| Level | When to use | What it buys you |
|---|---|---|
| Instant | Quick lookups, casual replies | No reasoning. Fastest. Cheapest. |
| Low | Light synthesis, simple multi-step | Brief reasoning trace. Snappy. |
| Medium | Coding, analysis, comparisons | Balanced. The default for serious work. |
| High | Hard math, long planning, complex research | Deepest trace. Slowest. Highest token cost. |
What “instant” means
Instant turns reasoning off, even when the routed model is reasoning-capable. The reply behaves like a regular chat completion. The Reasoning block doesn’t render.
Picking a level
Most people pick Medium and leave it there. Move to High for one-off hard problems, Low when you want speed, Instant when you don’t want any thinking. The footer of every reply tells you which level was used so you can audit cost and latency.Limits
- Reasoning Effort only affects reasoning-capable models. If the router picks a non-reasoning model for your prompt, the level is ignored.
- High runs can take 30+ seconds on the hardest prompts. Pair with browser notifications so you don’t sit and watch.
Related
Live reasoning view
Watch the trace stream.
Reasoning models
Concept overview.