Reasoning Support
Some Emby models can show their step-by-step reasoning process before giving the final answer.This is useful for:
- Debugging & code explanation
- Math & symbolic reasoning
- Logic puzzles
- Complex planning
- Multi-step problem solving
reasoning_effort.
Reasoning-Capable Models
You can find all reasoning models on the/models endpoint.
These usually include:
- Kimi 1.5+ (Emby-hosted)
- DeepSeek-R1 & DeepSeek-V3 R1
- Qwen 3.5 Reasoning
- GLM-4 Reasoning series
- OSS Reasoning Models (gpt-oss-20b, 120b, etc.)
Emby returns only provider-approved reasoning fields.
Reasoning Levels
Addreasoning_effort to your request:
| Level | What it means |
|---|---|
"minimal" | Fastest, lightweight reasoning |
"low" | Good for simple chain-of-thought |
"medium" | Balanced accuracy + cost (recommended) |
"high" | Deep reasoning for complex problems |
Example Request
Example Response
Streaming Reasoning
When using"stream": true, reasoning is streamed before the answer.
Usage Tracking
Every reasoning-enabled call includes:reasoning_tokenscompletion_tokensprompt_tokenstotal_tokens
- Full reasoning text
- Latency
- Token costs
- Model behavior
Auto-Routing Behavior
When using generic models like"deepseek-r1" without specifying version:
Emby will:
- Choose a reasoning-enabled variant
- Apply a safe default reasoning level
- Only route to providers that support reasoning
- Normalize the output format
Model Differences
Not all models expose reasoning equally:Full reasoning shown
DeepSeek R1, Qwen Reasoning, GLM Reasoning, OSS Reasoners
Internal reasoning only
Some vendor models compute reasoning internally but hide chain-of-thought.
Best Practices
Choose the right effort
Use low/medium for most tasks.
High can greatly increase token usage.
High can greatly increase token usage.
Use streaming for UX
Let users see the model’s thought process as it unfolds.
Inspect logs
View full reasoning + token split in the dashboard.
Monitor usage
Reasoning can multiply token usage—plan accordingly.
Error Handling
Ifreasoning_effort is used on a model without reasoning support:
Need help choosing a reasoning model?
We help teams pick the right models for large codebases & refactoring workflows. 📞 Book a call: https://cal.com/absolum/30min💬 WhatsApp us: https://wa.absolum.nl

