Recommended open models

Which open models should I use?

There’s no single right answer! Here’s a curated list based on Fireworks internal testing, community feedback, and external benchmarks. We recommend using it as a starting point, and we will update it regularly as new models emerge.

Model sizes are marked as Small, Medium, or Large
For best latency, use small or medium models. For best quality, use large models or fine-tune medium or small models.
You can explore all models in the Fireworks Model Library

Use Case	Recommended Models
Code generation & reasoning	Kimi K2 0905, Deepseek V3.1, Qwen3 Coder 480B, Deepseek R1 0528 (Large) Qwen2.5-32B-Coder (Medium)
Code completion & bug fixing	Qwen2.5-32B-Coder (Medium) Qwen3 14B, Qwen3 8B (Small)
AI Agents with tool use	Kimi K2 0905, Deepseek V3.1, Qwen3 235B A22B (Large) Qwen 3 Family Models (Large/Medium/Small)
General reasoning & planning	Kimi K2 0905, Deepseek V3.1, Qwen3 235B Thinking 2507, GLM 4.5, Deepseek R1 0528 (Large) Qwen2.5-72B-Instruct, Llama 3.3 70B (Medium)
Long context & summarization	Llama 4 Maverick, Kimi K2 0905 (Large) Llama 4 Scout (Medium)
Vision & document understanding	Llama 4 Maverick (Large) Qwen2.5-VL 32B Instruct, Qwen2.5-VL 72B Instruct, Llama 4 Scout (Medium) Qwen2.5-VL 3-7B (Small)
Low-latency NLU & extraction	Llama 3.1 8B, Llama 3.2 3B, Llama 3.2 1B, Qwen3 8B (Small)

Last updated: Sep 8, 2025

Quickstart

OpenAI compatibility

⌘I

Get Started

Querying models

Dedicated Deployments

Fine-tuning

Integrations

Policies

Administration

Which open models should I use?

Get Started

Querying models

Dedicated Deployments

Fine-tuning

Integrations

Policies

Administration

​Which open models should I use?

Which open models should I use?