Platform costs

Q: How much does Fireworks cost?

Fireworks AI operates on a pay-as-you-go model for all non-Enterprise usage, and new users automatically receive free credits. You pay based on:

  • Per token for serverless inference
  • Per GPU usage time for on-demand deployments
  • Per token of training data for fine-tuning

For customers needing enterprise-grade security and reliability, please reach out to us at inquiries@fireworks.ai to discuss options.

Find out more about our current pricing on our Pricing page.


Fine-tuning fees

Q: Are there extra fees for serving fine-tuned models?

No, deploying fine-tuned models to serverless infrastructure is free. Here’s what you need to know:

What’s free:

  • Deploying fine-tuned models to serverless infrastructure
  • Hosting the models on serverless infrastructure
  • Deploying up to 100 fine-tuned models

What you pay for:

  • Usage costs on a per-token basis when the model is actually used
  • The fine-tuning process itself, if applicable

Note: This differs from on-demand deployments, which include hourly hosting costs.


Additional resources