Cost structure
How Fireworks AI charges for serverless inference, on-demand deployments, and fine-tuning.
Platform costs
Q: How much does Fireworks cost?
Fireworks AI uses a pay-as-you-go model for all non-Enterprise usage, and new users automatically receive free credits. How you are billed depends on the service:
- Serverless inference: billed per token
- On-demand deployments: billed per unit of GPU usage time
- Fine-tuning: billed per token of training data
For customers needing enterprise-grade security and reliability, please reach out to us at inquiries@fireworks.ai to discuss options.
Find out more about our current pricing on our Pricing page.
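The three billing dimensions above can be combined into a rough estimate. The following is a minimal sketch, not an official calculator: the `Rates` class, the `estimate_cost` function, and every number in it are hypothetical placeholders chosen for easy arithmetic. Substitute the actual rates from the Pricing page.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Rates:
    """Hypothetical placeholder rates. These are NOT real Fireworks prices."""
    usd_per_million_inference_tokens: float
    usd_per_gpu_hour: float
    usd_per_million_training_tokens: float


def estimate_cost(rates, inference_tokens=0, gpu_hours=0.0, training_tokens=0):
    """Return a per-service cost breakdown in USD for the given usage."""
    # Serverless inference: billed per token.
    serverless = inference_tokens / 1_000_000 * rates.usd_per_million_inference_tokens
    # On-demand deployments: billed per GPU usage time.
    on_demand = gpu_hours * rates.usd_per_gpu_hour
    # Fine-tuning: billed per token of training data.
    fine_tuning = training_tokens / 1_000_000 * rates.usd_per_million_training_tokens
    return {
        "serverless": serverless,
        "on_demand": on_demand,
        "fine_tuning": fine_tuning,
        "total": serverless + on_demand + fine_tuning,
    }


# Example usage with made-up rates (assumption: replace with real pricing).
example_rates = Rates(
    usd_per_million_inference_tokens=0.5,
    usd_per_gpu_hour=4.0,
    usd_per_million_training_tokens=1.0,
)
bill = estimate_cost(
    example_rates,
    inference_tokens=2_000_000,
    gpu_hours=3,
    training_tokens=5_000_000,
)
print(bill)
```

With these placeholder rates, the breakdown is 1.0 USD for serverless inference, 12.0 USD for on-demand GPU time, and 5.0 USD for fine-tuning. Free credits and any Enterprise terms are not modeled here.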
Fine-tuning fees
Q: Are there extra fees for serving fine-tuned models?
No. Deploying fine-tuned models to serverless infrastructure is free; you pay only when the models are used. Here is the breakdown:
What’s free:
- Deploying fine-tuned models to serverless infrastructure
- Hosting the models on serverless infrastructure
- Deploying up to 100 fine-tuned models
What you pay for:
- Usage costs on a per-token basis when the model is actually used
- The fine-tuning process itself, if applicable
Note: This differs from on-demand deployments, which include hourly hosting costs.
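To illustrate the difference between per-token serverless billing and hourly on-demand billing, the sketch below computes the token volume at which the two would cost the same. The function names and all rates are hypothetical assumptions for illustration only, and the model ignores throughput limits and any other factors that affect a real deployment choice.

```python
def serverless_cost(tokens, usd_per_million_tokens):
    """Per-token cost: nothing is charged while the model sits idle."""
    return tokens / 1_000_000 * usd_per_million_tokens


def on_demand_cost(hours, usd_per_gpu_hour, gpu_count=1):
    """Hourly cost: charged for deployment time regardless of traffic."""
    return hours * usd_per_gpu_hour * gpu_count


def break_even_tokens(hours, usd_per_gpu_hour, usd_per_million_tokens, gpu_count=1):
    """Token volume at which serverless cost equals on-demand cost."""
    hourly_total = on_demand_cost(hours, usd_per_gpu_hour, gpu_count)
    return hourly_total / usd_per_million_tokens * 1_000_000


# Made-up rates (assumption): 4.0 USD per GPU hour, 0.5 USD per million tokens.
hours_deployed = 10
threshold = break_even_tokens(hours_deployed, usd_per_gpu_hour=4.0,
                              usd_per_million_tokens=0.5)
print(f"Break-even over {hours_deployed} hours: {threshold:,.0f} tokens")
```

Under these placeholder numbers, traffic below the threshold is cheaper on serverless, and traffic above it is cheaper on an on-demand deployment.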
Additional resources
- Discord Community: discord.gg/fireworks-ai
- Email Support: inquiries@fireworks.ai
- Documentation: Fireworks.ai docs