Pricing

Are there extra fees for serving fine-tuned models?

There are no extra costs for fine-tuned models outside of the initial tuning cost. Fine-tuned models are served at the same price as base text models. See pricing page for details.

How much does Fireworks cost?

Fireworks AI is pay-as-you-go for all non-Enterprise usage and new users automatically receive free credits. You pay per token for serverless inference, per GPU usage time for on-demand deployments and per token of training data for tuning. For customers that require Enterprise-grade security and reliability, please reach out to us at inquiries@fireworks.ai to discuss options.

Head over to Pricing to see more details.

What are spend limits? How do I increase my limits?

Spend limits (a.k.a Usage limits) restrict how much you can spend on Fireworks every month, it caps the accrued usage of the month. API requests will be rejected when your account’s usage exceeds that limit. This helps prevent customers from getting unexpectedly high bills if their app goes viral.

We enforce different spend limit based on usage tiers and will automatically increase your spend limit quota to the next tier as your historic spend on Fireworks API goes up. The historical spend includes payments for both credits and past invoices. Head over to Pricing to see how much you need to spend in order to move to the next tier.

To increase your usage limit, you can buy prepaid credits at Billing to increase your historic spend. For example, if your account is on tier 1 with 50permonthspendlimit,youcanbuy50 per month spend limit, you can buy 100+ credit and your spend limit will be increased to $500 per month automatically. Note: There could be a propagation delay after credit payment is complete. It’s possible that you may still see “monthly usage exceeded error” persists for a few minutes after topup, please retry again later.

Why am I getting a “monthly usage exceeded error”? Do credits count against spend limits?

Yes credits count against spend or usage limits. For example, on tier 1, you can purchase 60increditsbutstillhaveyourusagestoppedafter60 in credits but still have your usage stopped after 50 in usage. You’ll need to purchase a larger volume of prepaid credits to advance tiers. If you exceed your account’s usage limit, API requests will be rejected. Visit Billing to add your payment method and monitor your usage and invoices.

If you do not have a payment method on file, your account will be suspended after your credits are depleted. Failure to pay a past invoice may also result in account suspension. Your usage limit will be set to $0 per month in both cases.

Are there discounts for bulk usage?

We offer discounts for bulk or pre-paid purchases only for on-demand deployments, not for serverless GPUs. Please contact raythai@fireworks.ai if you’re interested.