Skip to main content
Fireworks AI Docs home page
Documentation
API & SDK Reference
CLI Reference
Resources
Community
Status
Dashboard
Dashboard
Search...
Navigation
Deployment & Infrastructure
Are there any quotas for serverless?
Search...
⌘K
Reference
Concepts
Changelog
Examples
Featured
Fine-tuning
Reinforcement Learning
FAQ
Account & Access
Billing & Pricing
Deployment & Infrastructure
Serverless SLAs
Serverless quotas
Model removal notice
Serverless timeout issues
System scaling
Auto scaling support
Throughput capacity
Request handling factors
Autoscaling cost impact
On-demand rate limits
On-demand billing
GPU deployment billing
Models & Inference
Deployment & Infrastructure
Are there any quotas for serverless?
Copy page
Copy page
Yes, serverless deployments have rate limits and quotas.
For detailed information about serverless quotas, rate limits, and daily token limits, see our
Rate Limits & Quotas guide
.
Was this page helpful?
Yes
No
Are there SLAs for serverless?
Previous
Do you provide notice before removing model availability?
Next
⌘I