Do you support Auto Scaling?
Yes, our system supports auto scaling with the following features:
- Scaling down to zero for resource efficiency
- Controllable scale-up and scale-down velocity
- Custom scaling rules and thresholds to match your specific needs
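
To make the three features above concrete, here is a minimal, illustrative sketch of how an autoscaling policy of this kind could decide on a replica count. It is not our actual implementation or API: `ScalingPolicy`, `next_replicas`, `simulate`, and every field name and default value are hypothetical, and "load" and "tick" are abstract units chosen only for the example.

```python
import math
from dataclasses import dataclass


@dataclass
class ScalingPolicy:
    """Hypothetical policy; names and defaults are assumptions for illustration."""
    min_replicas: int = 0                  # 0 allows scaling down to zero
    max_replicas: int = 4
    target_load_per_replica: float = 10.0  # custom threshold
    max_step_up: int = 2                   # scale-up velocity (replicas per tick)
    max_step_down: int = 1                 # scale-down velocity (replicas per tick)
    idle_ticks_to_zero: int = 3            # idle time required before reaching zero


def next_replicas(policy: ScalingPolicy, current: int, load: float, idle_ticks: int) -> int:
    """Return the replica count for the next tick."""
    # Threshold rule: enough replicas to keep each one at or under the target load.
    desired = math.ceil(load / policy.target_load_per_replica) if load > 0 else 0
    desired = max(policy.min_replicas, min(policy.max_replicas, desired))

    # Scale to zero only after the deployment has been idle long enough.
    if desired == 0 and current > 0 and idle_ticks < policy.idle_ticks_to_zero:
        desired = 1

    # Velocity limits: move toward the desired count by a bounded step.
    if desired > current:
        return min(desired, current + policy.max_step_up)
    return max(desired, current - policy.max_step_down)


def simulate(policy: ScalingPolicy, loads: list[float], start: int = 0) -> list[int]:
    """Apply the policy to a sequence of per-tick loads and record replica counts."""
    current, idle, history = start, 0, []
    for load in loads:
        idle = idle + 1 if load <= 0 else 0
        current = next_replicas(policy, current, load, idle)
        history.append(current)
    return history


if __name__ == "__main__":
    # A burst of traffic followed by an idle period.
    print(simulate(ScalingPolicy(), [35, 35, 0, 0, 0, 0]))  # [2, 4, 3, 2, 1, 0]
```

In this sketch, the replica count climbs by at most two per tick during the burst, steps down by one per tick once traffic stops, and reaches zero only after the idle threshold is met. Setting `min_replicas=1` in the same sketch keeps one replica running at all times.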