What’s the supported throughput? - Fireworks AI Docs

Throughput capacity typically depends on several factors:

Deployment type (serverless or on-demand)
Traffic patterns and request patterns
Hardware configuration
Model size and complexity

Do you support Auto Scaling?

What factors affect the number of simultaneous requests that can be handled?

⌘I