Throughput capacity typically depends on several factors:

  • Deployment type (serverless or on-demand)
  • Traffic and request patterns
  • Hardware configuration
  • Model size and complexity