• Scaling from 0: No minimum cost when scaled to zero
  • Scaling up: Each new replica adds to your total cost proportionally. For example:
    • Scaling from 1 to 2 replicas doubles your GPU costs
    • If each replica uses multiple GPUs, costs scale accordingly (e.g., scaling from 1 to 2 replicas with 2 GPUs each means paying for 4 GPUs total)

For current pricing details, please visit our pricing page.