Deployment costs

Q: Are there costs associated with deploying fine-tuned models to serverless infrastructure?

No. Deploying and hosting fine-tuned models on serverless infrastructure is free; charges apply only to usage and to fine-tuning itself.

What’s free:

  • Deploying fine-tuned models to serverless infrastructure
  • Hosting models on serverless infrastructure
  • Deploying up to 100 fine-tuned models

What you pay for:

  • Per-token usage costs, charged only when the model actually serves requests
  • The fine-tuning process itself, if applicable

Note: This differs from on-demand deployments, which include hourly hosting costs.
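
To make the difference between the two billing models concrete, here is a minimal sketch in Python. The prices, the traffic volume, and the function names are hypothetical and chosen only for illustration; they are not actual rates. The sketch also assumes on-demand billing is purely hourly, which may not match the real pricing structure.

```python
def serverless_cost(tokens: int, price_per_million_tokens: float) -> float:
    """Per-token billing: cost scales with usage, and idle time costs nothing."""
    return tokens / 1_000_000 * price_per_million_tokens


def on_demand_cost(hours: float, price_per_hour: float) -> float:
    """Hourly billing: cost scales with time deployed, regardless of traffic."""
    return hours * price_per_hour


# Hypothetical prices for illustration only; check the current pricing page.
PRICE_PER_MILLION_TOKENS = 0.50  # USD, assumed
PRICE_PER_HOUR = 3.00            # USD, assumed

monthly_tokens = 20_000_000      # assumed traffic for one month
hours_in_month = 24 * 30

serverless = serverless_cost(monthly_tokens, PRICE_PER_MILLION_TOKENS)
on_demand = on_demand_cost(hours_in_month, PRICE_PER_HOUR)
print(f"serverless: ${serverless:.2f}, on-demand: ${on_demand:.2f}")
```

Under these assumed numbers, a model that receives no traffic costs nothing on serverless, while an on-demand deployment accrues hosting charges for every hour it stays up.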


Model availability

Q: Do you provide notice before removing model availability?

Yes, we provide advance notice before removing models from the serverless infrastructure:

  • Minimum 2 weeks’ notice before model removal
  • Longer notice periods and extended deprecation timelines may be provided for higher-usage models

Best practices:

  1. Monitor announcements regularly.
  2. Prepare a migration plan in advance.
  3. Test alternative models to ensure continuity.
  4. Keep your contact information updated for timely notifications.
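
As a planning aid, the 2-week minimum notice can be turned into a simple deadline check. The following is a minimal sketch; the function names and the example dates are hypothetical, and the actual removal date in any announcement takes precedence over this lower bound.

```python
from datetime import date, timedelta

# Minimum notice period stated in the answer above.
MIN_NOTICE = timedelta(weeks=2)


def earliest_removal(announced: date) -> date:
    """Earliest date a model could be removed, given the announcement date."""
    return announced + MIN_NOTICE


def days_left(announced: date, today: date) -> int:
    """Days remaining to finish migration before the earliest removal date."""
    return (earliest_removal(announced) - today).days


# Hypothetical dates for illustration.
announced = date(2025, 3, 1)
today = date(2025, 3, 10)
print(earliest_removal(announced), days_left(announced, today))
```

A negative result from `days_left` means the minimum notice window has already passed, so the migration plan should be executed immediately.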

Additional resources