Release Notes and Updates
Changelog
2025-05-19
What’s new
🚀 Easier & faster LoRA fine-tune deployments on Fireworks
You can now deploy a LoRA fine-tune with a single command and get speeds that approximately match the base model:
Earlier, this involved two distinct steps, and the resultant deployment was slower than the base model:
- Create a deployment using
firectl create deployment "accounts/fireworks/models/<MODEL_ID of base model>" --enable-addons
- Then deploy the addon to the deployment:
firectl load-lora <MODEL_ID> --deployment <DEPLOYMENT_ID>
Docs: https://docs.fireworks.ai/models/deploying#deploying-to-on-demand
This change is for dedicated deployments with a single LoRA. You can still deploy multiple LoRAs on a deployment or deploy LoRA(s) on some Serverless models as described in the docs.