Single-LoRA deployment with live merge
Deploy a LoRA fine-tuned model using live merge for simplified deployment and optimal performance
Deploy your LoRA fine-tuned model with a single command that delivers performance matching the base model. This streamlined approach, called live merge, eliminates the previous two-step process and provides better performance compared to multi-LoRA deployments.
Quick deployment
Deploy your LoRA fine-tuned model with one simple command:
Your deployment will be ready to use once it completes, with performance that matches the base model.
Alternative deployment method
This two-step method is the standard approach for multi-LoRA deployments where multiple LoRA models share the same base model. While it can also be used for single LoRA deployments, it provides slower performance compared to live merge and is not recommended for single LoRA use cases.
You can also deploy single LoRA models using a two-step process:
Create base model deployment
Deploy the base model with addons enabled:
Load LoRA addon
Once the deployment is ready, load the LoRA model onto the deployment:
When to use live merge
Use live merge deployment when you:
- Have a single fine-tuned model to serve
- Need optimal performance that matches the base model
- Want the simplest deployment process
- Don’t require sharing a base model across multiple LoRA models
The live merge deployment method is designed for dedicated deployments with a single LoRA model. For multiple LoRA models sharing the same base model, consider using multi-LoRA deployment.