What this is
Service-mode RLOR jobs expose trainer endpoints consumed by custom training loops.Workflow
- Create job with
serviceMode=true. - Wait for readiness and capture
direct_route_handle. - Resume and delete jobs as experiments evolve.
End-to-end examples
Create and inspect RLOR job
Operational guidance
- Service-mode trainer jobs currently support full-parameter tuning only. Set
lora_rank=0whenserviceMode=true(lora_rank>0is rejected).