What this is
Managed jobs let the platform handle the training lifecycle — scheduling, execution, checkpointing, and output model materialization. Use them when your objective fits a supported method and your priority is reliability and operational simplicity.Managed job types
| Resource | Objective | API |
|---|---|---|
| Supervised Fine-Tuning (SFT) | Cross-entropy on instruction/response pairs | fw.supervised_fine_tuning_jobs.* |
| DPO | Direct preference optimization on chosen/rejected pairs | fw.dpo_jobs.* |
| Managed RFT | Reinforcement fine-tuning with built-in RL losses | fw.reinforcement_fine_tuning_jobs.* |
| Service-mode RLOR | Custom objectives via Training SDK or Cookbook loops | TrainerJobManager + TrainerJobConfig (SDK) or cookbook recipes |
Listing jobs
Creating managed jobs
SFT (flat keyword arguments)
DPO (flat keyword arguments)
Managed RFT (with training_config and loss_config)
Available loss_config methods
| Method | Description |
|---|---|
GRPO | Group Relative Policy Optimization (default for RFT) |
DAPO | Dynamic Advantage Policy Optimization |
DPO | Direct Preference Optimization (default for DPO API) |
ORPO | Odds Ratio Preference Optimization |
GSPO_TOKEN | Token-level GSPO |
W&B integration
All managed job types supportwandb_config for native Weights & Biases logging:
When to switch to service-mode loops
Move from managed jobs to service-mode RLOR loops when you need:- Custom loss functions (e.g. hybrid GRPO + DPO, custom reward shaping)
- Full-parameter tuning with per-step metrics
- Inference-in-the-loop evaluation via hotloading during training
- Algorithm research beyond the built-in methods
fireworks.training.sdk.TrainerJobManager + TrainerJobConfig (see Training SDK Overview).
Operational guidance
- Use managed jobs when your objective fits supported methods and you want minimal code.
- Monitor job state by polling
fw.<job_type>.get(...)until the job reaches a terminal state. - Cancel stuck jobs with
fw.<job_type>.cancel(...)to release resources. - Delete completed jobs when you no longer need them.