Create Supervised Fine-tuning Job
Documentation Index
Fetch the complete documentation index at: https://docs.fireworks.ai/llms.txt
Use this file to discover all available pages before exploring further.
Authorizations
Bearer authentication using your Fireworks API key. Format: Bearer <API_KEY>
Path Parameters
The Account Id
Query Parameters
ID of the supervised fine-tuning job, a random UUID will be generated if not specified.
Body
The name of the dataset used for training.
The AWS configuration for S3 dataset access.
The Azure configuration for Azure Blob Storage dataset access.
The model ID to be assigned to the resulting fine-tuned model. If not specified, the job ID will be used.
The name of the base model to be fine-tuned Only one of 'base_model' or 'warm_start_from' should be specified.
The PEFT addon model in Fireworks format to be fine-tuned from Only one of 'base_model' or 'warm_start_from' should be specified.
Whether to stop training early if the validation loss does not improve.
The number of epochs to train for.
The learning rate used for training.
The maximum context length to use with the model.
The rank of the LoRA layers.
The Weights & Biases team/user account for logging training progress.
The name of a separate dataset to use for evaluation.
Whether to run the fine-tuning job in turbo mode.
Whether to auto-carve the dataset for eval.
Deprecated: multi-node scheduling is now handled by the cookbook orchestrator in V2 workflows. This field is ignored for V2 jobs and will be removed in a future release.
Deprecated: MTP is not supported in V2 training. These fields are retained for V1 Helm-based SFT backward compatibility only.
Deprecated: see mtp_enabled.
Deprecated: see mtp_enabled.
The number of samples per gradient batch.
Weight decay (L2 regularization) for optimizer.
Scheduling purpose for this job.
PURPOSE_UNSPECIFIED, PURPOSE_PILOT Response
A successful response.
The name of the dataset used for training.
The AWS configuration for S3 dataset access.
The Azure configuration for Azure Blob Storage dataset access.
JobState represents the state an asynchronous job can be in.
- JOB_STATE_PAUSED: Job is paused, typically due to account suspension or manual intervention.
- JOB_STATE_DELETED: Job has been deleted.
JOB_STATE_UNSPECIFIED, JOB_STATE_CREATING, JOB_STATE_RUNNING, JOB_STATE_COMPLETED, JOB_STATE_FAILED, JOB_STATE_CANCELLED, JOB_STATE_DELETING, JOB_STATE_WRITING_RESULTS, JOB_STATE_VALIDATING, JOB_STATE_DELETING_CLEANING_UP, JOB_STATE_PENDING, JOB_STATE_EXPIRED, JOB_STATE_RE_QUEUEING, JOB_STATE_CREATING_INPUT_DATASET, JOB_STATE_IDLE, JOB_STATE_CANCELLING, JOB_STATE_EARLY_STOPPED, JOB_STATE_PAUSED, JOB_STATE_DELETED The email address of the user who initiated this fine-tuning job.
The model ID to be assigned to the resulting fine-tuned model. If not specified, the job ID will be used.
The name of the base model to be fine-tuned Only one of 'base_model' or 'warm_start_from' should be specified.
The PEFT addon model in Fireworks format to be fine-tuned from Only one of 'base_model' or 'warm_start_from' should be specified.
Whether to stop training early if the validation loss does not improve.
The number of epochs to train for.
The learning rate used for training.
The maximum context length to use with the model.
The rank of the LoRA layers.
The Weights & Biases team/user account for logging training progress.
The name of a separate dataset to use for evaluation.
Whether to run the fine-tuning job in turbo mode.
Whether to auto-carve the dataset for eval.
The update time for the supervised fine-tuning job.
Deprecated: multi-node scheduling is now handled by the cookbook orchestrator in V2 workflows. This field is ignored for V2 jobs and will be removed in a future release.
Deprecated: MTP is not supported in V2 training. These fields are retained for V1 Helm-based SFT backward compatibility only.
Deprecated: see mtp_enabled.
Deprecated: see mtp_enabled.
Job progress.
The signed URL for the trainer logs file (stdout/stderr). Only populated if the account has trainer log reading enabled.
The number of samples per gradient batch.
The estimated cost of the job.
Weight decay (L2 regularization) for optimizer.
Scheduling purpose for this job.
PURPOSE_UNSPECIFIED, PURPOSE_PILOT