Secure Training (BYOB)

Fireworks enables secure model fine-tuning while maintaining customer control over sensitive components and data. Use your own cloud storage, keep reward functions proprietary, and ensure training data never persists on our platform beyond active workflows.

Secure reinforcement fine-tuning (RFT)

Use reinforcement fine-tuning while keeping sensitive components and data under your control. Follow these steps to run secure RFT end to end using your own storage and reward pipeline.

Configure storage (BYOB)

Point Fireworks to your storage so you retain governance and apply your own compliance controls.

Datasets: GCS Bucket Integration (AWS S3 coming soon)
Models (optional): External AWS S3 Bucket Integration

Grant least-privilege IAM access to only the bucket and path prefixes needed for training. Use server-side encryption and your KMS policies where required.
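
One way to scope access to a path prefix on GCS is an IAM Condition on the role binding. The sketch below builds such a binding as a plain dictionary. The helper name, bucket name, prefix, and member placeholder are hypothetical, and it assumes the bucket uses uniform bucket-level access, which IAM Conditions require. Object listing is evaluated against the bucket rather than individual objects, so check the expression against the IAM Conditions documentation before relying on it.

```python
import json


def prefix_scoped_binding(bucket: str, prefix: str, member: str,
                          role: str = "roles/storage.objectViewer") -> dict:
    """Build an IAM binding limited to objects under `prefix` in `bucket`."""
    # IAM Conditions name GCS objects as projects/_/buckets/<bucket>/objects/<name>.
    resource_prefix = f"projects/_/buckets/{bucket}/objects/{prefix}"
    return {
        "role": role,
        "members": [member],
        "condition": {
            "title": "training-prefix-only",
            "description": "Limit access to the training data prefix",
            "expression": f'resource.name.startsWith("{resource_prefix}")',
        },
    }


# Hypothetical bucket, prefix, and member; substitute your own values.
binding = prefix_scoped_binding(
    "my-training-bucket",
    "rft/datasets/",
    "serviceAccount:<FIREWORKS_SERVICE_ACCOUNT>",
)
print(json.dumps(binding, indent=2))
```

The resulting binding can be merged into the bucket policy you manage with your usual IAM tooling.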

Prepare your reward pipeline and rollouts

Keep your reward functions, rollout servers, and training metrics under your control. Generate rewards from your environment and write them to examples in your dataset (or export a dataset that contains per-example rewards).

Reward functions and reward models remain proprietary and never need to be shared
Rollouts and evaluation infrastructure run in your environment
Model checkpoints can be registered to your storage registry if desired

Create a dataset that includes rewards

Create or point a Dataset at your BYOB storage. Ensure each example contains the information required by your reward pipeline (for example, prompts, outputs/trajectories, and numeric rewards).
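
As a rough illustration of the check described above, the following sketch validates one example before it is written to the bucket. The field names `prompt`, `output`, and `score` are assumptions chosen for illustration, not a schema required by Fireworks; adjust them to whatever your reward pipeline produces.

```python
import math

# Hypothetical schema: adapt the field names to your own dataset.
REQUIRED_FIELDS = ("prompt", "output", "score")


def validate_example(example: dict) -> list[str]:
    """Return a list of problems found in one example (empty if it looks fine)."""
    problems = [f"missing field: {name}" for name in REQUIRED_FIELDS
                if name not in example]
    if "score" in example:
        score = example["score"]
        # bool is a subclass of int, so reject it explicitly.
        if isinstance(score, bool) or not isinstance(score, (int, float)):
            problems.append("score is not numeric")
        elif not math.isfinite(score):
            problems.append("score is not finite")
    return problems


print(validate_example({"prompt": "2+2?", "output": "4", "score": 1.0}))
print(validate_example({"prompt": "2+2?", "score": float("nan")}))
```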

You can reuse existing supervised data by attaching reward signals produced by your pipeline, or export a fresh dataset into your bucket for consumption by RFT.
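
A minimal sketch of attaching reward signals to existing supervised examples and serializing them as JSONL might look like this. The `prompt`/`output` fields and the toy reward function are placeholders for your own data and reward pipeline; the reward field is named `score` only to line up with the `reward_weights=["score"]` value used in the SDK example in this guide.

```python
import json
from typing import Callable, Iterable


def attach_rewards(examples: Iterable[dict],
                   reward_fn: Callable[[dict], float],
                   reward_field: str = "score") -> list[dict]:
    """Return copies of `examples` with a numeric reward added to each."""
    rewarded = []
    for example in examples:
        row = dict(example)  # shallow copy so the input is left untouched
        row[reward_field] = float(reward_fn(example))
        rewarded.append(row)
    return rewarded


def to_jsonl(rows: Iterable[dict]) -> str:
    """Serialize rows as JSONL: one JSON object per line."""
    return "".join(json.dumps(row, ensure_ascii=False) + "\n" for row in rows)


def short_answer_reward(example: dict) -> float:
    """Placeholder reward: prefers outputs of at most 20 characters."""
    return 1.0 if len(example["output"]) <= 20 else 0.0


supervised = [
    {"prompt": "2+2?", "output": "4"},
    {"prompt": "Capital of France?",
     "output": "Paris, which is the capital city of France."},
]
rows = attach_rewards(supervised, short_answer_reward)
jsonl_text = to_jsonl(rows)
print(jsonl_text, end="")
```

The resulting text can be written to a file and uploaded to your bucket with your usual tooling; the reward function itself never leaves your environment.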

Run a reinforcement fine-tuning step from Python

Use the Python SDK to create a reinforcement fine-tuning step that reads from your BYOB dataset and produces a new checkpoint.

import time

from fireworks import Fireworks

client = Fireworks()

# Reuse one job ID for both the create call and the polling loop
JOB_ID = "my-rft-job-001"

# Create a reinforcement fine-tuning step
step = client.reinforcement_fine_tuning_steps.create(
    rlor_trainer_job_id=JOB_ID,
    display_name="Secure RFT Training Step",
    training_config={
        "base_model": "accounts/fireworks/models/{BASE_MODEL}",
        "learning_rate": 1e-5,
        "lora_rank": 8,
        "max_context_length": 4096,
        "batch_size": 32768,
    },
    dataset="accounts/{ACCOUNT}/datasets/{DATASET_NAME}",  # Your BYOB dataset with rewards
    output_model="accounts/{ACCOUNT}/models/my-improved-model-v1",
    reward_weights=["score"],  # Field name for rewards in your dataset
)

# Poll for completion, giving up after one hour
timeout = 3600  # seconds
start_time = time.monotonic()  # monotonic clock is unaffected by system clock changes
while True:
    if time.monotonic() - start_time > timeout:
        raise TimeoutError(f"Job polling timed out after {timeout} seconds")
    job = client.reinforcement_fine_tuning_steps.get(
        rlor_trainer_job_id=JOB_ID
    )
    if job.state == "JOB_STATE_COMPLETED":
        print("Training complete!")
        break
    elif job.state in ("JOB_STATE_FAILED", "JOB_STATE_CANCELLED"):
        raise RuntimeError(f"Training did not complete: {job.state}")
    time.sleep(10)

See the Create Reinforcement Fine-tuning Step API reference for full parameters and options.

For a complete iterative RL workflow example using the Python SDK, including rollout generation, reward computation, and hot-reloading LoRA adapters, see the iterative RL workflow example on GitHub.

When continuing from a LoRA checkpoint, training parameters such as lora_rank, learning_rate, max_context_length, and batch_size must match the original LoRA training.
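
A small pre-flight check can catch mismatches before a job is submitted. The sketch below compares the four parameters named above between two config dictionaries shaped like the `training_config` in the SDK example; the helper name is hypothetical and the check runs entirely on your side.

```python
# Parameters that must match the original LoRA training, per the note above.
MUST_MATCH = ("lora_rank", "learning_rate", "max_context_length", "batch_size")


def mismatched_params(original: dict, continued: dict) -> dict:
    """Map each mismatched parameter to its (original, continued) values."""
    return {
        name: (original.get(name), continued.get(name))
        for name in MUST_MATCH
        if original.get(name) != continued.get(name)
    }


original_config = {"learning_rate": 1e-5, "lora_rank": 8,
                   "max_context_length": 4096, "batch_size": 32768}
continued_config = {**original_config, "lora_rank": 16}

diff = mismatched_params(original_config, continued_config)
if diff:
    print(f"Config does not match the original LoRA training: {diff}")
```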

Verify outputs and enforce controls

Validate that the new checkpoint functions as expected in your environment
If exporting models to your storage, apply your registry policies and access reviews
Review audit logs and rotate any temporary credentials used for the run

Do not store long-lived credentials in code. Use short-lived tokens, workload identity, or scoped service accounts when granting Fireworks access to your buckets.

You now have an end-to-end secure RFT workflow with BYOB datasets, proprietary reward pipelines, and isolated training jobs that generate new checkpoints.

GCS Bucket Integration

Use external Google Cloud Storage (GCS) buckets for fine-tuning while keeping your data private. Fireworks creates proxy datasets that reference your external buckets. Data is accessed only during fine-tuning, within a secure, isolated cluster.

Your data never leaves your GCS bucket except during fine-tuning.

Required Permissions

You need to grant access to three service accounts:

Fireworks Control Plane

Account: [email protected]
Required role: Custom role with storage.buckets.getIamPolicy permission

gcloud storage buckets add-iam-policy-binding <YOUR_BUCKET> \
  --member=serviceAccount:[email protected] \
  --role=projects/<YOUR_PROJECT>/roles/<YOUR_CUSTOM_ROLE>

Fireworks uses this service account to retrieve the IAM policy set on the bucket so that it can verify bucket ownership and access during dataset creation.

Inference Service Account

Account: [email protected]
Required role: Storage Object Viewer or Storage Object Admin

gcloud storage buckets add-iam-policy-binding <YOUR_BUCKET> \
  --member=serviceAccount:[email protected] \
  --role=roles/storage.objectViewer

Fireworks uses this service account to access the files in the bucket.

Your Company’s Fireworks Service Account

Account: Your company’s Fireworks account registration email
Required role: Storage Object Viewer or Storage Object Admin

gcloud storage buckets add-iam-policy-binding <YOUR_BUCKET> \
  --member=serviceAccount:<YOUR_COMPANY_FW_ACCOUNT_EMAIL> \
  --role=roles/storage.objectViewer

This binding validates that your account actually has access to the bucket that the dataset references. The email associated with your account (the account itself, not an individual user; retrieve it with firectl get account) must have at least read access to the bucket in the bucket's IAM policy.
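
Before creating a dataset, you can confirm the bindings are in place by inspecting the bucket's IAM policy yourself. The sketch below assumes the policy has been exported as JSON (for example with `gcloud storage buckets get-iam-policy gs://<YOUR_BUCKET> --format=json`) and checks whether a member holds one of the two roles named in this guide. The helper name and the sample member email are hypothetical, and other roles may also grant read access.

```python
# Roles named in this guide that grant read access to objects.
READ_ROLES = {"roles/storage.objectViewer", "roles/storage.objectAdmin"}


def has_read_access(policy: dict, member: str) -> bool:
    """Return True if `member` appears in a binding for one of READ_ROLES."""
    for binding in policy.get("bindings", []):
        if binding.get("role") in READ_ROLES and member in binding.get("members", []):
            return True
    return False


# Hypothetical policy document in the shape returned by get-iam-policy.
sample_policy = {
    "bindings": [
        {"role": "roles/storage.objectViewer",
         "members": ["serviceAccount:fw-account@example.com"]},
        {"role": "roles/storage.legacyBucketOwner",
         "members": ["projectOwner:my-project"]},
    ]
}

print(has_read_access(sample_policy, "serviceAccount:fw-account@example.com"))
print(has_read_access(sample_policy, "serviceAccount:someone-else@example.com"))
```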

Usage Example

Create a Proxy Dataset

Create a dataset that references your external GCS bucket:

firectl create dataset {DATASET_NAME} --external-url gs://bucket-name/object-name

Ensure the gs:// URL points directly to the JSONL file. If the file is in a folder, make sure the folder contains only the intended file.
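
A quick local check can catch malformed URLs before the dataset is created. The sketch below parses a `gs://` URL and rejects anything that does not name a single object. Requiring a `.jsonl` extension is an assumption made for illustration; the helper name and sample URL are hypothetical.

```python
from urllib.parse import urlparse


def check_external_url(url: str) -> tuple[str, str]:
    """Split a gs:// URL into (bucket, object_name), raising on obvious mistakes."""
    parsed = urlparse(url)
    if parsed.scheme != "gs":
        raise ValueError(f"expected a gs:// URL, got: {url}")
    bucket = parsed.netloc
    object_name = parsed.path.lstrip("/")
    if not bucket or not object_name:
        raise ValueError(f"URL must include both a bucket and an object: {url}")
    # Assumption: the dataset file carries a .jsonl extension.
    if not object_name.endswith(".jsonl"):
        raise ValueError(f"URL should point directly at a JSONL file: {url}")
    return bucket, object_name


bucket, object_name = check_external_url("gs://my-training-bucket/rft/train.jsonl")
print(bucket, object_name)
```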

Start Fine-tuning

Use the proxy dataset to create a fine-tuning job:

firectl create sftj \
  --dataset "accounts/{ACCOUNT}/datasets/{DATASET_NAME}" \
  --base-model "accounts/fireworks/models/{MODEL}" \
  --output-model {TRAINED_MODEL_NAME}

For additional options, run: firectl create sftj -h
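
If you script these two steps, building the argument lists in Python avoids shell-quoting mistakes. The sketch below only constructs and prints the commands, using the same `firectl` subcommands and flags shown above. The helper names and sample values are hypothetical.

```python
import shlex


def create_dataset_cmd(dataset_name: str, external_url: str) -> list[str]:
    """Argument list for creating a proxy dataset from an external URL."""
    return ["firectl", "create", "dataset", dataset_name,
            "--external-url", external_url]


def create_sftj_cmd(account: str, dataset_name: str,
                    base_model: str, output_model: str) -> list[str]:
    """Argument list for starting a fine-tuning job on the proxy dataset."""
    return ["firectl", "create", "sftj",
            "--dataset", f"accounts/{account}/datasets/{dataset_name}",
            "--base-model", f"accounts/fireworks/models/{base_model}",
            "--output-model", output_model]


# Hypothetical values; substitute your own account, dataset, and model names.
commands = [
    create_dataset_cmd("my-dataset", "gs://my-training-bucket/rft/train.jsonl"),
    create_sftj_cmd("my-account", "my-dataset", "my-base-model", "my-tuned-model"),
]
for cmd in commands:
    print(shlex.join(cmd))
```

Each list can be passed to `subprocess.run(cmd, check=True)` once the printed commands look right.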

Key Benefits

Data Privacy

Your data never leaves your GCS bucket except during fine-tuning

Security

Access is limited to isolated fine-tuning clusters

Simplicity

Reference external data without copying or moving files

Related Resources

Data Security Overview

Learn about our comprehensive security measures

Reinforcement Fine Tuning

Full guide to reinforcement fine-tuning
