Prerequisites
Before launching an RFT job, ensure you have:
Dataset prepared and uploaded
Your dataset must be in JSONL format with prompts (system and user messages). Each line represents one training example. Upload it via the CLI, as shown below, or via the Fireworks dashboard.
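Each row is a JSON object with a `messages` array. A minimal example row (the content shown is illustrative):

```json
{"messages": [{"role": "system", "content": "You are a helpful math tutor."}, {"role": "user", "content": "What is 17 * 24?"}]}
```

If you use the `firectl` CLI, the upload looks roughly like the following; the dataset name and file path are placeholders, and the exact arguments may vary by CLI version, so check `firectl create dataset --help`:

```bash
firectl create dataset my-rft-dataset path/to/dataset.jsonl
```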
Evaluator created
Your reward function must be tested and uploaded. For local evaluators, upload by running your pytest test, as shown below; the test automatically registers your evaluator with Fireworks. For remote evaluators, deploy your HTTP service first.
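For example, assuming your evaluator test lives in a file named `test_my_evaluator.py` (a placeholder name):

```bash
# Running the evaluator's test suite registers the evaluator with Fireworks,
# provided your Fireworks credentials are configured in the environment.
pytest test_my_evaluator.py -v
```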
Fireworks API key configured
Set your API key as an environment variable, as shown below, or store it in a .env file in your project directory.
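For example, in your shell (the key value is a placeholder):

```bash
export FIREWORKS_API_KEY="your-api-key-here"
```

Or in a .env file:

```bash
FIREWORKS_API_KEY=your-api-key-here
```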
Base model selected
Choose a base model that supports fine-tuning. Popular options:
- `accounts/fireworks/models/llama-v3p1-8b-instruct` - Good balance of quality and speed
- `accounts/fireworks/models/qwen3-0p6b` - Fast training for experimentation
- `accounts/fireworks/models/llama-v3p1-70b-instruct` - Best quality, slower training
Job validation
Before starting training, Fireworks validates your configuration:
Dataset format validation
- ✅ Valid JSONL format
- ✅ Each line has a `messages` array
- ✅ Messages have `role` and `content` fields
- ✅ File size within limits
- ❌ Missing fields → error with specific line numbers
- ❌ Invalid JSON → syntax error details
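To catch these issues before uploading, you can run a small local pre-check that mirrors the same rules. This is a sketch of a local check, not the Fireworks validator itself:

```python
import json

def precheck_jsonl(path: str) -> None:
    """Report dataset rows that would fail the format checks above."""
    with open(path, encoding="utf-8") as f:
        for lineno, line in enumerate(f, start=1):
            if not line.strip():
                continue
            try:
                row = json.loads(line)
            except json.JSONDecodeError as exc:
                print(f"line {lineno}: invalid JSON ({exc})")
                continue
            messages = row.get("messages")
            if not isinstance(messages, list):
                print(f"line {lineno}: missing 'messages' array")
                continue
            for msg in messages:
                if "role" not in msg or "content" not in msg:
                    print(f"line {lineno}: message missing 'role' or 'content'")

precheck_jsonl("dataset.jsonl")
```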
Evaluator validation
- ✅ Evaluator code syntax is valid
- ✅ Required dependencies are available
- ✅ Entry point function exists
- ✅ Test runs completed successfully
- ❌ Import errors → missing dependencies
- ❌ Syntax errors → code issues
Resource availability
- ✅ Sufficient GPU quota
- ✅ Base model supports fine-tuning
- ✅ Account has RFT permissions
- ❌ Insufficient quota → request increase
- ❌ Invalid model → choose different base model
Parameter validation
- ✅ Parameters within valid ranges
- ✅ Compatible parameter combinations
- ❌ Invalid ranges → error with allowed values
- ❌ Conflicting options → resolution guidance
Common errors and fixes
Invalid dataset format
Error: `Dataset validation failed: invalid JSON on line 42`
Fix:
- Open your JSONL file
- Check line 42 for JSON syntax errors
- Common issues: missing quotes, trailing commas, unescaped characters
- Validate JSON at jsonlint.com
Error: `Missing required field 'messages'`
Fix: Each dataset row must have a `messages` array.
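A minimal valid row (the content shown is illustrative):

```json
{"messages": [{"role": "system", "content": "You are a helpful assistant."}, {"role": "user", "content": "Summarize the following paragraph."}]}
```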
Evaluator not found
Error: `Evaluator 'my-evaluator' not found in account`
Fix:
- Upload your evaluator first, as shown after this list
- Or, if using the UI, specify the evaluator ID:
  - Check the Evaluators dashboard
  - Copy the exact evaluator ID
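For a local evaluator, re-running its pytest test file uploads and registers it; the file name below is a placeholder:

```bash
pytest test_my_evaluator.py -v
```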
Insufficient quota
Error: `Insufficient GPU quota for this job`
Fix:
- Check your current quota at Account Settings
- Request a quota increase through the dashboard
- Or choose a smaller base model to reduce GPU requirements
Parameter out of range
Error: `Learning rate 1e-2 outside valid range [1e-5, 5e-4]`
Fix: Adjust the parameter to be within the allowed range, as in the example below. See Parameter Reference for all valid ranges.
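For instance, bring the learning rate back inside the allowed range; how you set it depends on your launch method (CLI flag or dashboard field), so the field name here is illustrative:

```text
learning_rate = 1e-4   # was 1e-2; must fall within [1e-5, 5e-4]
```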
Evaluator build timeout
Error: `Evaluator build timed out after 10 minutes`
Fix:
- Check build logs in the Evaluators dashboard
- Common issues:
  - Large dependencies taking too long to install
  - Network issues downloading packages
  - Syntax errors in requirements.txt
- Wait for build to complete, then retry launching your job
- Consider splitting large dependencies or using lighter alternatives
What happens after launching
Once your job is created, here’s what happens:
1. Job queued
Your job enters the queue and waits for available GPU resources. Queue time depends on current demand.
Status: `PENDING`
2. Dataset validation
Fireworks validates your dataset to ensure it meets format requirements and quality standards. This typically takes 1-2 minutes.
Status: `VALIDATING`
3. Training starts
The system begins generating rollouts, evaluating them, and updating model weights. You’ll see:
- Rollout generation and evaluation
- Reward curves updating in real-time
- Training loss decreasing
Status: `RUNNING`
4. Monitor progress
Track training via the dashboard. See Monitor Training for details on interpreting metrics and debugging issues.
Status: `RUNNING` → `COMPLETED`
5. Job completes
When training finishes, your fine-tuned model is ready for deployment.
Status: `COMPLETED`
Next: Deploy your model for inference.