Overview

firectl billing get-usage returns your account’s usage and rated costs for a time range, grouped by dimensions you choose — deployment, accelerator type, model, region, or custom team/project/environment tags. This complements Exporting Billing Metrics: use export-metrics for a raw per-event CSV dump, and get-usage for a grouped, rated view.
Requires firectl 1.7.21 or later. Run firectl version, then firectl upgrade if needed.

Usage

firectl billing get-usage \
  --start-time 2026-05-01 \
  --end-time   2026-06-01
The output has two parts: account costs (rated dollar totals for the range) and usage (the metered quantities, grouped by your chosen dimensions). Add -o json for machine-readable output.
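
Because --start-time is inclusive and --end-time is exclusive, a full calendar month runs from the first day of that month to the first day of the next. The following is a minimal Python sketch of that date arithmetic, assuming you script firectl from Python. The helper names month_range and get_usage_argv are hypothetical, and only flags documented on this page are used.

```python
from datetime import date


def month_range(year: int, month: int) -> tuple[str, str]:
    """Return (start, end) for one calendar month; end is exclusive."""
    start = date(year, month, 1)
    # December rolls over to January of the following year.
    end = date(year + 1, 1, 1) if month == 12 else date(year, month + 1, 1)
    return start.isoformat(), end.isoformat()


def get_usage_argv(year: int, month: int) -> list[str]:
    """Build the firectl argument list for one month (hypothetical helper)."""
    start, end = month_range(year, month)
    return ["firectl", "billing", "get-usage",
            "--start-time", start, "--end-time", end]


if __name__ == "__main__":
    # Prints: firectl billing get-usage --start-time 2026-05-01 --end-time 2026-06-01
    print(" ".join(get_usage_argv(2026, 5)))
```

The resulting list can be passed to subprocess.run as-is. Using the first day of the next month as the end avoids having to know how many days the month has.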

Examples

# Serverless usage by model
firectl billing get-usage --start-time 2026-05-01 --end-time 2026-06-01 \
  --usage-type serverless --group-by model_name

# Dedicated deployment usage by deployment and GPU type
firectl billing get-usage --start-time 2026-05-01 --end-time 2026-06-01 \
  --usage-type dedicated-deployment --group-by deployment_name --group-by accelerator_type

# Filter to a single deployment
firectl billing get-usage --start-time 2026-05-01 --end-time 2026-06-01 \
  --filter deployment_name=accounts/my-account/deployments/my-deployment

# Account-level cost totals only
firectl billing get-usage --start-time 2026-05-01 --end-time 2026-06-01 --account-costs-only

Flags

Flag                    Description
--start-time            Start time (inclusive), as YYYY-MM-DD or 'YYYY-MM-DD hh:mm:ss'.
--end-time              End time (exclusive), same formats.
--usage-type            all, serverless, or dedicated-deployment. Defaults to all.
--group-by              Dimension to group by. Repeatable.
--filter                key=value filter. Repeatable; repeated values for the same key are OR’ed.
--timezone              IANA timezone for daily aggregation (e.g. America/Los_Angeles). Defaults to UTC.
--account-costs-only    Print only account-level cumulative costs for the range.
-o, --output            text (default) or json.
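
Since --group-by and --filter are both repeatable, a script typically expands a list of dimensions and a map of filter values into one flag per entry. A minimal sketch follows; repeated_flags is a hypothetical helper, and the second deployment name is a made-up placeholder.

```python
def repeated_flags(group_by: list[str], filters: dict[str, list[str]]) -> list[str]:
    """Expand dimensions and filters into repeated --group-by / --filter flags."""
    args: list[str] = []
    for dim in group_by:
        args += ["--group-by", dim]
    for key, values in filters.items():
        # One --filter per value; values that share a key are OR'ed by the CLI.
        for value in values:
            args += ["--filter", f"{key}={value}"]
    return args


if __name__ == "__main__":
    flags = repeated_flags(
        ["deployment_name"],
        {"deployment_name": [
            "accounts/my-account/deployments/my-deployment",
            "accounts/my-account/deployments/my-other-deployment",  # placeholder
        ]},
    )
    print(" ".join(flags))
```

Append the returned list to the base firectl billing get-usage arguments.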

Grouping dimensions

Valid --group-by and --filter dimensions depend on the usage type:
  • Serverless: model_name, api_key_id, api_key_name, annotations.team, annotations.project, annotations.environment
  • Dedicated deployment: deployment_name, accelerator_type, annotations.team, annotations.project, annotations.environment
Dedicated-deployment rows also include the deployment’s region (placement, e.g. US, EUROPE, GLOBAL) and metered accelerator_seconds.
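
The per-type lists above can be checked before calling the CLI. The sketch below encodes them as a lookup table; ALLOWED and invalid_dimensions are hypothetical names. It covers only serverless and dedicated-deployment, because this page does not list the dimensions accepted with --usage-type all.

```python
ANNOTATIONS = {"annotations.team", "annotations.project", "annotations.environment"}

# Dimensions copied from the lists on this page, keyed by --usage-type value.
ALLOWED = {
    "serverless": {"model_name", "api_key_id", "api_key_name"} | ANNOTATIONS,
    "dedicated-deployment": {"deployment_name", "accelerator_type"} | ANNOTATIONS,
}


def invalid_dimensions(usage_type: str, dims: list[str]) -> list[str]:
    """Return the dimensions that are not listed for the given usage type."""
    allowed = ALLOWED[usage_type]
    return [d for d in dims if d not in allowed]


if __name__ == "__main__":
    # deployment_name is not a serverless dimension, so it is reported.
    print(invalid_dimensions("serverless", ["model_name", "deployment_name"]))
```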

Custom tags (team / project / environment)

Group by annotations.team, annotations.project, or annotations.environment to split usage by your own labels. The tag source depends on usage type:
  • Dedicated deployments: set an annotations map on the deployment, e.g. {"team": "search", "project": "x", "environment": "prod"}.
  • Serverless: send a per-request header, e.g. Fireworks-Annotations: team=search,project=x,environment=prod.
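
For serverless requests, the header value is a comma-separated list of key=value pairs, as in the example above. A minimal sketch of building it from a dictionary follows; annotations_header is a hypothetical helper. This page does not describe how to escape commas or equals signs, so the sketch rejects them rather than guessing.

```python
def annotations_header(tags: dict[str, str]) -> dict[str, str]:
    """Build a Fireworks-Annotations header from tag key/value pairs."""
    for key, value in tags.items():
        # Escaping rules are not documented here, so refuse ambiguous input.
        if any(ch in key or ch in value for ch in ",="):
            raise ValueError(f"unsupported character in tag {key!r}")
    pairs = ",".join(f"{key}={value}" for key, value in tags.items())
    return {"Fireworks-Annotations": pairs}


if __name__ == "__main__":
    headers = annotations_header(
        {"team": "search", "project": "x", "environment": "prod"}
    )
    # Prints: {'Fireworks-Annotations': 'team=search,project=x,environment=prod'}
    print(headers)
```

Merge the returned dictionary into the headers of each inference request with whichever HTTP client you use.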

Granularity

Usage is aggregated into daily buckets (--timezone sets the day boundary). There are no sub-daily buckets.
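
To see why the day boundary matters, consider a single event near midnight UTC. The sketch below only illustrates the calendar arithmetic and is not the service's aggregation logic; bucket_day is a hypothetical helper, and a fixed UTC-7 offset stands in for America/Los_Angeles during daylight time.

```python
from datetime import datetime, timedelta, timezone, tzinfo

# Assumption: fixed UTC-7 offset standing in for America/Los_Angeles (PDT).
PDT = timezone(timedelta(hours=-7))


def bucket_day(ts: datetime, tz: tzinfo) -> str:
    """Return the calendar day an aware timestamp falls on in the given zone."""
    return ts.astimezone(tz).date().isoformat()


event = datetime(2026, 5, 2, 3, 30, tzinfo=timezone.utc)
print(bucket_day(event, timezone.utc))  # 2026-05-02
print(bucket_day(event, PDT))           # 2026-05-01
```

The same event lands in different daily buckets depending on the zone, so totals for a given day can differ between a UTC report and a report that passes --timezone.
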
Run firectl billing get-usage --help to see all available flags and options.

See also