> ## Documentation Index
> Fetch the complete documentation index at: https://docs.fireworks.ai/llms.txt
> Use this file to discover all available pages before exploring further.

# Get Account Usage



## OpenAPI

````yaml get /v1/accounts/{account_id}/billingUsage
openapi: 3.1.0
info:
  title: Gateway REST API
  version: 4.259.0
servers:
  - url: https://api.fireworks.ai
security:
  - BearerAuth: []
tags:
  - name: Gateway
paths:
  /v1/accounts/{account_id}/billingUsage:
    get:
      tags:
        - Gateway
      summary: Get Account Usage
      operationId: Gateway_GetAccountUsage
      parameters:
        - name: startTime
          description: |-
            Costs returned are inclusive of `start_time`.
            start_time must be before end_time.
          in: query
          required: true
          schema:
            type: string
            format: date-time
        - name: endTime
          description: |-
            Costs returned are exclusive of `end_time`.
            end_time must not be more than 31 days after start_time.
          in: query
          required: true
          schema:
            type: string
            format: date-time
        - name: usageType
          description: >-
            Usage type to query usage for. If not specified, returns all usage
            types (serverless, dedicated deployments, and training).

             - USAGE_TYPE_UNSPECIFIED: Default value. When specified (or when usage_type field is not set),
            returns usage data for all deployment types: serverless requests,
            dedicated deployments, and training jobs.
             - SERVERLESS: Returns only serverless usage data.
            Filters the response to include only usage from serverless API
            requests.
             - DEDICATED_DEPLOYMENT: Returns only dedicated deployment usage data.
            Filters the response to include only usage from dedicated
            deployments.
             - TRAINING: Returns only training job usage data (SFT/DPO token usage and RFT / service-mode
            trainer GPU-seconds usage). Inference deployments serving rollouts
            for RFT / online RL are reported under DEDICATED_DEPLOYMENT (not
            TRAINING) to avoid double counting GPU time.
          in: query
          required: false
          schema:
            type: string
            enum:
              - USAGE_TYPE_UNSPECIFIED
              - SERVERLESS
              - DEDICATED_DEPLOYMENT
              - TRAINING
            default: USAGE_TYPE_UNSPECIFIED
        - name: timezone
          description: >-
            IANA timezone identifier for daily aggregation (e.g.,
            "America/Los_Angeles", "Europe/London").

            When specified, the returned data will be aggregated into daily
            buckets based on this timezone.

            If not specified or empty, defaults to "UTC".

            See: https://en.wikipedia.org/wiki/List_of_tz_database_time_zones
          in: query
          required: false
          schema:
            type: string
        - name: groupBy
          description: >-
            Dimensions to group usage by (multiple values allowed; each is a
            separate GROUP BY column). Each returned bucket carries the
            requested dimension values in the `group` map on the response item.

            Serverless: "model_name", "api_key_id", "api_key_name",
            "annotations.team", "annotations.project",
            "annotations.environment".

            Dedicated: "deployment_name", "accelerator_type", and the same
            annotation keys.

            Training: "job_id", "job_type", "usage_type", "accelerator_type",
            "base_model", and the same annotation keys.

            When usage_type is unspecified, dimensions that apply only to one
            stream are ignored on the others

            (e.g. "deployment_name" is ignored for serverless and training;
            "model_name" / "api_key_id" / "api_key_name" are ignored for
            dedicated and training; "job_id" / "job_type" are ignored for
            serverless and dedicated).

            Example: ["annotations.team", "model_name"] or ["api_key_id",
            "api_key_name"].

            If empty: serverless aggregates by model name; dedicated defaults to
            deployment and accelerator type; training aggregates by job_id,
            job_type, usage_type, accelerator_type and base_model.
          in: query
          required: false
          explode: true
          schema:
            type: array
            maxItems: 5
            items:
              type: string
        - name: filter
          description: >-
            Filter usage by dimension. Map query parameter — encode each entry
            as `filter[<dimension>][values]=<value>`, repeating the same key to
            OR multiple values for a single dimension.

            Serverless: "model_name", "api_key_id", "api_key_name",
            "annotations.team", "annotations.project",
            "annotations.environment".

            Dedicated: "deployment_name", "accelerator_type", and the same
            annotation keys.

            Training: "job_id", "job_type", "usage_type", "accelerator_type",
            "base_model", and the same annotation keys.

            Example:
            `filter[api_key_name][values]=prod-key&filter[api_key_name][values]=staging-key`.
          in: query
          required: false
          style: deepObject
          explode: true
          schema:
            type: object
            additionalProperties:
              type: object
              properties:
                values:
                  type: array
                  items:
                    type: string
        - name: account_id
          in: path
          required: true
          description: The Account Id
          schema:
            type: string
      responses:
        '200':
          description: A successful response.
          content:
            application/json:
              schema:
                $ref: '#/components/schemas/gatewayAccountUsage'
components:
  schemas:
    gatewayAccountUsage:
      type: object
      properties:
        serverlessCosts:
          type: array
          items:
            $ref: '#/components/schemas/AccountUsageServerlessUsage'
            type: object
          title: List of serverless cost data
        dedicatedCosts:
          type: array
          items:
            $ref: '#/components/schemas/AccountUsageDedicatedDeploymentUsage'
            type: object
          title: List of dedicated deployment cost data
      title: Response with model costs by deployment type
    AccountUsageServerlessUsage:
      type: object
      properties:
        modelName:
          type: string
          description: 'Deprecated: use group["model_name"] instead.'
        promptTokens:
          type: string
          format: int64
          title: Number of prompt tokens (for text inference)
        completionTokens:
          type: string
          format: int64
          title: Number of completion tokens (for text inference)
        startTime:
          type: string
          format: date-time
          title: >-
            Start timestamp of the usage.

            Note: Serverless usage events are aggregated daily. Only the date
            portion (YYYY-MM-DD) is used;
        endTime:
          type: string
          format: date-time
          title: >-
            End timestamp of the usage.

            Note: Serverless usage events are aggregated daily. Only the date
            portion (YYYY-MM-DD) is used;
        audioInputSeconds:
          type: number
          format: double
          title: Audio input seconds (for audio inference)
        usageType:
          type: string
          title: Usage type to distinguish between different inference types
        apiKeyId:
          type: string
          description: 'Deprecated: use group["api_key_id"] instead.'
        group:
          type: object
          additionalProperties:
            type: string
          description: >-
            Group-by dimension key-values (model_name, api_key_id, team,
            project, environment).
      description: One serverless usage aggregation bucket.
    AccountUsageDedicatedDeploymentUsage:
      type: object
      properties:
        deploymentId:
          type: string
          title: The deployment ID
        acceleratorType:
          type: string
          title: GPU type / accelerator type
        acceleratorSeconds:
          type: string
          format: int64
          title: Accelerator seconds
        startTime:
          type: string
          format: date-time
          description: >-
            Start timestamp of the usage.

            Note: Dedicated deployment usage events are aggregated daily. Only
            the date portion (YYYY-MM-DD) is used;

            the time portion is ignored.
        endTime:
          type: string
          format: date-time
          description: >-
            End timestamp of the usage.

            Note: Dedicated deployment usage events are aggregated daily. Only
            the date portion (YYYY-MM-DD) is used;

            the time portion is ignored.
        baseModel:
          type: string
          title: Base model of the deployment
        usageType:
          type: string
          title: Usage type to distinguish between different deployment types
        placement:
          type: string
          description: >-
            Deployment placement as a string (e.g. US_IOWA_1, GLOBAL, US,
            EUROPE).

            Set to REGION_UNSPECIFIED when unknown.
        group:
          type: object
          additionalProperties:
            type: string
          description: >-
            Group-by dimension key-values (deployment_name, accelerator_type,
            team, project, environment).
      description: One dedicated deployment usage aggregation bucket.
  securitySchemes:
    BearerAuth:
      type: http
      scheme: bearer
      description: >-
        Bearer authentication using your Fireworks API key. Format: Bearer
        <API_KEY>
      bearerFormat: API_KEY

````