Update Deployment

curl --request PATCH \ --url https://api.fireworks.ai/v1/accounts/{account_id}/deployments/{deployment_id} \ --header 'Authorization: Bearer <token>' \ --header 'Content-Type: application/json' \ --data '{ "displayName": "<string>", "description": "<string>", "expireTime": "2023-11-07T05:31:56Z", "minReplicaCount": 123, "maxReplicaCount": 123, "autoscalingPolicy": { "scaleUpWindow": "<string>", "scaleDownWindow": "<string>", "scaleToZeroWindow": "<string>", "loadTargets": {} }, "baseModel": "<string>", "acceleratorCount": 123, "acceleratorType": "ACCELERATOR_TYPE_UNSPECIFIED", "precision": "PRECISION_UNSPECIFIED", "enableAddons": true, "draftTokenCount": 123, "draftModel": "<string>", "ngramSpeculationLength": 123, "deploymentTemplate": "<string>", "autoTune": { "longPrompt": true }, "placement": { "region": "REGION_UNSPECIFIED", "multiRegion": "MULTI_REGION_UNSPECIFIED", "regions": [ "REGION_UNSPECIFIED" ] }, "disableDeploymentSizeValidation": true }'

{ "name": "<string>", "displayName": "<string>", "description": "<string>", "createTime": "2023-11-07T05:31:56Z", "expireTime": "2023-11-07T05:31:56Z", "purgeTime": "2023-11-07T05:31:56Z", "deleteTime": "2023-11-07T05:31:56Z", "state": "STATE_UNSPECIFIED", "status": { "code": "OK", "message": "<string>" }, "minReplicaCount": 123, "maxReplicaCount": 123, "replicaCount": 123, "autoscalingPolicy": { "scaleUpWindow": "<string>", "scaleDownWindow": "<string>", "scaleToZeroWindow": "<string>", "loadTargets": {} }, "baseModel": "<string>", "acceleratorCount": 123, "acceleratorType": "ACCELERATOR_TYPE_UNSPECIFIED", "precision": "PRECISION_UNSPECIFIED", "cluster": "<string>", "enableAddons": true, "draftTokenCount": 123, "draftModel": "<string>", "ngramSpeculationLength": 123, "numPeftDeviceCached": 123, "deploymentTemplate": "<string>", "autoTune": { "longPrompt": true }, "placement": { "region": "REGION_UNSPECIFIED", "multiRegion": "MULTI_REGION_UNSPECIFIED", "regions": [ "REGION_UNSPECIFIED" ] }, "region": "REGION_UNSPECIFIED", "updateTime": "2023-11-07T05:31:56Z", "disableDeploymentSizeValidation": true }

Authorizations

Authorization

string

header

required

Bearer authentication header of the form Bearer <token>, where <token> is your auth token.

Path Parameters

account_id

string

required

The Account Id

deployment_id

string

required

The Deployment Id

Body

application/json

The properties of the deployment being updated. deployment.name must be populated with the updated resource's name.

The body is of type object.

Response

200 - application/json

A successful response.

The response is of type object.

LLM API

Response API

Embeddings API

Image API

Audio API

Audio batch API

Accounts

Deployments

Models

LoRAs

Supervised fine-tuning jobs

Reinforcement fine-tuning jobs

Datasets

Users

API Keys

Authorizations

Path Parameters

Body

Response