# Fireworks AI Docs

## Docs

- [Exporting Billing Metrics](https://docs.fireworks.ai/accounts/exporting-billing-metrics.md): Export billing and usage metrics for all Fireworks services
- [Service Accounts](https://docs.fireworks.ai/accounts/service-accounts.md): How to manage and use service accounts in Fireworks
- [Custom SSO](https://docs.fireworks.ai/accounts/sso.md): Set up custom Single Sign-On (SSO) authentication for Fireworks AI
- [Managing users](https://docs.fireworks.ai/accounts/users.md): Add and delete additional users in your Fireworks account
- [Batch Delete Batch Jobs](https://docs.fireworks.ai/api-reference-dlde/batch-delete-batch-jobs.md)
- [Batch Delete Environments](https://docs.fireworks.ai/api-reference-dlde/batch-delete-environments.md)
- [Batch Delete Node Pools](https://docs.fireworks.ai/api-reference-dlde/batch-delete-node-pools.md)
- [Cancel Batch Job](https://docs.fireworks.ai/api-reference-dlde/cancel-batch-job.md): Cancels an existing batch job if it is queued, pending, or running.
- [Connect Environment](https://docs.fireworks.ai/api-reference-dlde/connect-environment.md): Connects the environment to a node pool. Returns an error if there is an existing pending connection.
- [Create Aws Iam Role Binding](https://docs.fireworks.ai/api-reference-dlde/create-aws-iam-role-binding.md)
- [Create Batch Job](https://docs.fireworks.ai/api-reference-dlde/create-batch-job.md)
- [Create Cluster](https://docs.fireworks.ai/api-reference-dlde/create-cluster.md)
- [Create Environment](https://docs.fireworks.ai/api-reference-dlde/create-environment.md)
- [Create Node Pool](https://docs.fireworks.ai/api-reference-dlde/create-node-pool.md)
- [Create Node Pool Binding](https://docs.fireworks.ai/api-reference-dlde/create-node-pool-binding.md)
- [Create Snapshot](https://docs.fireworks.ai/api-reference-dlde/create-snapshot.md)
- [Delete Aws Iam Role Binding](https://docs.fireworks.ai/api-reference-dlde/delete-aws-iam-role-binding.md)
- [Delete Batch Job](https://docs.fireworks.ai/api-reference-dlde/delete-batch-job.md)
- [Delete Cluster](https://docs.fireworks.ai/api-reference-dlde/delete-cluster.md)
- [Delete Environment](https://docs.fireworks.ai/api-reference-dlde/delete-environment.md)
- [Delete Node Pool](https://docs.fireworks.ai/api-reference-dlde/delete-node-pool.md)
- [Delete Node Pool Binding](https://docs.fireworks.ai/api-reference-dlde/delete-node-pool-binding.md)
- [Delete Snapshot](https://docs.fireworks.ai/api-reference-dlde/delete-snapshot.md)
- [Disconnect Environment](https://docs.fireworks.ai/api-reference-dlde/disconnect-environment.md): Disconnects the environment from the node pool. Returns an error if the environment is not connected to a node pool.
- [Get Batch Job](https://docs.fireworks.ai/api-reference-dlde/get-batch-job.md)
- [Get Batch Job Logs](https://docs.fireworks.ai/api-reference-dlde/get-batch-job-logs.md)
- [Get Cluster](https://docs.fireworks.ai/api-reference-dlde/get-cluster.md)
- [Get Cluster Connection Info](https://docs.fireworks.ai/api-reference-dlde/get-cluster-connection-info.md): Retrieve connection settings for the cluster to be put in kubeconfig
- [Get Environment](https://docs.fireworks.ai/api-reference-dlde/get-environment.md)
- [Get Node Pool](https://docs.fireworks.ai/api-reference-dlde/get-node-pool.md)
- [Get Node Pool Stats](https://docs.fireworks.ai/api-reference-dlde/get-node-pool-stats.md)
- [Get Snapshot](https://docs.fireworks.ai/api-reference-dlde/get-snapshot.md)
- [List Aws Iam Role Bindings](https://docs.fireworks.ai/api-reference-dlde/list-aws-iam-role-bindings.md)
- [List Batch Jobs](https://docs.fireworks.ai/api-reference-dlde/list-batch-jobs.md)
- [List Clusters](https://docs.fireworks.ai/api-reference-dlde/list-clusters.md)
- [List Environments](https://docs.fireworks.ai/api-reference-dlde/list-environments.md)
- [List Node Pool Bindings](https://docs.fireworks.ai/api-reference-dlde/list-node-pool-bindings.md)
- [List Node Pools](https://docs.fireworks.ai/api-reference-dlde/list-node-pools.md)
- [List Snapshots](https://docs.fireworks.ai/api-reference-dlde/list-snapshots.md)
- [Update Batch Job](https://docs.fireworks.ai/api-reference-dlde/update-batch-job.md)
- [Update Cluster](https://docs.fireworks.ai/api-reference-dlde/update-cluster.md)
- [Update Environment](https://docs.fireworks.ai/api-reference-dlde/update-environment.md)
- [Update Node Pool](https://docs.fireworks.ai/api-reference-dlde/update-node-pool.md)
- [Streaming Transcription](https://docs.fireworks.ai/api-reference/audio-streaming-transcriptions.md)
- [Transcribe audio](https://docs.fireworks.ai/api-reference/audio-transcriptions.md)
- [Translate audio](https://docs.fireworks.ai/api-reference/audio-translations.md)
- [Create API Key](https://docs.fireworks.ai/api-reference/create-api-key.md)
- [Create Batch Inference Job](https://docs.fireworks.ai/api-reference/create-batch-inference-job.md)
- [Create Batch Request](https://docs.fireworks.ai/api-reference/create-batch-request.md)
- [Create Dataset](https://docs.fireworks.ai/api-reference/create-dataset.md)
- [Load LoRA](https://docs.fireworks.ai/api-reference/create-deployed-model.md)
- [Create Deployment](https://docs.fireworks.ai/api-reference/create-deployment.md)
- [Create Model](https://docs.fireworks.ai/api-reference/create-model.md)
- [Create Reinforcement Fine-tuning Job](https://docs.fireworks.ai/api-reference/create-reinforcement-fine-tuning-job.md)
- [Create Supervised Fine-tuning Job](https://docs.fireworks.ai/api-reference/create-supervised-fine-tuning-job.md)
- [Create User](https://docs.fireworks.ai/api-reference/create-user.md)
- [Create embeddings](https://docs.fireworks.ai/api-reference/creates-an-embedding-vector-representing-the-input-text.md)
- [Delete API Key](https://docs.fireworks.ai/api-reference/delete-api-key.md)
- [Delete Batch Inference Job](https://docs.fireworks.ai/api-reference/delete-batch-inference-job.md)
- [Delete Dataset](https://docs.fireworks.ai/api-reference/delete-dataset.md)
- [Unload LoRA](https://docs.fireworks.ai/api-reference/delete-deployed-model.md)
- [Delete Deployment](https://docs.fireworks.ai/api-reference/delete-deployment.md)
- [Delete Model](https://docs.fireworks.ai/api-reference/delete-model.md)
- [Delete Reinforcement Fine-tuning Job](https://docs.fireworks.ai/api-reference/delete-reinforcement-fine-tuning-job.md)
- [Delete a model response](https://docs.fireworks.ai/api-reference/delete-response.md): Deletes a model response by its ID. Once deleted, the response data will be gone immediately and permanently. The response cannot be recovered and any conversations that reference this response ID will no longer be able to access it.
- [Delete Supervised Fine-tuning Job](https://docs.fireworks.ai/api-reference/delete-supervised-fine-tuning-job.md)
- [Generate an image with FLUX.1 [schnell] FP8](https://docs.fireworks.ai/api-reference/generate-a-new-image-from-a-text-prompt.md)
- [Generate or edit an image with FLUX.1 Kontext](https://docs.fireworks.ai/api-reference/generate-or-edit-image-using-flux-kontext.md)
- [Get Account](https://docs.fireworks.ai/api-reference/get-account.md)
- [Get Batch Inference Job](https://docs.fireworks.ai/api-reference/get-batch-inference-job.md)
- [Check Batch Status](https://docs.fireworks.ai/api-reference/get-batch-status.md)
- [Get Dataset](https://docs.fireworks.ai/api-reference/get-dataset.md)
- [Get Dataset Upload Endpoint](https://docs.fireworks.ai/api-reference/get-dataset-upload-endpoint.md)
- [Get LoRA](https://docs.fireworks.ai/api-reference/get-deployed-model.md)
- [Get Deployment](https://docs.fireworks.ai/api-reference/get-deployment.md)
- [Get generated image from FLUX.1 Kontext](https://docs.fireworks.ai/api-reference/get-generated-image-from-flux-kontex.md)
- [Get Model](https://docs.fireworks.ai/api-reference/get-model.md)
- [Get Model Download Endpoint](https://docs.fireworks.ai/api-reference/get-model-download-endpoint.md)
- [Get Model Upload Endpoint](https://docs.fireworks.ai/api-reference/get-model-upload-endpoint.md)
- [Get Reinforcement Fine-tuning Job](https://docs.fireworks.ai/api-reference/get-reinforcement-fine-tuning-job.md)
- [Get Supervised Fine-tuning Job](https://docs.fireworks.ai/api-reference/get-supervised-fine-tuning-job.md)
- [Get User](https://docs.fireworks.ai/api-reference/get-user.md)
- [Introduction](https://docs.fireworks.ai/api-reference/introduction.md)
- [List API Keys](https://docs.fireworks.ai/api-reference/list-api-keys.md)
- [List Batch Inference Jobs](https://docs.fireworks.ai/api-reference/list-batch-inference-jobs.md)
- [List Datasets](https://docs.fireworks.ai/api-reference/list-datasets.md)
- [List LoRAs](https://docs.fireworks.ai/api-reference/list-deployed-models.md)
- [List Deployments](https://docs.fireworks.ai/api-reference/list-deployments.md)
- [List Models](https://docs.fireworks.ai/api-reference/list-models.md)
- [List Reinforcement Fine-tuning Jobs](https://docs.fireworks.ai/api-reference/list-reinforcement-fine-tuning-jobs.md)
- [List Supervised Fine-tuning Jobs](https://docs.fireworks.ai/api-reference/list-supervised-fine-tuning-jobs.md)
- [List Users](https://docs.fireworks.ai/api-reference/list-users.md)
- [Create Chat Completion](https://docs.fireworks.ai/api-reference/post-chatcompletions.md): Creates a model response for the given chat conversation.
- [Create Completion](https://docs.fireworks.ai/api-reference/post-completions.md): Creates a completion for the provided prompt and parameters.
- [Create a model response](https://docs.fireworks.ai/api-reference/post-responses.md): Creates a model response, optionally interacting with custom tools via the Model Context Protocol (MCP). This endpoint supports conversational continuation and streaming. Explore our cookbooks for detailed examples:
  - [Basic MCP Usage](https://github.com/fw-ai/cookbook/blob/main/learn/response-api/fireworks_mcp_examples.ipynb)
  - [Streaming with MCP](https://github.com/fw-ai/cookbook/blob/main/learn/response-api/fireworks_mcp_with_streaming.ipynb)
  - [Conversational History with `previous_response_id`](https://github.com/fw-ai/cookbook/blob/main/learn/response-api/fireworks_previous_response_cookbook.ipynb)
  - [Basic Streaming](https://github.com/fw-ai/cookbook/blob/main/learn/response-api/fireworks_streaming_example.ipynb)
  - [Controlling Response Storage](https://github.com/fw-ai/cookbook/blob/main/learn/response-api/mcp_server_with_store_false_argument.ipynb)
- [Prepare Model for different precisions](https://docs.fireworks.ai/api-reference/prepare-model.md)
- [Rerank documents](https://docs.fireworks.ai/api-reference/rerank-documents.md): Rerank documents for a query using relevance scoring
- [Undelete Deployment](https://docs.fireworks.ai/api-reference/undelete-deployment.md)
- [Update Dataset](https://docs.fireworks.ai/api-reference/update-dataset.md)
- [Update LoRA](https://docs.fireworks.ai/api-reference/update-deployed-model.md)
- [Update Deployment](https://docs.fireworks.ai/api-reference/update-deployment.md)
- [Update Model](https://docs.fireworks.ai/api-reference/update-model.md)
- [Update User](https://docs.fireworks.ai/api-reference/update-user.md)
- [Upload Dataset Files](https://docs.fireworks.ai/api-reference/upload-dataset-files.md): Provides a streamlined way to upload a dataset file in a single API request. This path can handle file sizes up to 150 MB. For larger files, use [Get Dataset Upload Endpoint](get-dataset-upload-endpoint).
- [Validate Dataset Upload](https://docs.fireworks.ai/api-reference/validate-dataset-upload.md)
- [Validate Model Upload](https://docs.fireworks.ai/api-reference/validate-model-upload.md)
- [Autoscaling](https://docs.fireworks.ai/deployments/autoscaling.md): Configure how your deployment scales based on traffic
- [Performance benchmarking](https://docs.fireworks.ai/deployments/benchmarking.md): Measure and optimize your deployment's performance with load testing
- [Client-side performance optimization](https://docs.fireworks.ai/deployments/client-side-performance-optimization.md): Optimize your client code for maximum performance with dedicated deployments
- [Direct routing](https://docs.fireworks.ai/deployments/direct-routing.md): Direct routing enables enterprise users to reduce latency to their deployments.
- [Exporting Metrics](https://docs.fireworks.ai/deployments/exporting-metrics.md): Export metrics from your dedicated deployments to your observability stack
- [Regions](https://docs.fireworks.ai/deployments/regions.md): Fireworks runs a global fleet of hardware on which you can deploy your models.
- [Reserved capacity](https://docs.fireworks.ai/deployments/reservations.md)
- [Speculative Decoding](https://docs.fireworks.ai/deployments/speculative-decoding.md): Speed up generation with draft models and n-gram speculation
- [Cloud Integrations](https://docs.fireworks.ai/ecosystem/integrations.md): Cloud Integrations
- [Agent Frameworks](https://docs.fireworks.ai/ecosystem/integrations/agent-frameworks.md): Build production-ready AI agents with Fireworks and leading open-source frameworks
- [null](https://docs.fireworks.ai/evaluators/api_reference/api_overview.md)
- [null](https://docs.fireworks.ai/evaluators/api_reference/data_models.md)
- [null](https://docs.fireworks.ai/evaluators/api_reference/reward_function_class.md)
- [null](https://docs.fireworks.ai/evaluators/api_reference/reward_function_decorator.md)
- [null](https://docs.fireworks.ai/evaluators/cli_reference/cli_overview.md)
- [null](https://docs.fireworks.ai/evaluators/developer_guide/agent_evaluation.md)
- [null](https://docs.fireworks.ai/evaluators/developer_guide/core_data_types.md)
- [null](https://docs.fireworks.ai/evaluators/developer_guide/evaluation_workflows.md)
- [null](https://docs.fireworks.ai/evaluators/developer_guide/getting_started.md)
- [null](https://docs.fireworks.ai/evaluators/developer_guide/reward_function_anatomy.md)
- [Using Secrets](https://docs.fireworks.ai/evaluators/developer_guide/using_secrets.md): Learn how to create secrets that can be utilized within your reward function.
- [null](https://docs.fireworks.ai/evaluators/documentation_home.md)
- [null](https://docs.fireworks.ai/evaluators/examples/accuracy_length/accuracy_length_overview.md)
- [null](https://docs.fireworks.ai/evaluators/examples/advanced_examples/advanced_reward_functions.md)
- [null](https://docs.fireworks.ai/evaluators/examples/advanced_examples/code_execution_with_e2b.md)
- [null](https://docs.fireworks.ai/evaluators/examples/advanced_examples/math_evaluation.md)
- [null](https://docs.fireworks.ai/evaluators/examples/apps_coding_example.md)
- [null](https://docs.fireworks.ai/evaluators/examples/basic_examples/basic_reward_function.md)
- [null](https://docs.fireworks.ai/evaluators/examples/basic_examples/reward_functions_overview.md)
- [null](https://docs.fireworks.ai/evaluators/examples/e2b/e2b_code_execution_overview.md)
- [null](https://docs.fireworks.ai/evaluators/examples/examples_overview.md)
- [null](https://docs.fireworks.ai/evaluators/examples/gcp_cloud_run_deployment_example.md)
- [null](https://docs.fireworks.ai/evaluators/examples/math_with_formatting_example.md)
- [null](https://docs.fireworks.ai/evaluators/examples/tool_calling_example.md)
- [Featured](https://docs.fireworks.ai/examples/introduction.md): Standalone examples showing how to use Fireworks to solve real-world use cases
- [null](https://docs.fireworks.ai/examples/knowledge-distillation.md)
- [null](https://docs.fireworks.ai/examples/reward-hacking.md)
- [null](https://docs.fireworks.ai/examples/text-to-sql.md)
- [How do I close my Fireworks.ai account?](https://docs.fireworks.ai/faq-new/account-access/how-do-i-close-my-fireworksai-account.md)
- [I have multiple Fireworks accounts. When I try to login with Google on Fireworks' web UI, I'm getting signed into the wrong account. How do I fix this?](https://docs.fireworks.ai/faq-new/account-access/i-have-multiple-fireworks-accounts-when-i-try-to-login-with-google-on-fireworks.md)
- [What email does GitHub authentication use?](https://docs.fireworks.ai/faq-new/account-access/what-email-does-github-authentication-use.md)
- [What email does LinkedIn authentication use?](https://docs.fireworks.ai/faq-new/account-access/what-email-does-linkedin-authentication-use.md)
- [What should I do if I can't access my company account after being invited when I already have a personal account?](https://docs.fireworks.ai/faq-new/account-access/what-should-i-do-if-i-cant-access-my-company-account-after-being-invited-when-i.md)
- [Are there discounts for bulk usage?](https://docs.fireworks.ai/faq-new/billing-pricing/are-there-discounts-for-bulk-usage.md)
- [Are there extra fees for serving fine-tuned models?](https://docs.fireworks.ai/faq-new/billing-pricing/are-there-extra-fees-for-serving-fine-tuned-models.md)
- [How does billing and credit usage work?](https://docs.fireworks.ai/faq-new/billing-pricing/how-does-billing-and-credit-usage-work.md)
- [How many tokens per image?](https://docs.fireworks.ai/faq-new/billing-pricing/how-many-tokens-per-image.md): Learn how to calculate token usage for images in vision models and understand pricing implications
- [How much does Fireworks cost?](https://docs.fireworks.ai/faq-new/billing-pricing/how-much-does-fireworks-cost.md)
- [Is prompt caching billed differently for serverless models?](https://docs.fireworks.ai/faq-new/billing-pricing/is-prompt-caching-billed-differently.md)
- [How do credits work?](https://docs.fireworks.ai/faq-new/billing-pricing/what-happens-when-i-finish-my-1-dollar-credit.md)
- [Why might my account be suspended even with remaining credits?](https://docs.fireworks.ai/faq-new/billing-pricing/why-might-my-account-be-suspended-even-with-remaining-credits.md)
- [Are there any quotas for serverless?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/are-there-any-quotas-for-serverless.md)
- [Do you provide notice before removing model availability?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/do-you-provide-notice-before-removing-model-availability.md)
- [Do you support Auto Scaling?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/do-you-support-auto-scaling.md)
- [How does autoscaling affect my costs?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/how-does-autoscaling-affect-my-costs.md)
- [How does billing and scaling work for on-demand GPU deployments?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/how-does-billing-and-scaling-work-for-on-demand-gpu-deployments.md)
- [How does billing work for on-demand deployments?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/how-does-billing-work-for-on-demand-deployments.md)
- [How does the system scale?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/how-does-the-system-scale.md)
- [Are there SLAs for serverless?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/is-latency-guaranteed-for-serverless-models.md)
- [What are the rate limits for on-demand deployments?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/what-are-the-rate-limits-for-on-demand-deployments.md)
- [What factors affect the number of simultaneous requests that can be handled?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/what-factors-affect-the-number-of-simultaneous-requests-that-can-be-handled.md)
- [What’s the supported throughput?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/whats-the-supported-throughput.md)
- [Why am I experiencing request timeout errors and slow response times with serverless LLM models?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/why-am-i-experiencing-request-timeout-errors-and-slow-response-times-with-server.md)
- [Does Fireworks support custom base models?](https://docs.fireworks.ai/faq-new/models-inference/does-fireworks-support-custom-base-models.md)
- [Does the API support batching and load balancing?](https://docs.fireworks.ai/faq-new/models-inference/does-the-api-support-batching-and-load-balancing.md)
- [FLUX image generation](https://docs.fireworks.ai/faq-new/models-inference/flux-image-generation.md)
- [How do I control output image sizes when using SDXL ControlNet?](https://docs.fireworks.ai/faq-new/models-inference/how-do-i-control-output-image-sizes-when-using-sdxl-controlnet.md)
- [How to check if a model is available on serverless?](https://docs.fireworks.ai/faq-new/models-inference/how-to-check-if-a-model-is-available-on-serverless.md)
- [There’s a model I would like to use that isn’t available on Fireworks. Can I request it?](https://docs.fireworks.ai/faq-new/models-inference/theres-a-model-i-would-like-to-use-that-isnt-available-on-fireworks-can-i-reques.md)
- [What factors affect the number of simultaneous requests that can be handled?](https://docs.fireworks.ai/faq-new/models-inference/what-factors-affect-the-number-of-simultaneous-requests-that-can-be-handled.md)
- [Deploying LoRAs](https://docs.fireworks.ai/fine-tuning/deploying-loras.md): Deploy one or multiple LoRA fine-tuned models
- [Direct Preference Optimization](https://docs.fireworks.ai/fine-tuning/dpo-fine-tuning.md)
- [Supervised Fine Tuning - Text](https://docs.fireworks.ai/fine-tuning/fine-tuning-models.md)
- [Supervised Fine Tuning - Vision](https://docs.fireworks.ai/fine-tuning/fine-tuning-vlm.md): Learn how to fine-tune vision-language models on Fireworks AI with image and text datasets
- [Fine Tuning Overview](https://docs.fireworks.ai/fine-tuning/finetuning-intro.md)
- [Reinforcement Fine Tuning](https://docs.fireworks.ai/fine-tuning/reinforcement-fine-tuning-models.md)
- [Secure Fine Tuning](https://docs.fireworks.ai/fine-tuning/secure-fine-tuning.md): Fine-tune models while keeping sensitive data and components under your control
- [Concepts](https://docs.fireworks.ai/getting-started/concepts.md): This document outlines basic Fireworks AI concepts.
- [Build with Fireworks AI](https://docs.fireworks.ai/getting-started/introduction.md): Fast inference and fine-tuning for open source models
- [Deployments Quickstart](https://docs.fireworks.ai/getting-started/ondemand-quickstart.md): Deploy models on dedicated GPUs in minutes
- [Serverless Quickstart](https://docs.fireworks.ai/getting-started/quickstart.md): Make your first Serverless API call in minutes
- [Batch API](https://docs.fireworks.ai/guides/batch-inference.md): Process large-scale async workloads
- [Completions API](https://docs.fireworks.ai/guides/completions-api.md): Use the completions API for raw text generation with custom prompt templates
- [Tool Calling](https://docs.fireworks.ai/guides/function-calling.md): Connect models to external tools and APIs
- [Inference Error Codes](https://docs.fireworks.ai/guides/inference-error-codes.md): Common error codes, their meanings, and resolutions for inference requests
- [Deployments](https://docs.fireworks.ai/guides/ondemand-deployments.md): Configure and manage on-demand deployments on dedicated GPUs
- [Using predicted outputs](https://docs.fireworks.ai/guides/predicted-outputs.md): Use Predicted Outputs to boost output generation speeds for editing / rewriting use cases
- [Prompt caching](https://docs.fireworks.ai/guides/prompt-caching.md)
- [Speech to Text](https://docs.fireworks.ai/guides/querying-asr-models.md): Convert audio to text with streaming and pre-recorded transcription
- [Embeddings & Reranking](https://docs.fireworks.ai/guides/querying-embeddings-models.md): Generate embeddings and rerank results for semantic search
- [Text Models](https://docs.fireworks.ai/guides/querying-text-models.md): Query, track and manage inference for text models
- [Vision Models](https://docs.fireworks.ai/guides/querying-vision-language-models.md): Query vision-language models to analyze images and visual content
- [Rate Limits & Quotas](https://docs.fireworks.ai/guides/quotas_usage/rate-limits.md): Understand and manage your rate limits, spend limits and quotas
- [Which model should I use?](https://docs.fireworks.ai/guides/recommended-models.md): A list of recommended open models for common use cases
- [Responses API](https://docs.fireworks.ai/guides/response-api.md)
- [Audit & Access Logs](https://docs.fireworks.ai/guides/security_compliance/audit_logs.md): Monitor and track account activities with audit logging for Enterprise accounts
- [Zero Data Retention](https://docs.fireworks.ai/guides/security_compliance/data_handling.md): Data retention policies at Fireworks
- [Data Security](https://docs.fireworks.ai/guides/security_compliance/data_security.md): How we secure and handle your data for inference and training
- [Quantization](https://docs.fireworks.ai/models/quantization.md): Reduce model precision to improve performance and lower costs
- [Custom Models](https://docs.fireworks.ai/models/uploading-custom-models.md): Upload, verify, and deploy your own models from Hugging Face or elsewhere
- [Structured Outputs](https://docs.fireworks.ai/structured-responses/structured-response-formatting.md): Enforce output formats using JSON schemas or custom grammars
- [Authentication](https://docs.fireworks.ai/tools-sdks/firectl/commands/authentication.md): Authentication for access to your account
- [firectl create api-key](https://docs.fireworks.ai/tools-sdks/firectl/commands/create-api-key.md): Creates an API key for the signed-in user or a specified service account user.
- [firectl create batch-inference-job](https://docs.fireworks.ai/tools-sdks/firectl/commands/create-batch-inference-job.md): Creates a batch inference job.
- [firectl create dataset](https://docs.fireworks.ai/tools-sdks/firectl/commands/create-dataset.md): Creates and uploads a dataset.
- [firectl create deployment](https://docs.fireworks.ai/tools-sdks/firectl/commands/create-deployment.md): Creates a new deployment.
- [firectl create dpo-job](https://docs.fireworks.ai/tools-sdks/firectl/commands/create-dpo-job.md): Creates a DPO job.
- [firectl create identity-provider](https://docs.fireworks.ai/tools-sdks/firectl/commands/create-identity-provider.md): Creates a new identity provider.
- [firectl create model](https://docs.fireworks.ai/tools-sdks/firectl/commands/create-model.md): Creates and uploads a model.
- [firectl create reinforcement-fine-tuning-job](https://docs.fireworks.ai/tools-sdks/firectl/commands/create-reinforcement-fine-tuning-job.md): Creates a reinforcement fine-tuning job.
- [firectl create secret](https://docs.fireworks.ai/tools-sdks/firectl/commands/create-secret.md): Creates a secret for the signed-in user.
- [firectl create supervised-fine-tuning-job](https://docs.fireworks.ai/tools-sdks/firectl/commands/create-supervised-fine-tuning-job.md): Creates a supervised fine-tuning job.
- [firectl create user](https://docs.fireworks.ai/tools-sdks/firectl/commands/create-user.md): Creates a new user.
- [firectl delete api-key](https://docs.fireworks.ai/tools-sdks/firectl/commands/delete-api-key.md): Deletes an API key.
- [firectl delete batch-inference-job](https://docs.fireworks.ai/tools-sdks/firectl/commands/delete-batch-inference-job.md): Deletes a batch inference job.
- [firectl delete dataset](https://docs.fireworks.ai/tools-sdks/firectl/commands/delete-dataset.md): Deletes a dataset.
- [firectl delete deployment](https://docs.fireworks.ai/tools-sdks/firectl/commands/delete-deployment.md): Deletes a deployment.
- [firectl delete dpo-job](https://docs.fireworks.ai/tools-sdks/firectl/commands/delete-dpo-job.md): Deletes a DPO job.
- [firectl delete model](https://docs.fireworks.ai/tools-sdks/firectl/commands/delete-model.md): Deletes a model.
- [firectl delete reinforcement-fine-tuning-job](https://docs.fireworks.ai/tools-sdks/firectl/commands/delete-reinforcement-fine-tuning-job.md): Deletes a reinforcement fine-tuning job.
- [firectl delete secret](https://docs.fireworks.ai/tools-sdks/firectl/commands/delete-secret.md): Deletes a secret.
- [firectl delete supervised-fine-tuning-job](https://docs.fireworks.ai/tools-sdks/firectl/commands/delete-supervised-fine-tuning-job.md): Deletes a supervised fine-tuning job.
- [firectl delete user](https://docs.fireworks.ai/tools-sdks/firectl/commands/delete-user.md): Deletes a user.
- [firectl download billing-metrics](https://docs.fireworks.ai/tools-sdks/firectl/commands/download-billing-metrics.md): Exports billing metrics
- [firectl download dataset](https://docs.fireworks.ai/tools-sdks/firectl/commands/download-dataset.md): Downloads a dataset to a local directory.
- [firectl download dpo-job-metrics](https://docs.fireworks.ai/tools-sdks/firectl/commands/download-dpo-job-metrics.md): Retrieves metrics for a DPO job.
- [firectl download model](https://docs.fireworks.ai/tools-sdks/firectl/commands/download-model.md): Download a model.
- [firectl get account](https://docs.fireworks.ai/tools-sdks/firectl/commands/get-account.md): Prints information about an account.
- [firectl get batch-inference-job](https://docs.fireworks.ai/tools-sdks/firectl/commands/get-batch-inference-job.md): Retrieves information about a batch inference job.
- [firectl get dataset](https://docs.fireworks.ai/tools-sdks/firectl/commands/get-dataset.md): Prints information about a dataset.
- [firectl get deployed-model](https://docs.fireworks.ai/tools-sdks/firectl/commands/get-deployed-model.md): Prints information about a deployed model.
- [firectl get deployment](https://docs.fireworks.ai/tools-sdks/firectl/commands/get-deployment.md): Prints information about a deployment.
- [firectl get deployment-shape-version](https://docs.fireworks.ai/tools-sdks/firectl/commands/get-deployment-shape-version.md): Prints information about a deployment shape version.
- [firectl get dpo-job](https://docs.fireworks.ai/tools-sdks/firectl/commands/get-dpo-job.md): Retrieves information about a DPO job.
- [firectl get feature-flag](https://docs.fireworks.ai/tools-sdks/firectl/commands/get-feature-flag.md): Gets a feature flag.
- [firectl get identity-provider](https://docs.fireworks.ai/tools-sdks/firectl/commands/get-identity-provider.md): Prints information about an identity provider.
- [firectl get model](https://docs.fireworks.ai/tools-sdks/firectl/commands/get-model.md): Prints information about a model.
- [firectl get quota](https://docs.fireworks.ai/tools-sdks/firectl/commands/get-quota.md): Prints information about a quota.
- [firectl get reinforcement-fine-tuning-job](https://docs.fireworks.ai/tools-sdks/firectl/commands/get-reinforcement-fine-tuning-job.md): Retrieves information about a reinforcement fine-tuning job.
- [firectl get reservation](https://docs.fireworks.ai/tools-sdks/firectl/commands/get-reservation.md): Prints information about a reservation.
- [firectl get secret](https://docs.fireworks.ai/tools-sdks/firectl/commands/get-secret.md): Retrieves a secret by name.
- [firectl get supervised-fine-tuning-job](https://docs.fireworks.ai/tools-sdks/firectl/commands/get-supervised-fine-tuning-job.md): Retrieves information about a supervised fine-tuning job.
- [firectl get user](https://docs.fireworks.ai/tools-sdks/firectl/commands/get-user.md): Prints information about a user.
- [firectl list accounts](https://docs.fireworks.ai/tools-sdks/firectl/commands/list-accounts.md): Prints all accounts the current signed-in user has access to.
- [firectl list api-key](https://docs.fireworks.ai/tools-sdks/firectl/commands/list-api-key.md): Prints all API keys for the signed-in user.
- [firectl list batch-inference-jobs](https://docs.fireworks.ai/tools-sdks/firectl/commands/list-batch-inference-jobs.md): Lists all batch inference jobs in an account.
- [firectl list datasets](https://docs.fireworks.ai/tools-sdks/firectl/commands/list-datasets.md): Prints all datasets in an account.
- [firectl list deployed-models](https://docs.fireworks.ai/tools-sdks/firectl/commands/list-deployed-models.md): Prints all deployed models in the account.
- [firectl list deployment-shape-versions](https://docs.fireworks.ai/tools-sdks/firectl/commands/list-deployment-shape-versions.md): Prints all deployment shape versions of this deployment shape.
- [firectl list deployments](https://docs.fireworks.ai/tools-sdks/firectl/commands/list-deployments.md): Prints all deployments in the account.
- [firectl list dpo-jobs](https://docs.fireworks.ai/tools-sdks/firectl/commands/list-dpo-jobs.md): Lists all DPO jobs in an account.
- [firectl list identity-providers](https://docs.fireworks.ai/tools-sdks/firectl/commands/list-identity-providers.md): List identity providers for an account
- [firectl list invoices](https://docs.fireworks.ai/tools-sdks/firectl/commands/list-invoices.md): Prints information about invoices.
- [firectl list models](https://docs.fireworks.ai/tools-sdks/firectl/commands/list-models.md): Prints all models in an account.
- [firectl list quotas](https://docs.fireworks.ai/tools-sdks/firectl/commands/list-quotas.md): Prints all quotas.
- [firectl list reinforcement-fine-tuning-jobs](https://docs.fireworks.ai/tools-sdks/firectl/commands/list-reinforcement-fine-tuning-jobs.md): Lists all reinforcement fine-tuning jobs in an account.
- [firectl list reservations](https://docs.fireworks.ai/tools-sdks/firectl/commands/list-reservations.md): Prints active reservations.
- [firectl list secret](https://docs.fireworks.ai/tools-sdks/firectl/commands/list-secret.md): Lists all secrets for the signed-in user.
- [firectl list supervised-fine-tuning-jobs](https://docs.fireworks.ai/tools-sdks/firectl/commands/list-supervised-fine-tuning-jobs.md): Lists all supervised fine-tuning jobs in an account.
- [firectl list user](https://docs.fireworks.ai/tools-sdks/firectl/commands/list-user.md): Prints all users in the account.
- [firectl load-lora](https://docs.fireworks.ai/tools-sdks/firectl/commands/load-lora.md): Loads a LoRA model to a deployment.
- [firectl prepare-model](https://docs.fireworks.ai/tools-sdks/firectl/commands/prepare-model.md): Prepares models for different precisions.
- [firectl resume reinforcement-fine-tuning-job](https://docs.fireworks.ai/tools-sdks/firectl/commands/resume-reinforcement-fine-tuning-job.md): Resumes a failed reinforcement fine-tuning job.
- [firectl scale](https://docs.fireworks.ai/tools-sdks/firectl/commands/scale.md): Scales a deployment to a specified number of replicas.
- [firectl undelete deployment](https://docs.fireworks.ai/tools-sdks/firectl/commands/undelete-deployment.md): Undeletes a deployment.
- [firectl unload-lora](https://docs.fireworks.ai/tools-sdks/firectl/commands/unload-lora.md): Unloads a LoRA model from a deployment.
- [firectl update dataset](https://docs.fireworks.ai/tools-sdks/firectl/commands/update-dataset.md): Updates a dataset.
- [firectl update deployed-model](https://docs.fireworks.ai/tools-sdks/firectl/commands/update-deployed-model.md): Updates a deployed model.
- [firectl update deployment](https://docs.fireworks.ai/tools-sdks/firectl/commands/update-deployment.md): Updates a deployment.
- [firectl update model](https://docs.fireworks.ai/tools-sdks/firectl/commands/update-model.md): Updates a model.
- [firectl update quota](https://docs.fireworks.ai/tools-sdks/firectl/commands/update-quota.md): Updates a quota.
- [firectl update secret](https://docs.fireworks.ai/tools-sdks/firectl/commands/update-secret.md): Updates an existing secret.
- [firectl update user](https://docs.fireworks.ai/tools-sdks/firectl/commands/update-user.md): Updates a user.
- [firectl upgrade](https://docs.fireworks.ai/tools-sdks/firectl/commands/upgrade.md): Upgrades the firectl binary to the latest version.
- [firectl upload model](https://docs.fireworks.ai/tools-sdks/firectl/commands/upload-model.md): Resumes or completes a model upload.
- [firectl version](https://docs.fireworks.ai/tools-sdks/firectl/commands/version.md): Prints the version of firectl.
- [firectl whoami](https://docs.fireworks.ai/tools-sdks/firectl/commands/whoami.md): Shows the currently authenticated user.
- [Getting started](https://docs.fireworks.ai/tools-sdks/firectl/firectl.md): Learn to create, deploy, and manage resources using firectl.
- [OpenAI compatibility](https://docs.fireworks.ai/tools-sdks/openai-compatibility.md)
- [Querying Dedicated Deployments](https://docs.fireworks.ai/tools-sdks/python-client/querying-dedicated-deployments.md): Learn how to connect to and query dedicated deployments that were created outside the SDK.
- [Build SDK Basics](https://docs.fireworks.ai/tools-sdks/python-client/sdk-basics.md)
- [Build SDK Introduction](https://docs.fireworks.ai/tools-sdks/python-client/sdk-introduction.md)
- [Reference](https://docs.fireworks.ai/tools-sdks/python-client/sdk-reference.md)
- [Tutorial](https://docs.fireworks.ai/tools-sdks/python-client/the-tutorial.md)
- [Changelog](https://docs.fireworks.ai/updates/changelog.md)