# Fireworks AI Docs

## Docs

- [Custom SSO](https://docs.fireworks.ai/accounts/sso.md): Set up custom Single Sign-On (SSO) authentication for Fireworks AI
- [Managing users](https://docs.fireworks.ai/accounts/users.md): Add and delete additional users in your Fireworks account
- [Batch Delete Batch Jobs](https://docs.fireworks.ai/api-reference-dlde/batch-delete-batch-jobs.md)
- [Batch Delete Environments](https://docs.fireworks.ai/api-reference-dlde/batch-delete-environments.md)
- [Batch Delete Node Pools](https://docs.fireworks.ai/api-reference-dlde/batch-delete-node-pools.md)
- [Cancel Batch Job](https://docs.fireworks.ai/api-reference-dlde/cancel-batch-job.md): Cancels an existing batch job if it is queued, pending, or running.
- [Connect Environment](https://docs.fireworks.ai/api-reference-dlde/connect-environment.md): Connects the environment to a node pool. Returns an error if there is an existing pending connection.
- [Create Aws Iam Role Binding](https://docs.fireworks.ai/api-reference-dlde/create-aws-iam-role-binding.md)
- [Create Batch Job](https://docs.fireworks.ai/api-reference-dlde/create-batch-job.md)
- [Create Cluster](https://docs.fireworks.ai/api-reference-dlde/create-cluster.md)
- [Create Environment](https://docs.fireworks.ai/api-reference-dlde/create-environment.md)
- [Create Node Pool](https://docs.fireworks.ai/api-reference-dlde/create-node-pool.md)
- [Create Node Pool Binding](https://docs.fireworks.ai/api-reference-dlde/create-node-pool-binding.md)
- [Create Snapshot](https://docs.fireworks.ai/api-reference-dlde/create-snapshot.md)
- [Delete Aws Iam Role Binding](https://docs.fireworks.ai/api-reference-dlde/delete-aws-iam-role-binding.md)
- [Delete Batch Job](https://docs.fireworks.ai/api-reference-dlde/delete-batch-job.md)
- [Delete Cluster](https://docs.fireworks.ai/api-reference-dlde/delete-cluster.md)
- [Delete Environment](https://docs.fireworks.ai/api-reference-dlde/delete-environment.md)
- [Delete Node Pool](https://docs.fireworks.ai/api-reference-dlde/delete-node-pool.md)
- [Delete Node Pool Binding](https://docs.fireworks.ai/api-reference-dlde/delete-node-pool-binding.md)
- [Delete Snapshot](https://docs.fireworks.ai/api-reference-dlde/delete-snapshot.md)
- [Disconnect Environment](https://docs.fireworks.ai/api-reference-dlde/disconnect-environment.md): Disconnects the environment from the node pool. Returns an error if the environment is not connected to a node pool.
- [Get Batch Job](https://docs.fireworks.ai/api-reference-dlde/get-batch-job.md)
- [Get Batch Job Logs](https://docs.fireworks.ai/api-reference-dlde/get-batch-job-logs.md)
- [Get Cluster](https://docs.fireworks.ai/api-reference-dlde/get-cluster.md)
- [Get Cluster Connection Info](https://docs.fireworks.ai/api-reference-dlde/get-cluster-connection-info.md): Retrieve connection settings for the cluster to be put in kubeconfig
- [Get Environment](https://docs.fireworks.ai/api-reference-dlde/get-environment.md)
- [Get Node Pool](https://docs.fireworks.ai/api-reference-dlde/get-node-pool.md)
- [Get Node Pool Stats](https://docs.fireworks.ai/api-reference-dlde/get-node-pool-stats.md)
- [Get Snapshot](https://docs.fireworks.ai/api-reference-dlde/get-snapshot.md)
- [List Aws Iam Role Bindings](https://docs.fireworks.ai/api-reference-dlde/list-aws-iam-role-bindings.md)
- [List Batch Jobs](https://docs.fireworks.ai/api-reference-dlde/list-batch-jobs.md)
- [List Clusters](https://docs.fireworks.ai/api-reference-dlde/list-clusters.md)
- [List Environments](https://docs.fireworks.ai/api-reference-dlde/list-environments.md)
- [List Node Pool Bindings](https://docs.fireworks.ai/api-reference-dlde/list-node-pool-bindings.md)
- [List Node Pools](https://docs.fireworks.ai/api-reference-dlde/list-node-pools.md)
- [List Snapshots](https://docs.fireworks.ai/api-reference-dlde/list-snapshots.md)
- [Update Batch Job](https://docs.fireworks.ai/api-reference-dlde/update-batch-job.md)
- [Update Cluster](https://docs.fireworks.ai/api-reference-dlde/update-cluster.md)
- [Update Environment](https://docs.fireworks.ai/api-reference-dlde/update-environment.md)
- [Update Node Pool](https://docs.fireworks.ai/api-reference-dlde/update-node-pool.md)
- [Streaming Transcription](https://docs.fireworks.ai/api-reference/audio-streaming-transcriptions.md)
- [Transcribe audio](https://docs.fireworks.ai/api-reference/audio-transcriptions.md)
- [Translate audio](https://docs.fireworks.ai/api-reference/audio-translations.md)
- [Create API Key](https://docs.fireworks.ai/api-reference/create-api-key.md)
- [Create Batch Request](https://docs.fireworks.ai/api-reference/create-batch-request.md)
- [Create Dataset](https://docs.fireworks.ai/api-reference/create-dataset.md)
- [Load LoRA](https://docs.fireworks.ai/api-reference/create-deployed-model.md)
- [Create Deployment](https://docs.fireworks.ai/api-reference/create-deployment.md)
- [Create Model](https://docs.fireworks.ai/api-reference/create-model.md)
- [Create Reinforcement Fine-tuning Job](https://docs.fireworks.ai/api-reference/create-reinforcement-fine-tuning-job.md)
- [Create Supervised Fine-tuning Job](https://docs.fireworks.ai/api-reference/create-supervised-fine-tuning-job.md)
- [Create User](https://docs.fireworks.ai/api-reference/create-user.md)
- [Create embeddings](https://docs.fireworks.ai/api-reference/creates-an-embedding-vector-representing-the-input-text.md)
- [Delete API Key](https://docs.fireworks.ai/api-reference/delete-api-key.md)
- [Delete Dataset](https://docs.fireworks.ai/api-reference/delete-dataset.md)
- [Unload LoRA](https://docs.fireworks.ai/api-reference/delete-deployed-model.md)
- [Delete Deployment](https://docs.fireworks.ai/api-reference/delete-deployment.md)
- [Delete Model](https://docs.fireworks.ai/api-reference/delete-model.md)
- [Delete Reinforcement Fine-tuning Job](https://docs.fireworks.ai/api-reference/delete-reinforcement-fine-tuning-job.md)
- [Delete Supervised Fine-tuning Job](https://docs.fireworks.ai/api-reference/delete-supervised-fine-tuning-job.md)
- [Generate an image with FLUX.1 [schnell] FP8](https://docs.fireworks.ai/api-reference/generate-a-new-image-from-a-text-prompt.md)
- [Generate or edit an image with FLUX.1 Kontext](https://docs.fireworks.ai/api-reference/generate-or-edit-image-using-flux-kontext.md)
- [Get Account](https://docs.fireworks.ai/api-reference/get-account.md)
- [Check Batch Status](https://docs.fireworks.ai/api-reference/get-batch-status.md)
- [Get Dataset](https://docs.fireworks.ai/api-reference/get-dataset.md)
- [Get Dataset Upload Endpoint](https://docs.fireworks.ai/api-reference/get-dataset-upload-endpoint.md)
- [Get LoRA](https://docs.fireworks.ai/api-reference/get-deployed-model.md)
- [Get Deployment](https://docs.fireworks.ai/api-reference/get-deployment.md)
- [Get generated image from FLUX.1 Kontext](https://docs.fireworks.ai/api-reference/get-generated-image-from-flux-kontex.md)
- [Get Model](https://docs.fireworks.ai/api-reference/get-model.md)
- [Get Model Download Endpoint](https://docs.fireworks.ai/api-reference/get-model-download-endpoint.md)
- [Get Model Upload Endpoint](https://docs.fireworks.ai/api-reference/get-model-upload-endpoint.md)
- [Get Reinforcement Fine-tuning Job](https://docs.fireworks.ai/api-reference/get-reinforcement-fine-tuning-job.md)
- [Get Supervised Fine-tuning Job](https://docs.fireworks.ai/api-reference/get-supervised-fine-tuning-job.md)
- [Get User](https://docs.fireworks.ai/api-reference/get-user.md)
- [Introduction](https://docs.fireworks.ai/api-reference/introduction.md)
- [List API Keys](https://docs.fireworks.ai/api-reference/list-api-keys.md)
- [List Datasets](https://docs.fireworks.ai/api-reference/list-datasets.md)
- [List LoRAs](https://docs.fireworks.ai/api-reference/list-deployed-models.md)
- [List Deployments](https://docs.fireworks.ai/api-reference/list-deployments.md)
- [List Models](https://docs.fireworks.ai/api-reference/list-models.md)
- [List Reinforcement Fine-tuning Jobs](https://docs.fireworks.ai/api-reference/list-reinforcement-fine-tuning-jobs.md)
- [List Supervised Fine-tuning Jobs](https://docs.fireworks.ai/api-reference/list-supervised-fine-tuning-jobs.md)
- [List Users](https://docs.fireworks.ai/api-reference/list-users.md)
- [Create Chat Completion](https://docs.fireworks.ai/api-reference/post-chatcompletions.md): Creates a model response for the given chat conversation.
- [Create Completion](https://docs.fireworks.ai/api-reference/post-completions.md): Creates a completion for the provided prompt and parameters.
- [Create a model response](https://docs.fireworks.ai/api-reference/post-responses.md): Creates a model response, optionally interacting with custom tools via the Model Context Protocol (MCP). This endpoint supports conversational continuation and streaming. Explore our cookbooks for detailed examples:
  - [Basic MCP Usage](https://github.com/fw-ai/cookbook/blob/main/learn/response-api/fireworks_mcp_examples.ipynb)
  - [Streaming with MCP](https://github.com/fw-ai/cookbook/blob/main/learn/response-api/fireworks_mcp_with_streaming.ipynb)
  - [Conversational History with `previous_response_id`](https://github.com/fw-ai/cookbook/blob/main/learn/response-api/fireworks_previous_response_cookbook.ipynb)
  - [Basic Streaming](https://github.com/fw-ai/cookbook/blob/main/learn/response-api/fireworks_streaming_example.ipynb)
  - [Controlling Response Storage](https://github.com/fw-ai/cookbook/blob/main/learn/response-api/mcp_server_with_store_false_argument.ipynb)
- [Prepare Model for different precisions](https://docs.fireworks.ai/api-reference/prepare-model.md)
- [Undelete Deployment](https://docs.fireworks.ai/api-reference/undelete-deployment.md)
- [Update Dataset](https://docs.fireworks.ai/api-reference/update-dataset.md)
- [Update LoRA](https://docs.fireworks.ai/api-reference/update-deployed-model.md)
- [Update Deployment](https://docs.fireworks.ai/api-reference/update-deployment.md)
- [Update Model](https://docs.fireworks.ai/api-reference/update-model.md)
- [Update User](https://docs.fireworks.ai/api-reference/update-user.md)
- [Upload Dataset Files](https://docs.fireworks.ai/api-reference/upload-dataset-files.md): Provides a streamlined way to upload a dataset file in a single API request. This path can handle file sizes up to 150 MB. For larger files, use [Get Dataset Upload Endpoint](get-dataset-upload-endpoint).
- [Validate Dataset Upload](https://docs.fireworks.ai/api-reference/validate-dataset-upload.md)
- [Validate Model Upload](https://docs.fireworks.ai/api-reference/validate-model-upload.md)
- [Direct routing](https://docs.fireworks.ai/deployments/direct-routing.md): Direct routing enables enterprise users to reduce latency to their deployments.
- [Regions](https://docs.fireworks.ai/deployments/regions.md): Fireworks runs a global fleet of hardware on which you can deploy your models.
- [Reserved capacity](https://docs.fireworks.ai/deployments/reservations.md)
- [Hugging Face](https://docs.fireworks.ai/ecosystem/integrations/hugging-face.md): Learn how developers can integrate and use Fireworks.ai inference capabilities via the Hugging Face ecosystem.
- [null](https://docs.fireworks.ai/evaluators/api_reference/api_overview.md)
- [null](https://docs.fireworks.ai/evaluators/api_reference/data_models.md)
- [null](https://docs.fireworks.ai/evaluators/api_reference/reward_function_class.md)
- [null](https://docs.fireworks.ai/evaluators/api_reference/reward_function_decorator.md)
- [null](https://docs.fireworks.ai/evaluators/cli_reference/cli_overview.md)
- [null](https://docs.fireworks.ai/evaluators/developer_guide/agent_evaluation.md)
- [null](https://docs.fireworks.ai/evaluators/developer_guide/core_data_types.md)
- [null](https://docs.fireworks.ai/evaluators/developer_guide/evaluation_workflows.md)
- [null](https://docs.fireworks.ai/evaluators/developer_guide/getting_started.md)
- [null](https://docs.fireworks.ai/evaluators/developer_guide/reward_function_anatomy.md)
- [null](https://docs.fireworks.ai/evaluators/documentation_home.md)
- [null](https://docs.fireworks.ai/evaluators/examples/accuracy_length/accuracy_length_overview.md)
- [null](https://docs.fireworks.ai/evaluators/examples/advanced_examples/advanced_reward_functions.md)
- [null](https://docs.fireworks.ai/evaluators/examples/advanced_examples/code_execution_with_e2b.md)
- [null](https://docs.fireworks.ai/evaluators/examples/advanced_examples/math_evaluation.md)
- [null](https://docs.fireworks.ai/evaluators/examples/apps_coding_example.md)
- [null](https://docs.fireworks.ai/evaluators/examples/basic_examples/basic_reward_function.md)
- [null](https://docs.fireworks.ai/evaluators/examples/basic_examples/reward_functions_overview.md)
- [null](https://docs.fireworks.ai/evaluators/examples/e2b/e2b_code_execution_overview.md)
- [null](https://docs.fireworks.ai/evaluators/examples/examples_overview.md)
- [null](https://docs.fireworks.ai/evaluators/examples/gcp_cloud_run_deployment_example.md)
- [null](https://docs.fireworks.ai/evaluators/examples/math_with_formatting_example.md)
- [null](https://docs.fireworks.ai/evaluators/examples/tool_calling_example.md)
- [How do I close my Fireworks.ai account?](https://docs.fireworks.ai/faq-new/account-access/how-do-i-close-my-fireworksai-account.md)
- [I have multiple Fireworks accounts. When I try to login with Google on Fireworks' web UI, I'm getting signed into the wrong account. How do I fix this?](https://docs.fireworks.ai/faq-new/account-access/i-have-multiple-fireworks-accounts-when-i-try-to-login-with-google-on-fireworks.md)
- [What email does GitHub authentication use?](https://docs.fireworks.ai/faq-new/account-access/what-email-does-github-authentication-use.md)
- [What email does LinkedIn authentication use?](https://docs.fireworks.ai/faq-new/account-access/what-email-does-linkedin-authentication-use.md)
- [What should I do if I can't access my company account after being invited when I already have a personal account?](https://docs.fireworks.ai/faq-new/account-access/what-should-i-do-if-i-cant-access-my-company-account-after-being-invited-when-i.md)
- [Are calls to the Models API billable?](https://docs.fireworks.ai/faq-new/billing-pricing/are-calls-to-the-models-api-billable.md)
- [Are there discounts for bulk spend on serverless deployments?](https://docs.fireworks.ai/faq-new/billing-pricing/are-there-discounts-for-bulk-spend-on-serverless-deployments.md)
- [Are there discounts for bulk usage?](https://docs.fireworks.ai/faq-new/billing-pricing/are-there-discounts-for-bulk-usage.md)
- [Are there extra fees for serving fine-tuned models?](https://docs.fireworks.ai/faq-new/billing-pricing/are-there-extra-fees-for-serving-fine-tuned-models.md)
- [How does billing and credit usage work?](https://docs.fireworks.ai/faq-new/billing-pricing/how-does-billing-and-credit-usage-work.md)
- [How much does Fireworks cost?](https://docs.fireworks.ai/faq-new/billing-pricing/how-much-does-fireworks-cost.md)
- [I bought credits but don’t see them reflected in my account. Did they disappear?](https://docs.fireworks.ai/faq-new/billing-pricing/i-bought-credits-but-dont-see-them-reflected-in-my-account-did-they-disappear.md)
- [Where's my receipt for purchased credits?](https://docs.fireworks.ai/faq-new/billing-pricing/wheres-my-receipt-for-purchased-credits.md)
- [Why did I receive an invoice when I only deposited credits?](https://docs.fireworks.ai/faq-new/billing-pricing/why-did-i-receive-an-invoice-when-i-only-deposited-credits.md)
- [Why might my account be suspended even with remaining credits?](https://docs.fireworks.ai/faq-new/billing-pricing/why-might-my-account-be-suspended-even-with-remaining-credits.md)
- [Are there any quotas for serverless?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/are-there-any-quotas-for-serverless.md)
- [Are there any SLAs for serverless models?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/are-there-any-slas-for-serverless-models.md)
- [Are there costs associated with deploying fine-tuned models to serverless infrastructure?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/are-there-costs-associated-with-deploying-fine-tuned-models-to-serverless-infras.md)
- [Do you provide notice before removing model availability?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/do-you-provide-notice-before-removing-model-availability.md)
- [Do you support Auto Scaling?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/do-you-support-auto-scaling.md)
- [How can we benchmark?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/how-can-we-benchmark.md)
- [How does autoscaling affect my costs?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/how-does-autoscaling-affect-my-costs.md)
- [How does billing and scaling work for on-demand GPU deployments?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/how-does-billing-and-scaling-work-for-on-demand-gpu-deployments.md)
- [How does billing work for on-demand deployments?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/how-does-billing-work-for-on-demand-deployments.md)
- [How does the system scale?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/how-does-the-system-scale.md)
- [How can I optimize latency for single replica deployments?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/how-to-reduce-latency-for-deployment-on-single-replica.md)
- [I have more specific performance questions about improvements](https://docs.fireworks.ai/faq-new/deployment-infrastructure/i-have-more-specific-performance-questions-about-improvements.md)
- [Is latency guaranteed for serverless models?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/is-latency-guaranteed-for-serverless-models.md)
- [What are the best practices for optimizing performance?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/what-are-the-best-practices-for-optimizing-performance.md)
- [What are the common issues when deploying custom models?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/what-are-the-common-issues-when-deploying-custom-models.md)
- [What are the rate limits for on-demand deployments?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/what-are-the-rate-limits-for-on-demand-deployments.md)
- [What are the techniques to improve performance?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/what-are-the-techniques-to-improve-performance.md)
- [What factors affect model latency and performance?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/what-factors-affect-model-latency-and-performance.md)
- [What factors affect the number of simultaneous requests that can be handled?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/what-factors-affect-the-number-of-simultaneous-requests-that-can-be-handled.md)
- [What should I expect for deployment and scaling performance?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/what-should-i-expect-for-deployment-and-scaling-performance.md)
- [What’s the latency for small, medium, and large LLM models?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/whats-the-latency-for-small-medium-and-large-llm-models.md)
- [What’s the supported throughput?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/whats-the-supported-throughput.md)
- [Which accelerator/GPU should I use?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/which-acceleratorgpu-should-i-use.md)
- [Why am I experiencing request timeout errors and slow response times with serverless LLM models?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/why-am-i-experiencing-request-timeout-errors-and-slow-response-times-with-server.md)
- [Does Fireworks offer a fine-tuning service?](https://docs.fireworks.ai/faq-new/fine-tuning/does-fireworks-offer-a-fine-tuning-service.md)
- [What models are supported for fine-tuning? Is Llama 3 supported for fine-tuning?](https://docs.fireworks.ai/faq-new/fine-tuning/what-models-are-supported-for-fine-tuning-is-llama-3-supported-for-fine-tuning.md)
- [Why am I getting "invalid id" errors when using firectl commands like create deployment or list deployments?](https://docs.fireworks.ai/faq-new/fine-tuning/why-am-i-getting-invalid-id-errors-when-using-firectl-commands-like-create-deplo.md)
- [Why am I getting "Model not found" errors when trying to access my fine-tuned model?](https://docs.fireworks.ai/faq-new/fine-tuning/why-am-i-getting-model-not-found-errors-when-trying-to-access-my-fine-tuned-mode.md)
- [Why can’t I deploy my fine-tuned Llama 3.1 LoRA adapter?](https://docs.fireworks.ai/faq-new/fine-tuning/why-cant-i-deploy-my-fine-tuned-llama-31-lora-adapter.md)
- [Can I create custom LoRA models with FLUX?](https://docs.fireworks.ai/faq-new/models-inference/can-i-create-custom-lora-models-with-flux.md)
- [Can I generate multiple images in a single API call using FLUX serverless?](https://docs.fireworks.ai/faq-new/models-inference/can-i-generate-multiple-images-in-a-single-api-call-using-flux-serverless.md)
- [Can safety filters or content restrictions be disabled on text generation models?](https://docs.fireworks.ai/faq-new/models-inference/can-safety-filters-or-content-restrictions-be-disabled-on-text-generation-models.md)
- [Does Fireworks support custom base models?](https://docs.fireworks.ai/faq-new/models-inference/does-fireworks-support-custom-base-models.md)
- [Does FLUX support image-to-image generation?](https://docs.fireworks.ai/faq-new/models-inference/does-flux-support-image-to-image-generation.md)
- [Does the API support batching and load balancing?](https://docs.fireworks.ai/faq-new/models-inference/does-the-api-support-batching-and-load-balancing.md)
- [How do I control output image sizes when using SDXL ControlNet?](https://docs.fireworks.ai/faq-new/models-inference/how-do-i-control-output-image-sizes-when-using-sdxl-controlnet.md)
- [How to check if a model is available on serverless?](https://docs.fireworks.ai/faq-new/models-inference/how-to-check-if-a-model-is-available-on-serverless.md)
- [There’s a model I would like to use that isn’t available on Fireworks. Can I request it?](https://docs.fireworks.ai/faq-new/models-inference/theres-a-model-i-would-like-to-use-that-isnt-available-on-fireworks-can-i-reques.md)
- [What are the maximum completion token limits for models, and can they be increased?](https://docs.fireworks.ai/faq-new/models-inference/what-are-the-maximum-completion-token-limits-for-models-and-can-they-be-increase.md)
- [What factors affect the number of simultaneous requests that can be handled?](https://docs.fireworks.ai/faq-new/models-inference/what-factors-affect-the-number-of-simultaneous-requests-that-can-be-handled.md)
- [What quantization format is used for the Llama 3.1 405B model?](https://docs.fireworks.ai/faq-new/models-inference/what-quantization-format-is-used-for-the-llama-31-405b-model.md)
- [Do you provide private connections?](https://docs.fireworks.ai/faq-new/security-compliance/do-you-provide-private-connections.md)
- [Do you put any guardrails before any LLM models?](https://docs.fireworks.ai/faq-new/security-compliance/do-you-put-any-guardrails-before-any-llm-models.md)
- [Does Fireworks provide client-side encryption or allow customers to bring their own encryption keys?](https://docs.fireworks.ai/faq-new/security-compliance/does-fireworks-provide-client-side-encryption-or-allow-customers-to-bring-their.md)
- [How is data encrypted at rest?](https://docs.fireworks.ai/faq-new/security-compliance/how-is-data-encrypted-at-rest.md)
- [How is data encrypted in transit?](https://docs.fireworks.ai/faq-new/security-compliance/how-is-data-encrypted-in-transit.md)
- [What type of certifications do you have?](https://docs.fireworks.ai/faq-new/security-compliance/what-type-of-certifications-do-you-have.md)
- [Where can I find more information about your security policies?](https://docs.fireworks.ai/faq-new/security-compliance/where-can-i-find-more-information-about-your-security-policies.md)
- [Are there any quotas for Enterprise Tier?](https://docs.fireworks.ai/faq-new/support-general/are-there-any-quotas-for-enterprise-tier.md)
- [Do you have a shared Slack channel?](https://docs.fireworks.ai/faq-new/support-general/do-you-have-a-shared-slack-channel.md)
- [Do you host your deployments in the EU or Asia?](https://docs.fireworks.ai/faq-new/support-general/do-you-host-your-deployments-in-the-eu-or-asia.md)
- [How does Support work?](https://docs.fireworks.ai/faq-new/support-general/how-does-support-work.md)
- [I have another question or issue.](https://docs.fireworks.ai/faq-new/support-general/i-have-another-question-or-issue.md)
- [I have specific performance questions or want to know about further performance improvement options.](https://docs.fireworks.ai/faq-new/support-general/i-have-specific-performance-questions-or-want-to-know-about-further-performance.md)
- [If you're an Enterprise customer, how do you contact support?](https://docs.fireworks.ai/faq-new/support-general/if-youre-an-enterprise-customer-how-do-you-contact-support.md)
- [What are the support tiers and SLAs for enterprise?](https://docs.fireworks.ai/faq-new/support-general/what-are-the-support-tiers-and-slas-for-enterprise.md)
- [What support options exist?](https://docs.fireworks.ai/faq-new/support-general/what-support-options-exist.md)
- [Importing fine-tuned models](https://docs.fireworks.ai/fine-tuning/fine-tuned-import.md)
- [External GCS Bucket Integration](https://docs.fireworks.ai/fine-tuning/fine-tuning-extrenal-dataset.md): Use external Google Cloud Storage buckets for fine-tuning while keeping your data private with secure, isolated access
- [Supervised fine-tuning (SFT)](https://docs.fireworks.ai/fine-tuning/fine-tuning-models.md)
- [Introduction to fine-tuning](https://docs.fireworks.ai/fine-tuning/finetuning-intro.md)
- [Using multi-LoRA](https://docs.fireworks.ai/fine-tuning/multi-lora.md)
- [Reinforcement fine-tuning (RFT)](https://docs.fireworks.ai/fine-tuning/reinforcement-fine-tuning-models.md)
- [Evaluators (RewardKit)](https://docs.fireworks.ai/fine-tuning/reward-kit.md)
- [Concepts](https://docs.fireworks.ai/getting-started/concepts.md): This document outlines basic Fireworks AI concepts.
- [Fireworks AI Developer Platform](https://docs.fireworks.ai/getting-started/introduction.md): Start building with open source AI models
- [Quickstart](https://docs.fireworks.ai/getting-started/quickstart.md): Get started in minutes with an OpenAI-compatible endpoint
- [Using function-calling](https://docs.fireworks.ai/guides/function-calling.md)
- [Introduction](https://docs.fireworks.ai/guides/inference-introduction.md)
- [On-demand deployments](https://docs.fireworks.ai/guides/ondemand-deployments.md)
- [Using predicted outputs](https://docs.fireworks.ai/guides/predicted-outputs.md): Use Predicted Outputs to speed up output generation for editing and rewriting use cases
- [Prompt caching](https://docs.fireworks.ai/guides/prompt-caching.md)
- [Querying transcription models](https://docs.fireworks.ai/guides/querying-asr-models.md)
- [Querying embedding models](https://docs.fireworks.ai/guides/querying-embeddings-models.md)
- [Querying text models](https://docs.fireworks.ai/guides/querying-text-models.md)
- [Querying vision-language models](https://docs.fireworks.ai/guides/querying-vision-language-models.md)
- [Rate limits, spend limits and quotas](https://docs.fireworks.ai/guides/quotas_usage/rate-limits.md): Rate limits, spend limits and quotas for serverless inference and on-demand deployments
- [Recommended open models](https://docs.fireworks.ai/guides/recommended-models.md): A list of recommended open models for common use cases
- [Response API](https://docs.fireworks.ai/guides/response-api.md)
- [Data privacy & security](https://docs.fireworks.ai/guides/security_compliance/data_handling.md): How we secure and handle your data
- [Voice agent platform](https://docs.fireworks.ai/guides/voice-agents-preview.md): Instructions for using test voice agent endpoints
- [null](https://docs.fireworks.ai/models/quantization.md)
- [Uploading a custom base model](https://docs.fireworks.ai/models/uploading-custom-models.md)
- [Using grammar mode](https://docs.fireworks.ai/structured-responses/structured-output-grammar-based.md)
- [Using JSON mode](https://docs.fireworks.ai/structured-responses/structured-response-formatting.md)
- [Authentication](https://docs.fireworks.ai/tools-sdks/firectl/commands/authentication.md): Authentication for access to your account
- [Create a dataset](https://docs.fireworks.ai/tools-sdks/firectl/commands/create-dataset.md): Create a Dataset on the Fireworks platform
- [Create a deployment](https://docs.fireworks.ai/tools-sdks/firectl/commands/create-deployment.md): Create a Deployment on the Fireworks AI platform
- [Create a fine-tuning job](https://docs.fireworks.ai/tools-sdks/firectl/commands/create-finetune-job.md): Create a fine-tuning job with a base model
- [Create model](https://docs.fireworks.ai/tools-sdks/firectl/commands/create-model.md): Create a model on the Fireworks platform
- [Delete resources](https://docs.fireworks.ai/tools-sdks/firectl/commands/delete-resources.md): Deletes one or more resources in a Fireworks account
- [Download a model](https://docs.fireworks.ai/tools-sdks/firectl/commands/download-model.md): Download a model from third-party locations
- [Get resources](https://docs.fireworks.ai/tools-sdks/firectl/commands/get-resources.md): Retrieves information from the Fireworks platform
- [List resources](https://docs.fireworks.ai/tools-sdks/firectl/commands/list-resources.md): List various resources in a Fireworks account
- [Load LoRA](https://docs.fireworks.ai/tools-sdks/firectl/commands/load-lora.md): Load a LoRA model to a deployment.
- [Undelete resources](https://docs.fireworks.ai/tools-sdks/firectl/commands/undelete-resources.md): Undelete resources on the Fireworks platform
- [Unload LoRA](https://docs.fireworks.ai/tools-sdks/firectl/commands/unload-lora.md): Unload a LoRA model from a deployment.
- [Update resources](https://docs.fireworks.ai/tools-sdks/firectl/commands/update-resources.md): Updates resources on the Fireworks platform
- [Getting started](https://docs.fireworks.ai/tools-sdks/firectl/firectl.md): Learn to create, deploy, and manage resources using Firectl
- [OpenAI compatibility](https://docs.fireworks.ai/tools-sdks/openai-compatibility.md)
- [Developing Evaluators](https://docs.fireworks.ai/tools-sdks/python-client/developing-evaluators.md)
- [Basics of the Build SDK](https://docs.fireworks.ai/tools-sdks/python-client/sdk-basics.md)
- [Introducing the Fireworks Build SDK](https://docs.fireworks.ai/tools-sdks/python-client/sdk-introduction.md)
- [Reference](https://docs.fireworks.ai/tools-sdks/python-client/sdk-reference.md)
- [Tutorial](https://docs.fireworks.ai/tools-sdks/python-client/the-tutorial.md)
- [Troubleshooting inference errors](https://docs.fireworks.ai/troubleshooting/status_error_codes/inference_error_code.md): This page lists common error codes encountered during inference requests using the Fireworks API, their meanings, and potential resolutions.
- [Changelog](https://docs.fireworks.ai/updates/changelog.md)