# Fireworks AI Docs ## Docs - [Service Accounts](https://docs.fireworks.ai/accounts/service-accounts.md): How to manage and use service accounts in Fireworks - [Custom SSO](https://docs.fireworks.ai/accounts/sso.md): Set up custom Single Sign-On (SSO) authentication for Fireworks AI - [Managing users](https://docs.fireworks.ai/accounts/users.md): Add and delete additional users in your Fireworks account - [Batch Delete Batch Jobs](https://docs.fireworks.ai/api-reference-dlde/batch-delete-batch-jobs.md) - [Batch Delete Environments](https://docs.fireworks.ai/api-reference-dlde/batch-delete-environments.md) - [Batch Delete Node Pools](https://docs.fireworks.ai/api-reference-dlde/batch-delete-node-pools.md) - [Cancel Batch Job](https://docs.fireworks.ai/api-reference-dlde/cancel-batch-job.md): Cancels an existing batch job if it is queued, pending, or running. - [Connect Environment](https://docs.fireworks.ai/api-reference-dlde/connect-environment.md): Connects the environment to a node pool. Returns an error if there is an existing pending connection. - [Create Aws Iam Role Binding](https://docs.fireworks.ai/api-reference-dlde/create-aws-iam-role-binding.md) - [Create Batch Job](https://docs.fireworks.ai/api-reference-dlde/create-batch-job.md) - [Create Cluster](https://docs.fireworks.ai/api-reference-dlde/create-cluster.md) - [Create Environment](https://docs.fireworks.ai/api-reference-dlde/create-environment.md) - [Create Node Pool](https://docs.fireworks.ai/api-reference-dlde/create-node-pool.md) - [Create Node Pool Binding](https://docs.fireworks.ai/api-reference-dlde/create-node-pool-binding.md) - [Create Snapshot](https://docs.fireworks.ai/api-reference-dlde/create-snapshot.md) - [Delete Aws Iam Role Binding](https://docs.fireworks.ai/api-reference-dlde/delete-aws-iam-role-binding.md) - [Delete Batch Job](https://docs.fireworks.ai/api-reference-dlde/delete-batch-job.md) - [Delete Cluster](https://docs.fireworks.ai/api-reference-dlde/delete-cluster.md) - [Delete Environment](https://docs.fireworks.ai/api-reference-dlde/delete-environment.md) - [Delete Node Pool](https://docs.fireworks.ai/api-reference-dlde/delete-node-pool.md) - [Delete Node Pool Binding](https://docs.fireworks.ai/api-reference-dlde/delete-node-pool-binding.md) - [Delete Snapshot](https://docs.fireworks.ai/api-reference-dlde/delete-snapshot.md) - [Disconnect Environment](https://docs.fireworks.ai/api-reference-dlde/disconnect-environment.md): Disconnects the environment from the node pool. Returns an error if the environment is not connected to a node pool. - [Get Batch Job](https://docs.fireworks.ai/api-reference-dlde/get-batch-job.md) - [Get Batch Job Logs](https://docs.fireworks.ai/api-reference-dlde/get-batch-job-logs.md) - [Get Cluster](https://docs.fireworks.ai/api-reference-dlde/get-cluster.md) - [Get Cluster Connection Info](https://docs.fireworks.ai/api-reference-dlde/get-cluster-connection-info.md): Retrieve connection settings for the cluster to be put in kubeconfig - [Get Environment](https://docs.fireworks.ai/api-reference-dlde/get-environment.md) - [Get Node Pool](https://docs.fireworks.ai/api-reference-dlde/get-node-pool.md) - [Get Node Pool Stats](https://docs.fireworks.ai/api-reference-dlde/get-node-pool-stats.md) - [Get Snapshot](https://docs.fireworks.ai/api-reference-dlde/get-snapshot.md) - [List Aws Iam Role Bindings](https://docs.fireworks.ai/api-reference-dlde/list-aws-iam-role-bindings.md) - [List Batch Jobs](https://docs.fireworks.ai/api-reference-dlde/list-batch-jobs.md) - [List Clusters](https://docs.fireworks.ai/api-reference-dlde/list-clusters.md) - [List Environments](https://docs.fireworks.ai/api-reference-dlde/list-environments.md) - [List Node Pool Bindings](https://docs.fireworks.ai/api-reference-dlde/list-node-pool-bindings.md) - [List Node Pools](https://docs.fireworks.ai/api-reference-dlde/list-node-pools.md) - [List Snapshots](https://docs.fireworks.ai/api-reference-dlde/list-snapshots.md) - [Update Batch Job](https://docs.fireworks.ai/api-reference-dlde/update-batch-job.md) - [Update Cluster](https://docs.fireworks.ai/api-reference-dlde/update-cluster.md) - [Update Environment](https://docs.fireworks.ai/api-reference-dlde/update-environment.md) - [Update Node Pool](https://docs.fireworks.ai/api-reference-dlde/update-node-pool.md) - [Streaming Transcription](https://docs.fireworks.ai/api-reference/audio-streaming-transcriptions.md) - [Transcribe audio](https://docs.fireworks.ai/api-reference/audio-transcriptions.md) - [Translate audio](https://docs.fireworks.ai/api-reference/audio-translations.md) - [Create API Key](https://docs.fireworks.ai/api-reference/create-api-key.md) - [Create Batch Inference Job](https://docs.fireworks.ai/api-reference/create-batch-inference-job.md) - [Create Batch Request](https://docs.fireworks.ai/api-reference/create-batch-request.md) - [Create Dataset](https://docs.fireworks.ai/api-reference/create-dataset.md) - [Load LoRA](https://docs.fireworks.ai/api-reference/create-deployed-model.md) - [Create Deployment](https://docs.fireworks.ai/api-reference/create-deployment.md) - [Create Model](https://docs.fireworks.ai/api-reference/create-model.md) - [Create Reinforcement Fine-tuning Job](https://docs.fireworks.ai/api-reference/create-reinforcement-fine-tuning-job.md) - [Create Supervised Fine-tuning Job](https://docs.fireworks.ai/api-reference/create-supervised-fine-tuning-job.md) - [Create User](https://docs.fireworks.ai/api-reference/create-user.md) - [Create embeddings](https://docs.fireworks.ai/api-reference/creates-an-embedding-vector-representing-the-input-text.md) - [Delete API Key](https://docs.fireworks.ai/api-reference/delete-api-key.md) - [Delete Batch Inference Job](https://docs.fireworks.ai/api-reference/delete-batch-inference-job.md) - [Delete Dataset](https://docs.fireworks.ai/api-reference/delete-dataset.md) - [Unload LoRA](https://docs.fireworks.ai/api-reference/delete-deployed-model.md) - [Delete Deployment](https://docs.fireworks.ai/api-reference/delete-deployment.md) - [Delete Model](https://docs.fireworks.ai/api-reference/delete-model.md) - [Delete Reinforcement Fine-tuning Job](https://docs.fireworks.ai/api-reference/delete-reinforcement-fine-tuning-job.md) - [Delete a model response](https://docs.fireworks.ai/api-reference/delete-response.md): Deletes a model response by its ID. Once deleted, the response data will be gone immediately and permanently. The response cannot be recovered and any conversations that reference this response ID will no longer be able to access it. - [Delete Supervised Fine-tuning Job](https://docs.fireworks.ai/api-reference/delete-supervised-fine-tuning-job.md) - [Generate an image with FLUX.1 [schnell] FP8](https://docs.fireworks.ai/api-reference/generate-a-new-image-from-a-text-prompt.md) - [Generate or edit an image with FLUX.1 Kontext](https://docs.fireworks.ai/api-reference/generate-or-edit-image-using-flux-kontext.md) - [Get Account](https://docs.fireworks.ai/api-reference/get-account.md) - [Get Batch Inference Job](https://docs.fireworks.ai/api-reference/get-batch-inference-job.md) - [Check Batch Status](https://docs.fireworks.ai/api-reference/get-batch-status.md) - [Get Dataset](https://docs.fireworks.ai/api-reference/get-dataset.md) - [Get Dataset Upload Endpoint](https://docs.fireworks.ai/api-reference/get-dataset-upload-endpoint.md) - [Get LoRA](https://docs.fireworks.ai/api-reference/get-deployed-model.md) - [Get Deployment](https://docs.fireworks.ai/api-reference/get-deployment.md) - [Get generated image from FLUX.1 Kontext](https://docs.fireworks.ai/api-reference/get-generated-image-from-flux-kontex.md) - [Get Model](https://docs.fireworks.ai/api-reference/get-model.md) - [Get Model Download Endpoint](https://docs.fireworks.ai/api-reference/get-model-download-endpoint.md) - [Get Model Upload Endpoint](https://docs.fireworks.ai/api-reference/get-model-upload-endpoint.md) - [Get Reinforcement Fine-tuning Job](https://docs.fireworks.ai/api-reference/get-reinforcement-fine-tuning-job.md) - [Get Supervised Fine-tuning Job](https://docs.fireworks.ai/api-reference/get-supervised-fine-tuning-job.md) - [Get User](https://docs.fireworks.ai/api-reference/get-user.md) - [Introduction](https://docs.fireworks.ai/api-reference/introduction.md) - [List API Keys](https://docs.fireworks.ai/api-reference/list-api-keys.md) - [List Batch Inference Jobs](https://docs.fireworks.ai/api-reference/list-batch-inference-jobs.md) - [List Datasets](https://docs.fireworks.ai/api-reference/list-datasets.md) - [List LoRAs](https://docs.fireworks.ai/api-reference/list-deployed-models.md) - [List Deployments](https://docs.fireworks.ai/api-reference/list-deployments.md) - [List Models](https://docs.fireworks.ai/api-reference/list-models.md) - [List Reinforcement Fine-tuning Jobs](https://docs.fireworks.ai/api-reference/list-reinforcement-fine-tuning-jobs.md) - [List Supervised Fine-tuning Jobs](https://docs.fireworks.ai/api-reference/list-supervised-fine-tuning-jobs.md) - [List Users](https://docs.fireworks.ai/api-reference/list-users.md) - [Create Chat Completion](https://docs.fireworks.ai/api-reference/post-chatcompletions.md): Creates a model response for the given chat conversation. - [Create Completion](https://docs.fireworks.ai/api-reference/post-completions.md): Creates a completion for the provided prompt and parameters. - [Create a model response](https://docs.fireworks.ai/api-reference/post-responses.md): Creates a model response, optionally interacting with custom tools via the Model Context Protocol (MCP). This endpoint supports conversational continuation and streaming. Explore our cookbooks for detailed examples: - [Basic MCP Usage](https://github.com/fw-ai/cookbook/blob/main/learn/response-api/fireworks_mcp_examples.ipynb) - [Streaming with MCP](https://github.com/fw-ai/cookbook/blob/main/learn/response-api/fireworks_mcp_with_streaming.ipynb) - [Conversational History with `previous_response_id`](https://github.com/fw-ai/cookbook/blob/main/learn/response-api/fireworks_previous_response_cookbook.ipynb) - [Basic Streaming](https://github.com/fw-ai/cookbook/blob/main/learn/response-api/fireworks_streaming_example.ipynb) - [Controlling Response Storage](https://github.com/fw-ai/cookbook/blob/main/learn/response-api/mcp_server_with_store_false_argument.ipynb) - [Prepare Model for different precisions](https://docs.fireworks.ai/api-reference/prepare-model.md) - [Undelete Deployment](https://docs.fireworks.ai/api-reference/undelete-deployment.md) - [Update Dataset](https://docs.fireworks.ai/api-reference/update-dataset.md) - [Update LoRA](https://docs.fireworks.ai/api-reference/update-deployed-model.md) - [Update Deployment](https://docs.fireworks.ai/api-reference/update-deployment.md) - [Update Model](https://docs.fireworks.ai/api-reference/update-model.md) - [Update User](https://docs.fireworks.ai/api-reference/update-user.md) - [Upload Dataset Files](https://docs.fireworks.ai/api-reference/upload-dataset-files.md): Provides a streamlined way to upload a dataset file in a single API request. This path can handle file sizes up to 150Mb. For larger file sizes use [Get Dataset Upload Endpoint](get-dataset-upload-endpoint). - [Validate Dataset Upload](https://docs.fireworks.ai/api-reference/validate-dataset-upload.md) - [Validate Model Upload](https://docs.fireworks.ai/api-reference/validate-model-upload.md) - [Client-side performance optimization](https://docs.fireworks.ai/deployments/client-side-performance-optimization.md): Optimize your client code for maximum performance with dedicated deployments - [Direct routing](https://docs.fireworks.ai/deployments/direct-routing.md): Direct routing enables enterprise users reduce latency to their deployments. - [Exporting Metrics](https://docs.fireworks.ai/deployments/exporting-metrics.md): Export metrics from your dedicated deployments to your observability stack - [Regions](https://docs.fireworks.ai/deployments/regions.md): Fireworks runs a global fleet of hardware on which you can deploy your models. - [Reserved capacity](https://docs.fireworks.ai/deployments/reservations.md) - [Hugging Face](https://docs.fireworks.ai/ecosystem/integrations/hugging-face.md): Learn how developers can integrate and use Fireworks.ai inference capabilities via the Hugging Face ecosystem. - [Amazon SageMaker](https://docs.fireworks.ai/ecosystem/integrations/sagemaker.md): Learn how to integrate and use Fireworks AI inference capabilities in Amazon SageMaker. - [null](https://docs.fireworks.ai/evaluators/api_reference/api_overview.md) - [null](https://docs.fireworks.ai/evaluators/api_reference/data_models.md) - [null](https://docs.fireworks.ai/evaluators/api_reference/reward_function_class.md) - [null](https://docs.fireworks.ai/evaluators/api_reference/reward_function_decorator.md) - [null](https://docs.fireworks.ai/evaluators/cli_reference/cli_overview.md) - [null](https://docs.fireworks.ai/evaluators/developer_guide/agent_evaluation.md) - [null](https://docs.fireworks.ai/evaluators/developer_guide/core_data_types.md) - [null](https://docs.fireworks.ai/evaluators/developer_guide/evaluation_workflows.md) - [null](https://docs.fireworks.ai/evaluators/developer_guide/getting_started.md) - [null](https://docs.fireworks.ai/evaluators/developer_guide/reward_function_anatomy.md) - [null](https://docs.fireworks.ai/evaluators/documentation_home.md) - [null](https://docs.fireworks.ai/evaluators/examples/accuracy_length/accuracy_length_overview.md) - [null](https://docs.fireworks.ai/evaluators/examples/advanced_examples/advanced_reward_functions.md) - [null](https://docs.fireworks.ai/evaluators/examples/advanced_examples/code_execution_with_e2b.md) - [null](https://docs.fireworks.ai/evaluators/examples/advanced_examples/math_evaluation.md) - [null](https://docs.fireworks.ai/evaluators/examples/apps_coding_example.md) - [null](https://docs.fireworks.ai/evaluators/examples/basic_examples/basic_reward_function.md) - [null](https://docs.fireworks.ai/evaluators/examples/basic_examples/reward_functions_overview.md) - [null](https://docs.fireworks.ai/evaluators/examples/e2b/e2b_code_execution_overview.md) - [null](https://docs.fireworks.ai/evaluators/examples/examples_overview.md) - [null](https://docs.fireworks.ai/evaluators/examples/gcp_cloud_run_deployment_example.md) - [null](https://docs.fireworks.ai/evaluators/examples/math_with_formatting_example.md) - [null](https://docs.fireworks.ai/evaluators/examples/tool_calling_example.md) - [Featured](https://docs.fireworks.ai/examples/introduction.md): Standalone examples showing how to use Fireworks to solve real-world use cases - [null](https://docs.fireworks.ai/examples/knowledge-distillation.md) - [null](https://docs.fireworks.ai/examples/reward-hacking.md) - [null](https://docs.fireworks.ai/examples/text-to-sql.md) - [How do I close my Fireworks.ai account?](https://docs.fireworks.ai/faq-new/account-access/how-do-i-close-my-fireworksai-account.md) - [I have multiple Fireworks accounts. When I try to login with Google on Fireworks' web UI, I'm getting signed into the wrong account. How do I fix this?](https://docs.fireworks.ai/faq-new/account-access/i-have-multiple-fireworks-accounts-when-i-try-to-login-with-google-on-fireworks.md) - [What email does GitHub authentication use?](https://docs.fireworks.ai/faq-new/account-access/what-email-does-github-authentication-use.md) - [What email does LinkedIn authentication use?](https://docs.fireworks.ai/faq-new/account-access/what-email-does-linkedin-authentication-use.md) - [What should I do if I can't access my company account after being invited when I already have a personal account?](https://docs.fireworks.ai/faq-new/account-access/what-should-i-do-if-i-cant-access-my-company-account-after-being-invited-when-i.md) - [Are calls to the Models API billable?](https://docs.fireworks.ai/faq-new/billing-pricing/are-calls-to-the-models-api-billable.md) - [Are there discounts for bulk spend on serverless deployments?](https://docs.fireworks.ai/faq-new/billing-pricing/are-there-discounts-for-bulk-spend-on-serverless-deployments.md) - [Are there discounts for bulk usage?](https://docs.fireworks.ai/faq-new/billing-pricing/are-there-discounts-for-bulk-usage.md) - [Are there extra fees for serving fine-tuned models?](https://docs.fireworks.ai/faq-new/billing-pricing/are-there-extra-fees-for-serving-fine-tuned-models.md) - [How does billing and credit usage work?](https://docs.fireworks.ai/faq-new/billing-pricing/how-does-billing-and-credit-usage-work.md) - [How many tokens per image?](https://docs.fireworks.ai/faq-new/billing-pricing/how-many-tokens-per-image.md): Learn how to calculate token usage for images in vision models and understand pricing implications - [How much does Fireworks cost?](https://docs.fireworks.ai/faq-new/billing-pricing/how-much-does-fireworks-cost.md) - [I bought credits but don’t see them reflected in my account. Did they disappear?](https://docs.fireworks.ai/faq-new/billing-pricing/i-bought-credits-but-dont-see-them-reflected-in-my-account-did-they-disappear.md) - [Is prompt caching billed differently for serverless models?](https://docs.fireworks.ai/faq-new/billing-pricing/is-prompt-caching-billed-differently.md) - [What happens when I finish my $1 credit?](https://docs.fireworks.ai/faq-new/billing-pricing/what-happens-when-i-finish-my-1-dollar-credit.md) - [Where's my receipt for purchased credits?](https://docs.fireworks.ai/faq-new/billing-pricing/wheres-my-receipt-for-purchased-credits.md) - [Why did I receive an invoice when I only deposited credits?](https://docs.fireworks.ai/faq-new/billing-pricing/why-did-i-receive-an-invoice-when-i-only-deposited-credits.md) - [Why might my account be suspended even with remaining credits?](https://docs.fireworks.ai/faq-new/billing-pricing/why-might-my-account-be-suspended-even-with-remaining-credits.md) - [Are there any quotas for serverless?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/are-there-any-quotas-for-serverless.md) - [Are there any SLAs for serverless models?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/are-there-any-slas-for-serverless-models.md) - [Are there costs associated with deploying fine-tuned models to serverless infrastructure?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/are-there-costs-associated-with-deploying-fine-tuned-models-to-serverless-infras.md) - [Do you provide notice before removing model availability?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/do-you-provide-notice-before-removing-model-availability.md) - [Do you support Auto Scaling?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/do-you-support-auto-scaling.md) - [How can we benchmark?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/how-can-we-benchmark.md) - [How does autoscaling affect my costs?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/how-does-autoscaling-affect-my-costs.md) - [How does billing and scaling work for on-demand GPU deployments?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/how-does-billing-and-scaling-work-for-on-demand-gpu-deployments.md) - [How does billing work for on-demand deployments?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/how-does-billing-work-for-on-demand-deployments.md) - [How does the system scale?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/how-does-the-system-scale.md) - [How can I optimize latency for single replica deployments?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/how-to-reduce-latency-for-deployment-on-single-replica.md) - [I have more specific performance questions about improvements](https://docs.fireworks.ai/faq-new/deployment-infrastructure/i-have-more-specific-performance-questions-about-improvements.md) - [Is latency guaranteed for serverless models?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/is-latency-guaranteed-for-serverless-models.md) - [What are the best practices for optimizing performance?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/what-are-the-best-practices-for-optimizing-performance.md) - [What are the common issues when deploying custom models?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/what-are-the-common-issues-when-deploying-custom-models.md) - [What are the rate limits for on-demand deployments?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/what-are-the-rate-limits-for-on-demand-deployments.md) - [What are the techniques to improve performance?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/what-are-the-techniques-to-improve-performance.md) - [What factors affect model latency and performance?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/what-factors-affect-model-latency-and-performance.md) - [What factors affect the number of simultaneous requests that can be handled?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/what-factors-affect-the-number-of-simultaneous-requests-that-can-be-handled.md) - [What should I expect for deployment and scaling performance?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/what-should-i-expect-for-deployment-and-scaling-performance.md) - [What’s the latency for small, medium, and large LLM models?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/whats-the-latency-for-small-medium-and-large-llm-models.md) - [What’s the supported throughput?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/whats-the-supported-throughput.md) - [Which accelerator/GPU should I use?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/which-acceleratorgpu-should-i-use.md) - [Why am I experiencing request timeout errors and slow response times with serverless LLM models?](https://docs.fireworks.ai/faq-new/deployment-infrastructure/why-am-i-experiencing-request-timeout-errors-and-slow-response-times-with-server.md) - [Does Fireworks offer a fine-tuning service?](https://docs.fireworks.ai/faq-new/fine-tuning/does-fireworks-offer-a-fine-tuning-service.md) - [What models are supported for fine-tuning? Is Llama 3 supported for fine-tuning?](https://docs.fireworks.ai/faq-new/fine-tuning/what-models-are-supported-for-fine-tuning-is-llama-3-supported-for-fine-tuning.md) - [Why am I getting "invalid id" errors when using firectl commands like create deployment or list deployments?](https://docs.fireworks.ai/faq-new/fine-tuning/why-am-i-getting-invalid-id-errors-when-using-firectl-commands-like-create-deplo.md) - [Why am I getting "Model not found" errors when trying to access my fine-tuned model?](https://docs.fireworks.ai/faq-new/fine-tuning/why-am-i-getting-model-not-found-errors-when-trying-to-access-my-fine-tuned-mode.md) - [Why can’t I deploy my fine-tuned Llama 3.1 LoRA adapter?](https://docs.fireworks.ai/faq-new/fine-tuning/why-cant-i-deploy-my-fine-tuned-llama-31-lora-adapter.md) - [Can I create custom LoRA models with FLUX?](https://docs.fireworks.ai/faq-new/models-inference/can-i-create-custom-lora-models-with-flux.md) - [Can I generate multiple images in a single API call using FLUX serverless?](https://docs.fireworks.ai/faq-new/models-inference/can-i-generate-multiple-images-in-a-single-api-call-using-flux-serverless.md) - [Can safety filters or content restrictions be disabled on text generation models?](https://docs.fireworks.ai/faq-new/models-inference/can-safety-filters-or-content-restrictions-be-disabled-on-text-generation-models.md) - [Does Fireworks support custom base models?](https://docs.fireworks.ai/faq-new/models-inference/does-fireworks-support-custom-base-models.md) - [Does FLUX support image-to-image generation?](https://docs.fireworks.ai/faq-new/models-inference/does-flux-support-image-to-image-generation.md) - [Does the API support batching and load balancing?](https://docs.fireworks.ai/faq-new/models-inference/does-the-api-support-batching-and-load-balancing.md) - [How do I control output image sizes when using SDXL ControlNet?](https://docs.fireworks.ai/faq-new/models-inference/how-do-i-control-output-image-sizes-when-using-sdxl-controlnet.md) - [How to check if a model is available on serverless?](https://docs.fireworks.ai/faq-new/models-inference/how-to-check-if-a-model-is-available-on-serverless.md) - [How to get performance metrics for streaming responses?](https://docs.fireworks.ai/faq-new/models-inference/how-to-get-performance-metrics-for-streaming-responses.md) - [There’s a model I would like to use that isn’t available on Fireworks. Can I request it?](https://docs.fireworks.ai/faq-new/models-inference/theres-a-model-i-would-like-to-use-that-isnt-available-on-fireworks-can-i-reques.md) - [What are the maximum completion token limits for models, and can they be increased?](https://docs.fireworks.ai/faq-new/models-inference/what-are-the-maximum-completion-token-limits-for-models-and-can-they-be-increase.md) - [What factors affect the number of simultaneous requests that can be handled?](https://docs.fireworks.ai/faq-new/models-inference/what-factors-affect-the-number-of-simultaneous-requests-that-can-be-handled.md) - [What quantization format is used for the Llama 3.1 405B model?](https://docs.fireworks.ai/faq-new/models-inference/what-quantization-format-is-used-for-the-llama-31-405b-model.md) - [Do you provide private connections?](https://docs.fireworks.ai/faq-new/security-compliance/do-you-provide-private-connections.md) - [Do you put any guardrails before any LLM models?](https://docs.fireworks.ai/faq-new/security-compliance/do-you-put-any-guardrails-before-any-llm-models.md) - [Does Fireworks provide client-side encryption or allow customers to bring their own encryption keys?](https://docs.fireworks.ai/faq-new/security-compliance/does-fireworks-provide-client-side-encryption-or-allow-customers-to-bring-their.md) - [How is data encrypted at rest?](https://docs.fireworks.ai/faq-new/security-compliance/how-is-data-encrypted-at-rest.md) - [How is data encrypted in transit?](https://docs.fireworks.ai/faq-new/security-compliance/how-is-data-encrypted-in-transit.md) - [What type of certifications do you have?](https://docs.fireworks.ai/faq-new/security-compliance/what-type-of-certifications-do-you-have.md) - [Where can I find more information about your security policies?](https://docs.fireworks.ai/faq-new/security-compliance/where-can-i-find-more-information-about-your-security-policies.md) - [Are there any quotas for Enterprise Tier?](https://docs.fireworks.ai/faq-new/support-general/are-there-any-quotas-for-enterprise-tier.md) - [Do you have a shared Slack channel?](https://docs.fireworks.ai/faq-new/support-general/do-you-have-a-shared-slack-channel.md) - [Do you host your deployments in the EU or Asia?](https://docs.fireworks.ai/faq-new/support-general/do-you-host-your-deployments-in-the-eu-or-asia.md) - [How does Support work?](https://docs.fireworks.ai/faq-new/support-general/how-does-support-work.md) - [I have another question or issue.](https://docs.fireworks.ai/faq-new/support-general/i-have-another-question-or-issue.md) - [I have specific performance questions or want to know about further performance improvement options.](https://docs.fireworks.ai/faq-new/support-general/i-have-specific-performance-questions-or-want-to-know-about-further-performance.md) - [If you're an Enterprise customer, how do you contact support?](https://docs.fireworks.ai/faq-new/support-general/if-youre-an-enterprise-customer-how-do-you-contact-support.md) - [What are the support tiers and SLAs for enterprise?](https://docs.fireworks.ai/faq-new/support-general/what-are-the-support-tiers-and-slas-for-enterprise.md) - [What support options exist?](https://docs.fireworks.ai/faq-new/support-general/what-support-options-exist.md) - [Direct Preference Optimization (DPO) on Fireworks AI](https://docs.fireworks.ai/fine-tuning/dpo-fine-tuning.md) - [Importing fine-tuned models](https://docs.fireworks.ai/fine-tuning/fine-tuned-import.md) - [External GCS Bucket Integration](https://docs.fireworks.ai/fine-tuning/fine-tuning-extrenal-dataset.md): Use external Google Cloud Storage buckets for fine-tuning while keeping your data private with secure, isolated access - [Supervised fine-tuning for text (SFT)](https://docs.fireworks.ai/fine-tuning/fine-tuning-models.md) - [Supervised fine-tuning for VLMs (SFT)](https://docs.fireworks.ai/fine-tuning/fine-tuning-vlm.md): Learn how to fine-tune vision-language models on Fireworks AI with image and text datasets - [Introduction to fine-tuning](https://docs.fireworks.ai/fine-tuning/finetuning-intro.md) - [Using multi-LoRA](https://docs.fireworks.ai/fine-tuning/multi-lora.md) - [Reinforcement fine-tuning (RFT)](https://docs.fireworks.ai/fine-tuning/reinforcement-fine-tuning-models.md) - [Evaluators (RewardKit)](https://docs.fireworks.ai/fine-tuning/reward-kit.md) - [Single-LoRA deployment with live merge](https://docs.fireworks.ai/fine-tuning/single-lora.md): Deploy a LoRA fine-tuned model using live merge for simplified deployment and optimal performance - [Concepts](https://docs.fireworks.ai/getting-started/concepts.md): This document outlines basic Fireworks AI concepts. - [Fireworks AI Developer Platform](https://docs.fireworks.ai/getting-started/introduction.md): Start building with open source AI models - [Quickstart](https://docs.fireworks.ai/getting-started/quickstart.md): Get started in minutes with an OpenAI-compatible endpoint - [Batch Inference](https://docs.fireworks.ai/guides/batch-inference.md) - [Using function-calling](https://docs.fireworks.ai/guides/function-calling.md) - [Introduction](https://docs.fireworks.ai/guides/inference-introduction.md) - [On-demand deployments](https://docs.fireworks.ai/guides/ondemand-deployments.md) - [Using predicted outputs](https://docs.fireworks.ai/guides/predicted-outputs.md): Use Predicted Outputs to boost output generation speeds for editing / rewriting use cases - [Prompt caching](https://docs.fireworks.ai/guides/prompt-caching.md) - [Querying transcription models](https://docs.fireworks.ai/guides/querying-asr-models.md) - [Querying embedding models](https://docs.fireworks.ai/guides/querying-embeddings-models.md) - [Querying text models](https://docs.fireworks.ai/guides/querying-text-models.md) - [Querying vision-language models](https://docs.fireworks.ai/guides/querying-vision-language-models.md) - [Rate limits, spend limits and quotas](https://docs.fireworks.ai/guides/quotas_usage/rate-limits.md): Rate limits, spend limits and quotas for serverless inference and on-demand deployments - [Recommended open models](https://docs.fireworks.ai/guides/recommended-models.md): A list of recommended open models for common use cases - [Responses API](https://docs.fireworks.ai/guides/response-api.md) - [Data privacy & security](https://docs.fireworks.ai/guides/security_compliance/data_handling.md): How we secure and handle your data - [Voice agent platform](https://docs.fireworks.ai/guides/voice-agents-preview.md): Instructions for using test voice agent endpoints - [null](https://docs.fireworks.ai/models/quantization.md) - [Uploading a custom base model](https://docs.fireworks.ai/models/uploading-custom-models.md) - [Using grammar mode](https://docs.fireworks.ai/structured-responses/structured-output-grammar-based.md) - [Using JSON mode](https://docs.fireworks.ai/structured-responses/structured-response-formatting.md) - [Authentication](https://docs.fireworks.ai/tools-sdks/firectl/commands/authentication.md): Authentication for access to your account - [Create a batch inference job](https://docs.fireworks.ai/tools-sdks/firectl/commands/create-batch-inference-job.md): Create a batch inference job to perform Chat Completion in bulk - [Create a dataset](https://docs.fireworks.ai/tools-sdks/firectl/commands/create-dataset.md): Create a Dataset on the Fireworks platform - [Create a deployment](https://docs.fireworks.ai/tools-sdks/firectl/commands/create-deployment.md): Create a Deployment on Fireworks AI platform - [Create a fine-tuning job](https://docs.fireworks.ai/tools-sdks/firectl/commands/create-finetune-job.md): Create a fine-tuning job with a base model - [Create model](https://docs.fireworks.ai/tools-sdks/firectl/commands/create-model.md): Create a model on the Fireworks platform - [Delete resources](https://docs.fireworks.ai/tools-sdks/firectl/commands/delete-resources.md): Deletes a resource(s) in a Fireworks account - [Download a model](https://docs.fireworks.ai/tools-sdks/firectl/commands/download-model.md): Download a model from third-party locations - [Get resources](https://docs.fireworks.ai/tools-sdks/firectl/commands/get-resources.md): Retrieves information from the Fireworks platform - [List resources](https://docs.fireworks.ai/tools-sdks/firectl/commands/list-resources.md): List various resources in a Fireworks account - [Load LoRA](https://docs.fireworks.ai/tools-sdks/firectl/commands/load-lora.md): Load a LoRA model to a deployment. - [Undelete resources](https://docs.fireworks.ai/tools-sdks/firectl/commands/undelete-resources.md): Undelete resources on the Fireworks platform - [Unload LoRA](https://docs.fireworks.ai/tools-sdks/firectl/commands/unload-lora.md): Unload a LORA model from a deployment. - [Update resources](https://docs.fireworks.ai/tools-sdks/firectl/commands/update-resources.md): Updates resources on the Fireworks platform - [Getting started](https://docs.fireworks.ai/tools-sdks/firectl/firectl.md): Learn to create, deploy, and manage resources using Firectl - [OpenAI compatibility](https://docs.fireworks.ai/tools-sdks/openai-compatibility.md) - [Developing Evaluators](https://docs.fireworks.ai/tools-sdks/python-client/developing-evaluators.md) - [Querying existing dedicated deployments](https://docs.fireworks.ai/tools-sdks/python-client/querying-dedicated-deployments.md): Learn how to connect to and query dedicated deployments that were created outside the SDK - [Basics of the Build SDK](https://docs.fireworks.ai/tools-sdks/python-client/sdk-basics.md) - [Introducing the Fireworks Build SDK](https://docs.fireworks.ai/tools-sdks/python-client/sdk-introduction.md) - [Reference](https://docs.fireworks.ai/tools-sdks/python-client/sdk-reference.md) - [Tutorial](https://docs.fireworks.ai/tools-sdks/python-client/the-tutorial.md) - [Troubleshooting inference errors](https://docs.fireworks.ai/troubleshooting/status_error_codes/inference_error_code.md): This page lists common error codes encountered during inference requests using the Fireworks API, their meanings, and potential resolutions. - [Changelog](https://docs.fireworks.ai/updates/changelog.md) ## Optional - [Model Library](https://app.fireworks.ai/models) - [Demos](https://demos.fireworks.ai/)