Create a batch inference job to perform Chat Completion in bulk
dataset-id
with the input data. Must be in OpenAI format (you may directly use an OpenAI batch format file)model-id
to perform batch inference (or just specify a lora-id here). Will overwrite any model that has been specified per row.