Responses
Generates a model response, optionally allowing the model to call tools.
Authentication
Authorization: Bearer
Bearer authentication of the form Bearer <token>, where token is your auth token.
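As a minimal sketch, the header described above could be assembled in Python as follows. The helper name build_auth_headers is hypothetical, and the Content-Type header is an assumption (this reference only says the endpoint expects an object).

```python
def build_auth_headers(token: str) -> dict[str, str]:
    """Return HTTP headers carrying a Bearer token (hypothetical helper)."""
    if not token:
        raise ValueError("token must be a non-empty string")
    return {
        # Documented form: "Bearer <token>".
        "Authorization": f"Bearer {token}",
        # Assumption: the request object is sent as JSON.
        "Content-Type": "application/json",
    }
```

Keeping the token out of source code (for example, reading it from an environment variable) is a common precaution, though nothing in this reference requires it.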
Request
This endpoint expects an object.
model
The AI model to use for generating responses.
input
instructions
A system (or developer) message inserted into the model’s context.
max_output_tokens
The maximum number of tokens in the generated output.
max_tool_calls
The maximum number of tool calls the model may make.
parallel_tool_calls
Whether to enable parallel function calling during tool use.
reasoning
Constrains effort on reasoning for reasoning models.
stream
Whether to stream back the model response. Not currently supported.
stream_options
Extra parameters used when streaming the response.
temperature
Sampling temperature, which controls the randomness of the output. Accepts values from 0.0 to 2.0.
tool_choice
tools
The content of the tool call.
top_p
Nucleus sampling parameter, which controls the diversity of the generated text. Accepts values from 0.0 to 1.0.
safeguards
Safeguards to run on the request.
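The request fields above can be sketched as a small Python builder that checks the documented ranges for temperature and top_p before anything is sent. The function name build_request_body and its parameter input_value are hypothetical. Treating every field other than model and input as optional is an assumption, since this reference does not say which fields are required. The shape of input is not described here either, so it is passed through unchanged.

```python
from typing import Any, Optional


def build_request_body(
    model: str,
    input_value: Any,
    *,
    instructions: Optional[str] = None,
    max_output_tokens: Optional[int] = None,
    max_tool_calls: Optional[int] = None,
    parallel_tool_calls: Optional[bool] = None,
    temperature: Optional[float] = None,
    top_p: Optional[float] = None,
) -> dict[str, Any]:
    """Assemble the request object, omitting fields that were not set."""
    # Ranges taken from the field descriptions above.
    if temperature is not None and not 0.0 <= temperature <= 2.0:
        raise ValueError("temperature must be between 0.0 and 2.0")
    if top_p is not None and not 0.0 <= top_p <= 1.0:
        raise ValueError("top_p must be between 0.0 and 1.0")

    optional_fields = {
        "instructions": instructions,
        "max_output_tokens": max_output_tokens,
        "max_tool_calls": max_tool_calls,
        "parallel_tool_calls": parallel_tool_calls,
        "temperature": temperature,
        "top_p": top_p,
    }
    body: dict[str, Any] = {"model": model, "input": input_value}
    # Assumption: unset optional fields can simply be left out.
    body.update({k: v for k, v in optional_fields.items() if v is not None})
    return body
```

Validating on the client side gives an immediate, local error for out-of-range values instead of depending on how the server reports them. The stream field is left out on purpose, since it is documented as not currently supported.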
Response
Successful response.
id
Unique ID for the response.
object
Type of object (chat completion).
created_at
Timestamp of when the response was created.
status
The status of the request.
max_tool_calls
The max_tool_calls specified in the request.
model
The chat model used for generating completions.
output
The set of result outputs.
instructions
The instructions specified in the request.
max_output_tokens
The max_output_tokens value specified in the request.
parallel_tool_calls
The parallel_tool_calls value specified in the request.
temperature
The temperature value specified in the request.
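A hedged sketch of reading the response fields listed above, assuming the response body has already been decoded into a Python dict. ResponseSummary and summarize_response are hypothetical names. The types of created_at and output are not given in this reference, so they are kept as opaque values.

```python
from dataclasses import dataclass
from typing import Any, Optional


@dataclass
class ResponseSummary:
    """A subset of the documented response fields (hypothetical container)."""
    id: Optional[str]
    status: Optional[str]
    model: Optional[str]
    created_at: Any  # type not specified in this reference
    output: Any      # "the set of result outputs"; shape not specified


def summarize_response(payload: dict[str, Any]) -> ResponseSummary:
    """Pick out commonly used fields; missing keys become None."""
    return ResponseSummary(
        id=payload.get("id"),
        status=payload.get("status"),
        model=payload.get("model"),
        created_at=payload.get("created_at"),
        output=payload.get("output"),
    )
```

Using dict.get keeps the helper tolerant of fields that are absent from a given response, which seems prudent while the exact response schema is not spelled out.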

