Chat Completions with Streaming
Generate chat completions based on a conversation history with streaming support.
Headers
Bearer authentication of the form Bearer <token>, where token is your auth token.
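As a minimal sketch of the header format described above, the helper below builds the header for a request. The function name `bearer_header` is hypothetical, and the header name `Authorization` is assumed from the standard Bearer scheme rather than stated on this page.

```python
def bearer_header(token: str) -> dict:
    """Return a header dict of the form 'Bearer <token>'.

    Assumption: the header is sent under the standard 'Authorization' name.
    """
    if not token:
        raise ValueError("token must be a non-empty string")
    return {"Authorization": f"Bearer {token}"}
```

The returned dict can be passed as the `headers` argument of whichever HTTP client is in use.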
Request
The chat model to use for generating completions.
A value between -2.0 and 2.0. Positive values penalize tokens in proportion to how often they have already appeared in the text so far, making further occurrences less likely.
Modifies the likelihood of specified tokens appearing in a response.
The maximum number of tokens in the generated completion.
A value between -2.0 and 2.0. Positive values apply a flat penalty to any token that has already appeared in the text so far, regardless of how often, making further occurrences less likely.
The temperature parameter for controlling randomness in completions.
The diversity of the generated text based on nucleus sampling.
The diversity of the generated text based on top-k sampling.
Enables streaming of the response.
Options to affect the input of the request.
Deprecated. Please use max_completion_tokens.
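The request parameters above could be assembled into a JSON body roughly as follows. This is a sketch under stated assumptions: only `max_completion_tokens` is named on this page, so every other field name (`model`, `messages`, `stream`, `temperature`, `frequency_penalty`, `presence_penalty`) is a guess based on common chat-completion conventions, and `build_chat_request` is a hypothetical helper. The range check mirrors the -2.0 to 2.0 bounds given for the two penalty values.

```python
def build_chat_request(model, messages, *, stream=True, temperature=None,
                       frequency_penalty=None, presence_penalty=None,
                       max_completion_tokens=None):
    """Build a request body dict; field names other than
    max_completion_tokens are assumptions, not confirmed by this page."""
    # Both penalties are documented as values between -2.0 and 2.0.
    for name, value in (("frequency_penalty", frequency_penalty),
                        ("presence_penalty", presence_penalty)):
        if value is not None and not -2.0 <= value <= 2.0:
            raise ValueError(f"{name} must be between -2.0 and 2.0, got {value}")

    payload = {"model": model, "messages": messages, "stream": stream}
    optional = {
        "temperature": temperature,
        "frequency_penalty": frequency_penalty,
        "presence_penalty": presence_penalty,
        # Used in place of the deprecated token-limit parameter.
        "max_completion_tokens": max_completion_tokens,
    }
    # Leave out anything the caller did not set so server defaults apply.
    payload.update({k: v for k, v in optional.items() if v is not None})
    return payload
```

Omitting unset fields, rather than sending nulls, keeps the body small and avoids overriding server-side defaults by accident.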
Response
Successful response.
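This page does not describe the shape of a streamed response. The sketch below assumes a common convention that may not match this API: server-sent event lines prefixed with `data:`, each carrying a JSON chunk with `choices[].delta.content`, and a final `data: [DONE]` sentinel. The function name `iter_stream_text` is hypothetical.

```python
import json


def iter_stream_text(lines):
    """Yield text fragments from an iterable of streamed lines.

    Assumptions (not confirmed by this page): 'data:'-prefixed lines,
    JSON chunks shaped as {"choices": [{"delta": {"content": ...}}]},
    and a '[DONE]' sentinel marking the end of the stream.
    """
    for raw in lines:
        line = raw.strip()
        if not line.startswith("data:"):
            continue  # skip blank keep-alive lines and other fields
        data = line[len("data:"):].strip()
        if data == "[DONE]":
            break
        chunk = json.loads(data)
        for choice in chunk.get("choices", []):
            text = (choice.get("delta") or {}).get("content")
            if text:
                yield text
```

For example, joining the fragments with `"".join(iter_stream_text(lines))` reconstructs the full completion text once the stream ends.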