Generate chat completions from a conversation history, with optional streaming of the response.
The chat model to use for generating completions.
The maximum number of tokens in the generated completion.
The sampling temperature, which controls randomness in completions. Lower values make the output more deterministic; higher values make it more varied.
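The nucleus (top-p) sampling threshold, which controls the diversity of the generated text. Only the most probable tokens whose cumulative probability reaches this value are considered.
The top-k sampling limit, which controls the diversity of the generated text. Only the k most probable tokens are considered at each step.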
Whether to stream the completion incrementally as it is generated instead of returning it in a single response.
Options to affect the input of the request.
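
The temperature, top-p, and top-k parameters described above interact at the token-sampling step. The following is a minimal sketch of how such settings are commonly applied, not the implementation behind this endpoint. The function names, the default values, and the order of the filters (top-k first, then top-p) are assumptions made for illustration; a real service may apply them differently.

```python
import math
import random


def softmax_with_temperature(logits, temperature):
    """Turn raw scores into probabilities; a lower temperature sharpens them."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def top_k_filter(probs, k):
    """Return the indices of the k most probable tokens, most probable first."""
    ranked = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    return ranked[:k]


def top_p_filter(probs, candidates, p):
    """Keep the shortest prefix of candidates whose cumulative probability reaches p.

    `candidates` must already be sorted from most to least probable.
    """
    kept, mass = [], 0.0
    for i in candidates:
        kept.append(i)
        mass += probs[i]
        if mass >= p:
            break
    return kept


def sample_token(logits, temperature=1.0, top_k=None, top_p=1.0, rng=random):
    """Sample one token index (hypothetical helper; filter order is an assumption)."""
    probs = softmax_with_temperature(logits, temperature)
    k = len(probs) if top_k is None else top_k
    candidates = top_k_filter(probs, k)
    candidates = top_p_filter(probs, candidates, top_p)
    weights = [probs[i] for i in candidates]  # random.choices renormalizes these
    return rng.choices(candidates, weights=weights, k=1)[0]
```

With top_k=1 the sketch always returns the single most probable token, which is why small top-k or top-p values, like low temperatures, make completions more repeatable.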