Create a chat completion for given messages with streaming support
JWT token for authentication
Array of messages in the conversation
Model identifier
RedHatAI/QwQ-32B-FP8-dynamic Whether to stream the response
Sampling temperature
0 <= x <= 2Maximum number of tokens to generate
1 <= x <= 4096Nucleus sampling parameter
0 <= x <= 1Sequences where the API will stop generating