POST
/
v2
/
chat
/
completions

Authorizations

Authorization
string
header
required

Bearer authentication header of the form Bearer <token>, where <token> is your auth token.

Body

application/json

User input for a chat completion request.

messages
object[]
required

The LLM prompt.

model
enum<string>
required

A model name, for example 'sutra-light'.

Available options:
sutra-light,
sutra-pro,
sutra-online
extra_body
object
frequency_penalty
number
max_tokens
number

The maximum number of tokens to generate before terminating. This number cannot exceed the context window for the selected model. The default value is 1024.

presence_penalty
number
stop
object

May be a string, null or an array of strings.

temperature
number

Controls the randomness of the response, a lower temperature gives lower randomness. Values are in the range [0,2] with a default value of 0.3.

top_p
number

Response

200 - text/event-stream; charset=utf-8

A server-sent event (SSE) conforming to OpenAI format.