SUTRA-MULTILINGUAL
Perform an LLM completion with a streaming response in OpenAI format. The request payload is a CompletionParams object. Returns OpenAI-format chunks in a server-sent events (SSE) stream.
POST /v2/chat/completions
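A minimal request-building sketch using only the Python standard library. The base URL and token are placeholders (substitute your actual SUTRA API host and auth token); the field names follow the parameter list below.

```python
import json
import urllib.request

# Placeholder values: replace with your actual SUTRA API host and auth token.
BASE_URL = "https://api.example.com"
API_TOKEN = "YOUR_AUTH_TOKEN"

def build_request(model, messages, max_tokens=1024, temperature=0.3):
    """Build a POST request for /v2/chat/completions expecting an SSE response."""
    payload = {
        "model": model,            # e.g. "sutra-light"
        "messages": messages,      # the LLM prompt
        "max_tokens": max_tokens,
        "temperature": temperature,
        "stream": True,            # the endpoint streams OpenAI-format chunks
    }
    return urllib.request.Request(
        BASE_URL + "/v2/chat/completions",
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {API_TOKEN}",
            "Content-Type": "application/json",
            "Accept": "text/event-stream",
        },
        method="POST",
    )

req = build_request("sutra-light", [{"role": "user", "content": "Hello"}])
```

Calling `urllib.request.urlopen(req)` would then yield the SSE response body line by line.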
Authorizations
Authorization
string, header, required
Bearer authentication header of the form Bearer <token>, where <token> is your auth token.
Body
application/json
model
enum<string>
required
A model name, for example 'sutra-light'.
Available options: sutra-light, sutra-pro, sutra-online
messages
object[]
required
The LLM prompt.
max_tokens
number
The maximum number of tokens to generate before terminating. This number cannot exceed the context window for the selected model. The default value is 1024.
temperature
number
Controls the randomness of the response; a lower temperature gives less random output. Values are in the range [0, 2]; the default is 0.3.
stop
object
May be a string, null, or an array of strings.
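All three accepted shapes of the stop field serialize to valid JSON payloads; a quick illustration:

```python
import json

def encode_stop(stop):
    """Serialize a payload fragment carrying one accepted 'stop' shape."""
    return json.dumps({"stop": stop})

# A single string, null, and an array of strings are all valid.
examples = [
    encode_stop("\n\n"),
    encode_stop(None),
    encode_stop(["END", "STOP"]),
]
```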
presence_penalty
number
Positive values penalize tokens that have already appeared in the text, encouraging the model to introduce new topics.
frequency_penalty
number
Positive values penalize tokens in proportion to how often they have appeared so far, reducing repetition.
top_p
number
Nucleus-sampling cutoff: the model samples only from the smallest set of tokens whose cumulative probability exceeds top_p.
extra_body
object
Response
200 - text/event-stream; charset=utf-8
A stream of server-sent events (SSE) conforming to the OpenAI chunk format.
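The stream is standard SSE: each event line begins with "data: " and carries an OpenAI-format chunk, ending with a "data: [DONE]" sentinel. A minimal parser sketch (the sample lines below are illustrative, not captured from the API):

```python
import json

def collect_content(sse_lines):
    """Concatenate content deltas from OpenAI-format SSE lines."""
    parts = []
    for line in sse_lines:
        line = line.strip()
        if not line.startswith("data: "):
            continue  # skip blank keep-alive lines and comments
        data = line[len("data: "):]
        if data == "[DONE]":  # end-of-stream sentinel
            break
        chunk = json.loads(data)
        delta = chunk["choices"][0]["delta"]
        parts.append(delta.get("content", ""))
    return "".join(parts)

# Illustrative sample stream following the OpenAI chunk shape.
sample = [
    'data: {"choices": [{"delta": {"role": "assistant"}}]}',
    'data: {"choices": [{"delta": {"content": "Hel"}}]}',
    'data: {"choices": [{"delta": {"content": "lo"}}]}',
    "data: [DONE]",
]
text = collect_content(sample)  # → "Hello"
```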