Ollama-Compatible API
Chat
Multi-turn chat with message history (Ollama-compatible)
POST
Chat
Documentation Index
Fetch the complete documentation index at: https://student-213fb9fc.mintlify.app/llms.txt
Use this file to discover all available pages before exploring further.
Overview
Send a chat message with full conversation history and receive a response. This endpoint is fully compatible with the Ollama/api/chat format.
Request
Headers
Bearer token:
Bearer YOUR_API_KEYBody
The model name to use. See available models.
Array of message objects representing the conversation:
role(string, required) —"system","user", or"assistant"content(string, required) — Message textimages(array of strings, optional) — Base64-encoded images (for vision models)
Stream the response as it’s generated. Set to
false for a single JSON response.Model generation options:
temperature(float, 0–2)top_p(float)top_k(integer)num_predict(integer) — max tokensstop(array of strings)
List of tools/functions available for the model to call (function calling).
Output format. Set to
"json" to force JSON output.Response
The model that generated the response.
ISO 8601 timestamp.
The assistant’s reply:
role:"assistant"content: The response texttool_calls: Array of tool call objects (if function calling used)
true when the response is complete.stop, length, or tool_calls.Total time in nanoseconds.
Number of tokens generated.
