Documentation Index
Fetch the complete documentation index at: https://student-213fb9fc.mintlify.app/llms.txt
Use this file to discover all available pages before exploring further.
Overview
AJ STUDIOZ Cloud Infra hosts a curated selection of frontier open-weight AI models. All models are accessible through both the Ollama-compatible API and the OpenAI-compatible API.Use the model name exactly as shown in the
model column when making API requests.Flagship Models (100B+)
These are the most capable models for complex reasoning, coding, and agentic tasks.| Model Name | Size | Added | Best For |
|---|---|---|---|
kimi-k2:1t | ~1.1T | Sep 2025 | Agentic tasks, long context |
kimi-k2.5 | ~1.1T | Jan 2026 | Next-gen reasoning |
kimi-k2-thinking | ~1.1T | Nov 2025 | Chain-of-thought reasoning |
qwen3.5:397b | 397B | Feb 2026 | Multilingual, reasoning |
qwen3-coder:480b | 480B | Jul 2025 | State-of-the-art coding |
qwen3-vl:235b | 235B | Sep 2025 | Vision + language |
qwen3-vl:235b-instruct | 235B | Sep 2025 | Vision, instruction-tuned |
cogito-2.1:671b | 671B | Nov 2025 | Scientific reasoning |
deepseek-v3.1:671b | 671B | Nov 2025 | Coding, analysis |
deepseek-v3.2 | 671B | Dec 2025 | Latest DeepSeek release |
glm-5 | Large | Feb 2026 | Long context, multilingual |
glm-4.6 | Large | Sep 2025 | Balanced GLM model |
glm-4.7 | Large | Dec 2025 | Latest GLM release |
minimax-m2 | 230B | Oct 2025 | Multimodal |
minimax-m2.1 | 230B | Dec 2025 | Enhanced multimodal |
minimax-m2.5 | 230B | Feb 2026 | Latest MiniMax model |
mistral-large-3:675b | 675B | Dec 2025 | Multilingual excellence |
Large Models (20B–100B)
High-performance models with broad capability coverage.| Model Name | Size | Added | Best For |
|---|---|---|---|
gpt-oss:120b | 120B | Aug 2025 | General purpose |
devstral-2:123b | 123B | Dec 2025 | Code generation |
qwen3-next:80b | 80B | Sep 2025 | Coding + reasoning |
qwen3-coder-next | ~82B | Feb 2025 | Advanced code tasks |
gemma3:27b | 27B | Mar 2025 | Balanced, fast |
devstral-small-2:24b | 24B | Dec 2025 | Efficient code model |
gemma3:12b | 12B | Mar 2025 | Fast inference |
nemotron-3-nano:30b | 30B | Dec 2025 | NVIDIA specialized |
Small & Compact Models (≤20B)
Optimized for speed, low latency, and cost efficiency.| Model Name | Size | Added | Best For |
|---|---|---|---|
gpt-oss:20b | 20B | Aug 2025 | Fast general purpose |
rnj-1:8b | ~16B | Dec 2025 | Compact reasoning |
ministral-3:14b | 14B | Dec 2025 | Efficient multilingual |
ministral-3:8b | 10.4B | Dec 2025 | Balanced small model |
gemma3:4b | 8.6B | Mar 2025 | Ultra-fast inference |
ministral-3:3b | 4.7B | Dec 2025 | Fastest responses |
gemini-3-flash-preview | — | Dec 2025 | API-based, ultra-fast |
Full Model Catalog (JSON)
View raw model list (JSON)
View raw model list (JSON)
