Skip to main content

Documentation Index

Fetch the complete documentation index at: https://student-213fb9fc.mintlify.app/llms.txt

Use this file to discover all available pages before exploring further.

Model Families Overview

AJ STUDIOZ Cloud Infra hosts models from six major AI research labs, each with distinct strengths.

Gemma (Google)

Lightweight, efficient models from Google DeepMind. Great for production workloads.

Qwen (Alibaba)

High-capability coding and multilingual models with massive scale.

Kimi (Moonshot)

Ultra-large models excelling at agentic and reasoning tasks.

DeepSeek

Top-tier open-source models for coding and analysis.

GLM (Zhipu AI)

Multilingual long-context models with strong Chinese language support.

Mistral & Others

Best-in-class European AI models with multilingual excellence.

Gemma Family (Google DeepMind)

Gemma models are open-weight, production-ready models from Google. They offer excellent quality-per-compute-cost ratio.
ModelParametersNotes
gemma3:4b4BFastest Gemma, ideal for real-time applications
gemma3:12b12BBalanced performance
gemma3:27b27BBest Gemma quality, recommended default
Best for: General assistant tasks, fast inference, cost-sensitive workloads.

Qwen Family (Alibaba Cloud)

Qwen models offer state-of-the-art performance especially in coding and mathematical reasoning.
ModelParametersNotes
qwen3-coder-next~82BNext-gen coding model
qwen3-coder:480b480BLargest coding model available
qwen3-next:80b80BBalanced coding + reasoning
qwen3.5:397b397BLatest Qwen release
qwen3-vl:235b235BVision + language
qwen3-vl:235b-instruct235BVision, instruction-tuned
Best for: Code generation, debugging, mathematical problems, vision tasks.

Kimi Family (Moonshot AI)

Kimi models are among the largest available, optimized for complex multi-step reasoning and agentic workflows.
ModelParametersNotes
kimi-k2:1t~1.1TLargest model, general agentic tasks
kimi-k2.5~1.1TEnhanced capabilities
kimi-k2-thinking~1.1TOptimized for chain-of-thought
Best for: Complex reasoning, research, multi-step agentic tasks, long context.

DeepSeek Family

DeepSeek produces highly capable open-source models that compete with GPT-4 class models.
ModelParametersNotes
deepseek-v3.1:671b671BHigh-quality coding and reasoning
deepseek-v3.2671BLatest DeepSeek release
Best for: Coding, data analysis, structured outputs, STEM tasks.

GLM Family (Zhipu AI)

GLM models excel at long-context understanding and multilingual tasks, especially Chinese.
ModelParametersNotes
glm-5Very LargeLatest flagship, long context
glm-4.6LargeMature, stable release
glm-4.7LargeEnhanced version
Best for: Long documents, Chinese language, multilingual tasks.

MiniMax Family

MiniMax models offer strong multimodal capabilities.
ModelParametersNotes
minimax-m2230BMultimodal baseline
minimax-m2.1230BEnhanced multimodal
minimax-m2.5230BLatest MiniMax release
Best for: Vision + language tasks, multimodal reasoning.

Other Models

ModelFamilyNotes
gpt-oss:120bOpenAI OSSOpen-source OpenAI-style model
gpt-oss:20bOpenAI OSSEfficient variant
cogito-2.1:671bCogitoScientific reasoning
devstral-2:123bMistralCoding-specialized
devstral-small-2:24bMistralEfficient coding model
mistral-large-3:675bMistralFlagship multilingual
ministral-3:3bMistralUltra-compact
ministral-3:8bMistralCompact balanced
ministral-3:14bMistralEfficient mid-size
nemotron-3-nano:30bNVIDIAOptimized inference
rnj-1:8bCommunityCompact reasoning
gemini-3-flash-previewGoogleUltra-fast API model

Choosing the Right Model

Start with: gemma3:27bGreat balance of quality and speed. If you need more capability, try deepseek-v3.2 or glm-5.
Start with: qwen3-coder:480b for best quality, or devstral-2:123b for a balance of speed and quality.For small, fast coding tasks: devstral-small-2:24b or gemma3:12b.
Use: kimi-k2:1t or kimi-k2-thinking for complex multi-step reasoning.Alternatively, cogito-2.1:671b for scientific domains.
Use: gemma3:4b, ministral-3:3b, or gemini-3-flash-preview.These models prioritize speed and cost over raw capability.
Use: qwen3-vl:235b-instruct or minimax-m2.5.