Models - MiniMax API Docs

Models Overview
Text
Audio
Video
Music
Recommended Reading

2a45714f-1895-4058-90de-e78320f3b85a

MiniMax M2.1

Polyglot Programming Mastery

71df4ee8-f064-441e-85aa-bd6cab09111b

MiniMax Hailuo 2.3

Breathtaking Motion, Lifelike Emotion

20e5abab-7c0d-4ae5-92e5-a80eed96492e

MiniMax Speech 2.6

Real-Time Response, Intelligent Parsing

Models Overview

Text

Models	Description	Features
MiniMax-M2.1	• 230B total parameters with 10B activated per inference • Optimized for code generation and refactoring	• Polyglot code mastery • Precision code refactoring • Enhanced reasoning
MiniMax-M2.1-lightning	• Same performance as M2.1 • Significantly faster inference	• Polyglot code mastery • Precision code refactoring • Low latency
MiniMax-M2	• Context Length: 200k tokens • Maximum Output: 128k tokens (including CoT)	• Agentic capabilities • Function calling • Advanced reasoning • Real-time streaming

Audio

Models	Description	Features
speech-2.6-hd	• Ultimate Similarity • Ultra-High Quality	• 40 languages supported • 7 emotions supported • specified languages and dialects supported
speech-2.6-turbo	• Ultimate Value • Low latency	• 40 languages supported • 7 emotions supported • specified languages and dialects supported
speech-02-hd	• Stronger replication similarity • High quality voice generation	• 24 languages supported • 7 emotions supported • specified languages and dialects supported
speech-02-turbo	• Superior rhythm and stability • Low latency	• 24 languages supported • 7 emotions supported • specified languages and dialects supported

Video

Models	Description	Res.& Dur.	FPS
MiniMax Hailuo 2.3	• Text to Video & Image to Video • SOTA instruction following • Extreme physics mastery	• 1080p 6s • 768p 6s, 10s	24 fps
MiniMax Hailuo 2.3Fast	• Image to Video • Extreme physics mastery • Value and Efficiency	• 1080p 6s • 768p 6s, 10s	24 fps
MiniMax Hailuo 02	• Text to Video & Image to Video • SOTA instruction following • Extreme physics mastery	• 1080p 6s • 768p 6s, 10s • 512p 6s, 10s	24 fps

Music

Models	Description	Features
Music-2.5	• Text to Music • Human-like Emotional Vocals • Enhanced Multi-Instrument Performance	• Professional studio quality • Cohesive musical structure • Precision style control • Realistic, expressive vocals
Music-2.0	• Text to Music • Enhanced musicality • Natural vocals and smooth melodies	• Human-like performance • Riche emotional expression • Enhanced tone control

Recommended Reading

Quick start

Refer to the Quick Start Guide to explore and experience the model’s capabilities

Compatible Anthropic API (Recommended)

Use Anthropic SDK with MiniMax models

⌘I