Models - MiniMax API Docs

Language

Models	Description	Features
MiniMax-M3	Frontier multimodal coding model with 1M context window	• Multimodal • 1M context window • Frontier coding
MiniMax-M2.7	Beginning the journey of recursive self-improvement	• Top real-world engineering • Professional office delivery • Character-rich interaction
MiniMax-M2.7-highspeed	Same performance as M2.7 • Significantly faster inference	• Polyglot code mastery • Precision code refactoring • Low latency

Legacy Models

Models	Description	Features
MiniMax-M2.5	• Optimized for code generation and refactoring	• Peak Performance. Ultimate Value. Master the Complex.
MiniMax-M2.5-highspeed	• Same performance as M2.5 • Significantly faster inference	• Polyglot code mastery • Precision code refactoring • Low latency
MiniMax-M2.1	• 230B total parameters with 10B activated per inference • Optimized for code generation and refactoring	• Polyglot code mastery • Precision code refactoring • Enhanced reasoning
MiniMax-M2.1-highspeed	• Same performance as M2.1 • Significantly faster inference	• Polyglot code mastery • Precision code refactoring • Low latency
MiniMax-M2	• Context Length: 200k tokens • Maximum Output: 128k tokens (including CoT)	• Agentic capabilities • Function calling • Advanced reasoning • Real-time streaming

Video

Models	Description	Res.& Dur.	FPS
MiniMax Hailuo 2.3	• Text to Video & Image to Video • SOTA instruction following • Extreme physics mastery	• 1080p 6s • 768p 6s, 10s	24 fps
MiniMax Hailuo 2.3Fast	• Image to Video • Extreme physics mastery • Value and Efficiency	• 1080p 6s • 768p 6s, 10s	24 fps

Legacy Models

Models	Description	Res.& Dur.	FPS
MiniMax Hailuo 02	• Text to Video & Image to Video • SOTA instruction following • Extreme physics mastery	• 1080p 6s • 768p 6s, 10s • 512p 6s, 10s	24 fps

Audio

Models	Description	Features
speech-2.8-hd	• Ultra-realistic quality featuring sound tags	• 40 languages supported • 7 emotions supported • specified languages and dialects supported
speech-2.8-turbo	• Seamless speed meets natural flow	• 40 languages supported • 7 emotions supported • specified languages and dialects supported

Legacy Models

Models	Description	Features
speech-2.6-hd	• Ultimate Similarity • Ultra-High Quality	• 40 languages supported • 7 emotions supported • specified languages and dialects supported
speech-2.6-turbo	• Ultimate Value • Low latency	• 40 languages supported • 7 emotions supported • specified languages and dialects supported
speech-02-hd	• Stronger replication similarity • High quality voice generation	• 24 languages supported • 7 emotions supported • specified languages and dialects supported
speech-02-turbo	• Superior rhythm and stability • Low latency	• 24 languages supported • 7 emotions supported • specified languages and dialects supported

Music

Models	Description	Features
music-3.0	• New Music Generation Capabilities	• Intent Understood • Sound Elevated • Vocals Humanized
music-2.6	• Cover Reborn. Bass Redefined.	• Cover Reborn. Bass Redefined.
music-cover	• Generate cover versions from reference audio	• One-step cover generation • Two-step cover with lyrics modification • Style transfer • Auto lyrics extraction

Legacy Models

Models	Description	Features
music-2.0	• Text to Music • Enhanced musicality • Natural vocals and smooth melodies	• Human-like performance • Riche emotional expression • Enhanced tone control