Skip to main content
This API provides synchronous text-to-speech (T2A) generation, supporting up to 10,000 characters per request.
The interface is stateless: each call only processes the provided input without involving business logic, and the model does not store any user data.
Key Features
  1. Access to 300+ system voices and custom cloned voices.
  2. Adjustable volume, pitch, speed, and output formats.
  3. Support for proportional audio mixing.
  4. Configurable fixed time intervals.
  5. Multiple audio formats and specifications supported: mp3, pcm, flac, wav (wav is supported only in non-streaming mode).
  6. Support for streaming output.
Typical Use Cases: short text generation, voice chat, online social interactions.

Supported Models

ModelDescription
speech-2.5-hd-previewLatest HD model with outstanding prosody and excellent cloning similarity.
speech-2.5-turbo-previewLatest Turbo model with support for 40 languages.
speech-02-hdSuperior rhythm and stability, with outstanding performance in replication similarity and sound quality.
speech-02-turboSuperior rhythm and stability, with enhanced multilingual capabilities and excellent performance.
speech-01-hdRich Voices, Expressive Emotions, Authentic Languages.
speech-01-turboExcellent performance and low latency.

Available Interfaces

Synchronous speech synthesis provides two interfaces. Choose based on your needs:

Supported Languages

MiniMax speech synthesis models offer robust multilingual capability, supporting 40 widely used languages worldwide.
Our goal is to break down language barriers and build a truly global AI model.
Support Languages                                                     
1. Chinese15. Turkish28. Malay
2. Cantonese16. Dutch29. Persian
3. English17. Ukrainian30. Slovak
4. Spanish18. Thai31. Swedish
5. French19. Polish32. Croatian
6. Russian20. Romanian33. Filipino
7. German21. Greek34. Hungarian
8. Portuguese22. Czech35. Norwegian
9. Arabic23. Finnish36. Slovenian
10. Italian24. Hindi37. Catalan
11. Japanese25. Bulgarian38. Nynorsk
12. Korean26. Danish39. Tamil
13. Indonesian27. Hebrew40. Afrikaans
14. Vietnamese

Official MCP

MiniMax provides official Model Context Protocol (MCP) server implementations with speech synthesis support: For detailed usage instructions, see the MiniMax MCP User Guide.