curl --request POST \
--url https://api.minimax.io/v1/voice_clone \
--header 'Authorization: Bearer <token>' \
--header 'Content-Type: application/json' \
--data '
{
"file_id": 123456789,
"voice_id": "<voice_id>",
"clone_prompt": {
"prompt_audio": 987654321,
"prompt_text": "This voice sounds natural and pleasant."
},
"text": "A gentle breeze sweeps across the soft grass(breath), carrying the fresh scent along with the songs of birds.",
"model": "speech-2.8-hd",
"need_noise_reduction": false,
"need_volume_normalization": false
}
'

{
"input_sensitive": false,
"input_sensitive_type": 0,
"demo_audio": "",
"base_resp": {
"status_code": 0,
"status_msg": "success"
}
}

Use this API for rapid voice cloning. If a cloned voice is not used within 7 days, the system will delete it.
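
A caller will usually want to confirm that a clone request succeeded before relying on the new voice. The sketch below parses a response body shaped like the sample response and treats any base_resp.status_code other than 0 as a failure. That interpretation, the helper name check_clone_response, and the VoiceCloneError class are assumptions made for illustration; they are not part of the documented API.

```python
import json


class VoiceCloneError(Exception):
    """Hypothetical error type for a failed voice clone call."""


def check_clone_response(body: str) -> dict:
    """Parse a voice_clone response body and return it if it reports success."""
    data = json.loads(body)
    base = data.get("base_resp", {})
    # Assumption: status_code 0 means success, as in the sample response.
    if base.get("status_code") != 0:
        raise VoiceCloneError(f"{base.get('status_code')}: {base.get('status_msg')}")
    return data


sample = (
    '{"input_sensitive": false, "input_sensitive_type": 0, "demo_audio": "", '
    '"base_resp": {"status_code": 0, "status_msg": "success"}}'
)
result = check_clone_response(sample)
print(result["base_resp"]["status_msg"])  # prints: success
```

Raising an exception rather than returning a flag keeps a failed clone from being silently treated as a usable voice_id.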
Authorization (HTTP: Bearer Auth)
Bearer API_key. The API key can be found in Account Management > API Keys.

Content-Type
The media type of the request body. Must be set to application/json to ensure the data is sent in JSON format.
Allowed value: application/json

Voice clone request parameters
The file_id of the audio to be cloned, obtained through the File Upload API.
Uploaded files must comply with the following rules:
The voice_id of the cloned voice. Example: "MiniMax001". When defining a custom voice_id, note the following rules:
- Permitted symbols are "-" and "_"
- The voice_id must not end with "-" or "_"
- The voice_id must not duplicate an existing voice_id, otherwise an error will occur

Voice cloning parameters (clone_prompt). Providing this field helps improve the similarity and stability of the synthesized voice. If used, you must also upload a short sample audio clip (less than 8s; supported formats: mp3, m4a, wav) along with its corresponding transcript.
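
The limits on the clone_prompt sample clip (shorter than 8s, in mp3, m4a, or wav format) can be checked locally before uploading. The following is a minimal sketch using only the Python standard library; the function name check_prompt_sample is hypothetical, and the duration check covers WAV files only because the standard library cannot read mp3 or m4a.

```python
import wave
from pathlib import Path

ALLOWED_SUFFIXES = {".mp3", ".m4a", ".wav"}  # formats listed for the sample clip
MAX_SECONDS = 8.0  # the sample clip must be shorter than this


def check_prompt_sample(path: str) -> None:
    """Raise ValueError if the sample clip clearly breaks the stated limits."""
    p = Path(path)
    suffix = p.suffix.lower()
    if suffix not in ALLOWED_SUFFIXES:
        raise ValueError(f"unsupported format: {suffix or '(none)'}")
    # Duration is only checked for WAV, which the standard library can read.
    if suffix == ".wav":
        with wave.open(str(p), "rb") as wav:
            seconds = wav.getnframes() / wav.getframerate()
        if seconds >= MAX_SECONDS:
            raise ValueError(f"sample is {seconds:.1f}s, must be under {MAX_SECONDS:.0f}s")
```

A clip that passes this check can still be rejected by the service; the check only catches the two limits stated here.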
text: Optional preview text, up to 1000 characters. The cloned voice reads this text, and a link to the audio preview is returned. Note: preview requests are charged based on character count, consistent with T2A pricing.
Interjection tags can be embedded in the text when using the speech-2.8-hd or speech-2.8-turbo models. Supported interjections: (laughs), (chuckle), (coughs), (clear-throat), (groans), (breath), (pant), (inhale), (exhale), (gasps), (sniffs), (sighs), (snorts), (burps), (lip-smacking), (humming), (hissing), (emm), (whistles), (sneezes), (crying), (applause).

model: Specifies which voice synthesis model to use for generating the preview audio. Required when the text field is provided.
Supported values: speech-2.8-hd, speech-2.8-turbo, speech-2.6-hd, speech-2.6-turbo, speech-02-hd, speech-02-turbo, speech-01-hd, speech-01-turbo

Controls whether recognition for specific minority languages and dialects is enhanced. Default is null. If the language type is unknown, set to "auto" and the model will automatically detect it.
Supported values: Chinese, Chinese,Yue, English, Arabic, Russian, Spanish, French, Portuguese, German, Turkish, Dutch, Ukrainian, Vietnamese, Indonesian, Japanese, Italian, Korean, Thai, Polish, Romanian, Greek, Czech, Finnish, Hindi, Bulgarian, Danish, Hebrew, Malay, Persian, Slovak, Swedish, Croatian, Filipino, Hungarian, Norwegian, Slovenian, Catalan, Nynorsk, Tamil, Afrikaans, auto

need_noise_reduction: Indicates whether to enable noise reduction.

need_volume_normalization: Indicates whether to enable volume normalization.
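
The request described above can also be assembled in Python. The sketch below builds an unsent POST request with the Bearer token and JSON content type, and enforces two constraints from the parameter descriptions: preview text is at most 1000 characters, and model is required whenever text is given. The function name build_clone_request is hypothetical, and treating file_id and voice_id as the only mandatory fields is an assumption based on the curl example.

```python
import json
import urllib.request

API_URL = "https://api.minimax.io/v1/voice_clone"


def build_clone_request(token, file_id, voice_id, text=None, model=None):
    """Return an unsent urllib Request for the voice_clone endpoint."""
    # Assumption: file_id and voice_id are the only mandatory body fields.
    payload = {"file_id": file_id, "voice_id": voice_id}
    if text is not None:
        if len(text) > 1000:
            raise ValueError("preview text is limited to 1000 characters")
        if model is None:
            raise ValueError("model is required when text is provided")
        payload["text"] = text
        payload["model"] = model
    return urllib.request.Request(
        API_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


req = build_clone_request(
    "<token>", 123456789, "MiniMax001",
    text="A gentle breeze(breath) sweeps by.", model="speech-2.8-hd",
)
# Sending is left to the caller, for example urllib.request.urlopen(req).
print(req.get_method(), req.full_url)
```

Building the request separately from sending it makes the payload easy to inspect or log before any characters are billed for a preview.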