Voice Design

curl --request POST \
  --url https://api.minimax.io/v1/voice_design \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: application/json' \
  --data @- <<EOF
{
  "prompt": "Excited and enthusiastic male product reviewer (e.g., tech vlogger), fast-paced, high energy, and persuasive.",
  "preview_text": "What is UP, everyone! Today we're unboxing the brand new Gadget X-Pro, and let me tell you, this thing is absolutely insane! The features are next level, you guys are gonna love this!"
}
EOF

{
  "trial_audio": "hex-encoded audio",
  "voice_id": "ttv-voice-2025060717322425-xxxxxxxx",
  "base_resp": {
    "status_code": 0,
    "status_msg": "success"
  }
}

POST /v1/voice_design

Authorizations

Authorization

string

header

required

HTTP: Bearer Auth

Security Scheme Type: http
HTTP Authorization Scheme: Bearer. Use your API key as the token; it can be found under Account Management > API Keys.

Headers

Content-Type

enum<string>

default: application/json

required

The media type of the request body. Must be set to application/json to ensure the data is sent in JSON format.

Available options:

application/json

Body

application/json

Voice design request parameters

prompt

string

required

Voice description: a text description of the voice to generate, such as speaker traits, pace, and tone.

preview_text

string

required

The text used for generating a preview audio sample.

Note: Generating preview audio incurs a fee of $30 per 1M characters.

Maximum string length: 500
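The fee and length limit above can be checked before sending a request. The following is a minimal sketch, not part of the official API: the function name `estimate_preview_cost` is hypothetical, and it assumes characters are billed the way Python's `len()` counts them, which the documentation does not state.

```python
PRICE_PER_MILLION_CHARS_USD = 30.0  # from the fee note above
MAX_PREVIEW_TEXT_LENGTH = 500       # from the documented maximum string length


def estimate_preview_cost(preview_text: str) -> float:
    """Return the estimated preview fee in USD for a single request.

    Assumption: billing counts characters the same way len() does.
    """
    n_chars = len(preview_text)
    if n_chars > MAX_PREVIEW_TEXT_LENGTH:
        raise ValueError(
            f"preview_text has {n_chars} characters; "
            f"the maximum is {MAX_PREVIEW_TEXT_LENGTH}"
        )
    return n_chars * PRICE_PER_MILLION_CHARS_USD / 1_000_000
```

Under these assumptions, a preview at the 500-character maximum works out to roughly $0.015 per request.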

voice_id

string

Custom voice ID for the generated voice. If not provided, a unique voice_id will be automatically created and returned.
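The body parameters above map directly onto a JSON payload. As an illustration, here is a sketch that builds the request with Python's standard library; the helper name `build_voice_design_request` is hypothetical, and the URL and headers are taken from the curl example on this page.

```python
import json
import urllib.request
from typing import Optional

VOICE_DESIGN_URL = "https://api.minimax.io/v1/voice_design"


def build_voice_design_request(
    api_key: str,
    prompt: str,
    preview_text: str,
    voice_id: Optional[str] = None,
) -> urllib.request.Request:
    """Build (but do not send) a POST request for the voice_design endpoint."""
    payload = {"prompt": prompt, "preview_text": preview_text}
    # voice_id is optional; omit it to let the service generate one.
    if voice_id is not None:
        payload["voice_id"] = voice_id
    return urllib.request.Request(
        VOICE_DESIGN_URL,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
```

The returned object can be passed to `urllib.request.urlopen`, and the response body parsed with `json.load`. Building the request separately from sending it keeps the payload easy to inspect before any fee is incurred.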

Response

200 - application/json

voice_id

string

The generated voice ID, which can be used for speech synthesis.

trial_audio

string

The generated preview audio in hex-encoded format.

base_resp

object

Status code and details.
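Because `trial_audio` is hex-encoded, it must be decoded before it can be played or saved. The sketch below shows one way to handle a parsed response; the function name `parse_voice_design_response` is hypothetical. It assumes that a `status_code` of 0 means success, as in the sample response, and treats any other value as a failure, which this page does not explicitly confirm.

```python
from typing import Tuple


def parse_voice_design_response(body: dict) -> Tuple[str, bytes]:
    """Return (voice_id, decoded preview audio) from a parsed JSON response.

    Assumption: status_code 0 means success; anything else raises.
    """
    base_resp = body.get("base_resp", {})
    status_code = base_resp.get("status_code")
    if status_code != 0:
        raise RuntimeError(
            f"voice_design failed: {status_code} {base_resp.get('status_msg')}"
        )
    # bytes.fromhex converts the hex string into raw audio bytes.
    audio = bytes.fromhex(body["trial_audio"])
    return body["voice_id"], audio
```

The decoded bytes can be written to a file opened in binary mode. The audio container format is not stated on this page, so the file extension is left to the caller.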

