Skip to main content
POST
/
v1
/
voice_design
Voice Design
curl --request POST \
  --url https://api.minimax.io/v1/voice_design \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: <content-type>' \
  --data '{
  "prompt": "Excited and enthusiastic male product reviewer (e.g., tech vlogger), fast-paced, high energy, and persuasive.",
  "preview_text": "What is UP, everyone! Today we'\''re unboxing the brand new Gadget X-Pro, and let me tell you, this thing is absolutely insane! The features are next level, you guys are gonna love this!"
}'
{
  "trial_audio": "hex-encoded audio",
  "voice_id": "ttv-voice-2025060717322425-xxxxxxxx",
  "base_resp": {
    "status_code": 0,
    "status_msg": "success"
  }
}

Authorizations

Authorization
string
header
required

HTTP: Bearer Auth

Headers

Content-Type
enum<string>
default:application/json
required

The media type of the request body. Must be set to application/json to ensure the data is sent in JSON format.

Available options:
application/json

Body

application/json

Voice design request parameters

prompt
string
required

Voice description.

preview_text
string
required

The text used for generating a preview audio sample.

Note: Generating preview audio incurs a fee of $30 per 1M characters.

voice_id
string

Custom voice ID for the generated voice. If not provided, a unique voice_id will be automatically created and returned.

Response

200 - application/json
voice_id
string

The generated voice ID, which can be used for speech synthesis.

trial_audio
string

The generated preview audio in hex-encoded format.

base_resp
object

Status code and details.