Skip to main content

Music 2.5: Full-Dimensional Breakthrough

Music 2.5 achieves a full-dimensional breakthrough with “High Fidelity + Strong Control”, bringing significant improvements across four key dimensions: Instrumentation & Mixing, Vocal Performance, Structural Precision, and Sound Design.
Expanded high-sample-rate sound library (including orchestral and traditional instruments); optimized soundstage algorithms for more rational spectral distribution, allowing vocals and accompaniment to achieve complete spectral characteristics independently for a more transparent listening experience.
Deep optimization targeting AI synthesis artifacts, introducing humanized timbre simulation with significantly enhanced Flow expressiveness, achieving physically authentic “real voice” quality.
  • Full Section Tag Control: Precise support for 14+ music structure variants including Intro / Bridge / Interlude / Build-up / Hook, meeting the creative logic of complex compositions
  • Dynamic Evolution Control: Vocals can be fine-tuned for emotion and singing techniques section by section; instruments now feature precise control over orchestration, articulation, and sound texture—every sonic detail at your fingertips
Stylized filters for music—delivering more genre-specific mixing characteristics based on different music styles. The system can automatically identify and reproduce the physical characteristics of specific genres, such as:
  • Rock’s saturated distortion
  • The “Minneapolis Sound” of the 80s
  • Modern electronic’s wide-frequency transients
  • Classic jazz’s warm low-pass feel

Example: Text-to-Music Creation

import requests
import json
import os

url = "https://api.minimax.io/v1/music_generation"
api_key = os.environ["MINIMAX_API_KEY"]

headers = {
    "Content-Type": "application/json",
    "Authorization": f"Bearer {api_key}"
}

payload = {
    "model": "music-2.5",
    "prompt": "This is a contemporary R&B/Pop track with distinct Trap influences, radiating a confident, assertive, and empowered energy. It features a bright, clear, and agile female vocal with a polished and heavily processed modern sound. The singer's rhythmic and confident delivery is defined by the heavy and stylistic use of Auto-Tune, creating its signature character. Extensive backing vocals, including layered harmonies and ad-libs built upon stacked unison vocals, produce a rich and full texture, enhanced by moderate reverb for a spacious feel. Set at a tempo of 80 BPM, the arrangement is driven by a dominant 808 bassline and electronic drums with intricate hi-hat patterns and sharp claps, while atmospheric synth pads and subtle sound effects craft a dynamic backdrop. This track is perfect for clubbing, parties, driving with the windows down, or a workout session, making it an essential addition to any confidence-boosting playlist.",
    "lyrics": "[chorus]\nSummit, i reached the summit\nI'm the peak with the fire, they all want from it\nSpill a bit of my glow, like a comet\nI ain't worried 'bout hills, you just plummet\nSummit, i reached the summit\nObsidian shards 'round my throat, now they run from it\nAin't no wonder why the valleys all run from it\nI'm awake, from the summit\n[verse]\nI know what i hold\nAnd i'm about to erupt, yeah\nA story untold, yeah\nI know you won't interrupt it\nKeep your eyes on the rise, no surprise that i'm bright\nGot one stream for the sea, other stream for the night\nI be flowin', you're erodin'\nSwear you're slowin', i'm explodin'\nPressure's growin', growin', growin'\n[interlude]\nSummit, i reached the summit\nI'm the peak with the fire, they all want from it\nSpill a bit of my glow, like a comet\nI ain't worried 'bout stone\n[verse]\nI ain't worried 'bout nada\nUnless it's new earth, unless it's magma\nUnless it's deep core, a new nirvana\nUnless it's shaping a new savanna\nI wanna feel like i'm mother gaia\nI wanna feel like i'm way up\nRumbling, grumbling 'til the world pay up\nMade another island, no layups\nStay hot every single day i wake up\n[chorus]\nSummit, i reached the summit\nI'm the peak with the fire, they all want from it\nSpill a bit of my glow, like a comet\nI ain't worried 'bout hills, you just plummet\nSummit, i reached the summit\nObsidian shards 'round my throat, now they run from it\nAin't no wonder why the valleys all run from it\nI'm awake, from the summit\n[outro]\nSummit\nRooo-ar",
    "audio_setting": {
        "sample_rate": 44100,
        "bitrate": 256000,
        "format": "mp3"
    },
    "output_format": "url"
}

response = requests.post(url, headers=headers, json=payload)
result = response.json()

print(json.dumps(result, ensure_ascii=False, indent=2))

The generated audio