The generated voices (voice_id) can then be used in the T2A API and the T2A Async API for speech generation.

Supported Models

It is recommended to use speech-02-hd for the best results.

Model                     Description
speech-2.5-hd-preview     Latest HD model with outstanding prosody and excellent cloning similarity.
speech-2.5-turbo-preview  Latest Turbo model with support for 40 languages.
speech-02-hd              Superior rhythm and stability, with outstanding performance in replication similarity and sound quality.
speech-02-turbo           Superior rhythm and stability, with enhanced multilingual capabilities and excellent performance.
speech-01-hd              Rich voices, expressive emotions, authentic languages.
speech-01-turbo           Excellent performance and low latency.

Notes

  • Using this API to generate a voice does not immediately incur a fee. The generation fee will be charged upon the first use of the generated voice in speech synthesis (excluding preview actions within this API).
  • Voices generated through this API are temporary. To keep a voice permanently, you must use it in a speech synthesis API within 168 hours (7 days) of generating it. Voices that are not used within this period are automatically deleted.
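
The 168-hour retention window in the note above can be tracked on the client side. The following is a minimal sketch, not part of the API: the names RETENTION, expiry_time, and is_expired are hypothetical, and it assumes you record the generation time yourself and that a voice counts as expired at exactly 168 hours.

```python
from datetime import datetime, timedelta, timezone

# Retention window stated in the notes: 168 hours (7 days).
RETENTION = timedelta(hours=168)


def expiry_time(created_at: datetime) -> datetime:
    """Return the moment an unused voice would be deleted (hypothetical helper)."""
    return created_at + RETENTION


def is_expired(created_at: datetime, now: datetime) -> bool:
    """True once the retention window has fully elapsed (boundary is an assumption)."""
    return now >= expiry_time(created_at)


# Example: a voice generated at noon UTC on 1 January 2025.
created = datetime(2025, 1, 1, 12, 0, tzinfo=timezone.utc)
print(expiry_time(created).isoformat())  # 2025-01-08T12:00:00+00:00
```

Storing the generation timestamp next to each voice_id makes it possible to run a first synthesis request before the deadline for any voice you intend to keep.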