Use GLM-TTS to convert text into natural speech, with support for multiple voices, emotion control, and intonation adjustment.
Bearer authentication format: Bearer {{API Key}}.
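As a minimal sketch of the Bearer format above, the helper below builds (but does not send) an HTTP request carrying the token. The function name `make_tts_request`, the placeholder URL, the JSON content type, and the use of POST are assumptions for illustration; this page does not state them.

```python
import json
import urllib.request


def make_tts_request(url, api_key, payload):
    """Build (but do not send) a request carrying the Bearer token.

    Assumptions: the endpoint accepts a JSON body via POST. The URL is
    supplied by the caller because this page does not give one.
    """
    return urllib.request.Request(
        url,
        data=json.dumps(payload).encode("utf-8"),
        headers={
            # Format taken from the page: "Bearer {{API Key}}"
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",  # assumed
        },
        method="POST",  # assumed
    )


# Hypothetical placeholder URL; replace with the real endpoint.
req = make_tts_request("https://example.invalid/tts", "YOUR_API_KEY", {"voice": "tongtong"})
```

Passing the returned object to `urllib.request.urlopen` would send it; that step is left out so the sketch runs without network access.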
Request Body
Text to be converted to speech. Length limit: 0 - 1024.
Speech rate. Default is 1.0. Range: [0.5, 2].
voice (string, required, default: "tongtong")
Voice timbre used for audio generation. Supports both system voices and cloned voices. System voices include: tongtong (default), chuichui, xiaochen, jam, kazi, douji, luodo.
Volume. Default is 1.0. Range: (0, 10] per the description; the schema constraint is listed as [0, 10].
Audio output format. Defaults to pcm. Possible values: wav, pcm.
Controls whether a watermark is added to AI-generated audio. true: enables explicit and implicit digital watermarks by default, in compliance with policy requirements. false: disables all watermarks; this takes effect only for users who have completed the watermark removal process.
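The limits listed above can be checked on the client before a request is sent. The sketch below is one way to do that, not the service's own validation. Only `voice` is named on this page; the keys `input`, `speed`, `volume`, and `response_format` are assumed names, the length limit is assumed to count characters, and the volume range follows the (0, 10] reading.

```python
# System voices listed on this page. Cloned voices are also supported,
# so the voice value is deliberately not validated against this set.
SYSTEM_VOICES = ("tongtong", "chuichui", "xiaochen", "jam", "kazi", "douji", "luodo")


def build_tts_payload(text, voice="tongtong", speed=1.0, volume=1.0,
                      response_format="pcm"):
    """Validate arguments against the documented limits and return a dict.

    All keys except "voice" are assumed names, not taken from this page.
    """
    if len(text) > 1024:  # assumed to be a character count
        raise ValueError("text exceeds the 1024 length limit")
    if not 0.5 <= speed <= 2:
        raise ValueError("speed must be in [0.5, 2]")
    if not 0 < volume <= 10:  # using the (0, 10] reading
        raise ValueError("volume must be in (0, 10]")
    if response_format not in ("wav", "pcm"):
        raise ValueError("response_format must be 'wav' or 'pcm'")
    return {
        "input": text,                       # assumed key
        "voice": voice,
        "speed": speed,                      # assumed key
        "volume": volume,                    # assumed key
        "response_format": response_format,  # assumed key
    }


payload = build_tts_payload("Hello, world.", voice="xiaochen", speed=1.2)
```

Validating locally avoids a round trip for values the service would presumably reject; the watermark flag is omitted because its field name is not given here.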
Response
On success, the response body is the generated audio. The recommended sample rate is 24000 Hz.
Format: binary
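Because pcm output is raw samples with no header, most players need it wrapped in a container first. The sketch below wraps raw PCM bytes in a WAV container at the recommended 24000 Hz using Python's standard `wave` module. The sample width (16-bit) and channel count (mono) are assumptions; this page does not specify them.

```python
import io
import wave


def pcm_to_wav(pcm_bytes, sample_rate=24000, channels=1, sample_width=2):
    """Wrap raw PCM bytes in a WAV container and return the WAV bytes.

    sample_rate follows the recommended 24000 Hz. channels=1 and
    sample_width=2 (16-bit) are assumptions, not documented values.
    """
    buf = io.BytesIO()
    with wave.open(buf, "wb") as writer:
        writer.setnchannels(channels)
        writer.setsampwidth(sample_width)
        writer.setframerate(sample_rate)
        writer.writeframes(pcm_bytes)
    return buf.getvalue()


# Stand-in for a real response body: 0.1 s of 16-bit mono silence.
wav_bytes = pcm_to_wav(b"\x00\x00" * 2400)
```

If the assumed sample width or channel count is wrong, the file will still be written but will play back distorted or at the wrong speed, so those values should be confirmed against real output.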