GLM Text to Speech
POST
Use GLM-TTS to convert text into natural speech, with support for multiple voices, emotion control, and intonation adjustment.
Request Headers
Content-Type. Enum: application/json
Authorization. Bearer authentication format: Bearer {{API Key}}.
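As a minimal sketch of the headers described above, the helper below builds them for a JSON request. It assumes the content type travels in the standard Content-Type header and the bearer token in the standard Authorization header; the function name build_headers is hypothetical and not part of the API.

```python
def build_headers(api_key: str) -> dict:
    """Return request headers for a JSON body with Bearer authentication.

    Assumption: the header names follow standard HTTP conventions.
    """
    if not api_key:
        raise ValueError("api_key must not be empty")
    return {
        "Content-Type": "application/json",
        # Format stated in the reference: Bearer {{API Key}}
        "Authorization": f"Bearer {api_key}",
    }
```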
Request Body
Text to be converted to speech. Length limit: 0 - 1024
Speech rate. Default is 1.0. Range: [0.5, 2]
Voice timbre used for audio generation. Supports both system voices and cloned voices. System voices include: tongtong (default), chuichui, xiaochen, jam, kazi, douji, luodo.
Volume. Default is 1.0. Range: (0, 10]
Audio output format. Defaults to pcm format. Possible values: wav, pcm
Controls whether a watermark is added to AI-generated audio. true: enables explicit and implicit digital watermarks by default, compliant with policy requirements. false: disables all watermarks; only effective for users who have completed the watermark removal process.
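The constraints above can be checked on the client before a request is sent. The sketch below is illustrative only: this page does not show the JSON field names, so the keys input, voice, speed, volume, response_format, and watermark_enabled are assumptions, as is the function name build_tts_payload. The limits and defaults are taken from the parameter descriptions.

```python
from typing import Optional


def build_tts_payload(
    text: str,
    voice: str = "tongtong",      # documented default system voice
    speed: float = 1.0,           # documented range [0.5, 2]
    volume: float = 1.0,          # this sketch uses the range (0, 10]
    audio_format: str = "pcm",    # documented default output format
    watermark: Optional[bool] = None,
) -> dict:
    """Validate the documented limits and return a request body as a dict.

    All dictionary keys are hypothetical; check them against the real API.
    """
    if len(text) > 1024:
        raise ValueError("text exceeds the 1024 length limit")
    if not 0.5 <= speed <= 2:
        raise ValueError("speed must be within [0.5, 2]")
    if not 0 < volume <= 10:
        raise ValueError("volume must be within (0, 10]")
    if audio_format not in ("wav", "pcm"):
        raise ValueError("audio_format must be 'wav' or 'pcm'")

    payload = {
        "input": text,
        "voice": voice,  # cloned voice names are also allowed, so no whitelist
        "speed": speed,
        "volume": volume,
        "response_format": audio_format,
    }
    # Only send the watermark flag when the caller sets it explicitly.
    if watermark is not None:
        payload["watermark_enabled"] = watermark
    return payload
```

Calling build_tts_payload("Hello", voice="xiaochen", speed=1.2) returns a dictionary that can be serialized with json.dumps and sent as the request body.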
Response
On success, returns the audio as binary data. The recommended sample rate is 24000 Hz. Format: binary
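Raw pcm output has no header, so most players need it wrapped in a container before playback. The sketch below wraps PCM bytes in a WAV container using Python's standard wave module at the recommended 24000 sample rate. The sample layout is not stated on this page: 16-bit mono is an assumption here, and the function name pcm_to_wav is hypothetical.

```python
import io
import wave


def pcm_to_wav(
    pcm_bytes: bytes,
    sample_rate: int = 24000,  # recommended sample rate from the reference
    channels: int = 1,         # assumption: mono
    sample_width: int = 2,     # assumption: 16-bit samples (2 bytes)
) -> bytes:
    """Wrap headerless PCM audio in a WAV container and return the bytes."""
    buffer = io.BytesIO()
    with wave.open(buffer, "wb") as wav_file:
        wav_file.setnchannels(channels)
        wav_file.setsampwidth(sample_width)
        wav_file.setframerate(sample_rate)
        wav_file.writeframes(pcm_bytes)
    return buffer.getvalue()
```

If the audio plays back too fast, too slow, or as noise, the assumed channel count or sample width is likely wrong for the actual response.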