Transcribes audio files to text using the GLM-ASR-2512 model, with multi-language transcription support.
Authentication uses a bearer token in the format: Bearer {{API Key}}.
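A minimal sketch of the token format above, assuming the token is sent in the standard HTTP Authorization header. The helper name build_auth_header is hypothetical and not part of the API.

```python
def build_auth_header(api_key: str) -> dict:
    """Return request headers carrying the API key as a bearer token."""
    key = api_key.strip()
    if not key:
        raise ValueError("API key must not be empty")
    # Assumption: the bearer token goes in the standard Authorization header.
    return {"Authorization": f"Bearer {key}"}
```

The returned dictionary can be passed as the headers of whichever HTTP client you use.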
Request Body
The audio file to transcribe, given as a URL or a Base64-encoded string. Supported audio formats: .wav / .mp3. Limits: file size ≤ 25 MB, audio duration ≤ 30 seconds.
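The format and size limits above can be checked locally before a file is Base64-encoded. The following is a sketch only: the function name encode_audio is hypothetical, and reading "25 MB" as 25 × 1024 × 1024 bytes is an assumption.

```python
import base64
from pathlib import Path

# Assumption: "25 MB" is interpreted as 25 MiB; adjust if the service counts differently.
MAX_BYTES = 25 * 1024 * 1024
ALLOWED_SUFFIXES = {".wav", ".mp3"}


def encode_audio(path: str) -> str:
    """Return the file content as a Base64 string after basic local checks."""
    p = Path(path)
    if p.suffix.lower() not in ALLOWED_SUFFIXES:
        raise ValueError(f"unsupported audio format: {p.suffix!r}")
    size = p.stat().st_size
    if size > MAX_BYTES:
        raise ValueError(f"file is {size} bytes, limit is {MAX_BYTES}")
    # b64encode returns bytes; decode to get a plain ASCII string.
    return base64.b64encode(p.read_bytes()).decode("ascii")
```

This sketch does not check the 30-second duration limit, which would require parsing the audio itself.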
For long-text scenarios, you can provide previous transcription results as context. Keeping the context under 8000 characters is recommended.
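One way to stay within the recommended context length is to keep only the most recent part of the earlier transcript. This is a sketch under assumptions: the name trim_context is hypothetical, and keeping the tail rather than the head is a design choice, not something the API prescribes.

```python
MAX_CONTEXT_CHARS = 8000  # recommended upper bound from the description above


def trim_context(previous_text: str, max_chars: int = MAX_CONTEXT_CHARS) -> str:
    """Keep at most max_chars characters from the end of previous_text."""
    if max_chars <= 0:
        return ""
    # Assumption: the text closest to the current audio is the most useful context.
    return previous_text[-max_chars:]
```

When transcribing a long recording in 30-second segments, the accumulated text of earlier segments can be passed through trim_context before each new request.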
Hotword list to improve recognition accuracy for domain-specific vocabulary. Format example: ["person name", "place name"]. Recommended not to exceed 100 entries. Array length: 0 - 100.
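A hotword list in the format above can be cleaned and length-checked before sending. This is a minimal sketch: the helper name build_hotwords is hypothetical, and trimming whitespace and dropping duplicates are assumptions about sensible client-side hygiene rather than documented API behavior.

```python
import json

MAX_HOTWORDS = 100  # array length limit from the description above


def build_hotwords(words):
    """Strip, de-duplicate (preserving order) and length-check a hotword list."""
    seen = set()
    result = []
    for word in words:
        word = word.strip()
        if word and word not in seen:
            seen.add(word)
            result.append(word)
    if len(result) > MAX_HOTWORDS:
        raise ValueError(f"{len(result)} hotwords given, limit is {MAX_HOTWORDS}")
    return result


# Serialized as a JSON array of strings, matching the format example.
print(json.dumps(build_hotwords(["person name", " place name ", "person name"])))
# prints ["person name", "place name"]
```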
Response
The complete transcribed text of the audio.