For best results, it is recommended to upload reference audio using the Voice Cloning API before using this API. This will improve voice quality and reduce latency.
Fish Audio converts text to speech.Supported audio formats:
Controls the randomness of speech generation. Higher values (e.g., 1.0) make the output more random, lower values (e.g., 0.1) make it more deterministic. We recommend 0.9 for the s1 model.Required range: 0 <= x <= 1
Controls diversity through nucleus sampling. Lower values (e.g., 0.1) make the output more focused, higher values (e.g., 1.0) allow more diversity. We recommend 0.9 for the s1 model.Required range: 0 <= x <= 1