Voice Cloning API | Fish Audio

Fish Audio Voice Cloning

curl --request POST \
  --url https://api.myrouter.ai/v4beta/model \
  --header 'Authorization: <authorization>' \
  --header 'Content-Type: <content-type>' \
  --data '
{
  "type": {},
  "title": "<string>",
  "train_mode": {},
  "voices": [
    null
  ],
  "visibility": {},
  "description": {},
  "cover_image": {},
  "texts": [
    "<string>"
  ],
  "tags": [
    "<string>"
  ],
  "enhance_audio_quality": true
}
'

{
  "_id": "<string>",
  "type": {},
  "title": "<string>",
  "description": "<string>",
  "cover_image": "<string>",
  "state": {},
  "tags": [
    "<string>"
  ],
  "created_at": {},
  "updated_at": {},
  "visibility": {},
  "like_count": 123,
  "mark_count": 123,
  "shared_count": 123,
  "task_count": 123,
  "author": {
    "_id": "<string>",
    "nickname": "<string>",
    "avatar": "<string>"
  },
  "train_mode": {},
  "samples": [
    {
      "title": "<string>",
      "text": "<string>",
      "task_id": "<string>",
      "audio": "<string>"
    }
  ],
  "languages": [
    "<string>"
  ],
  "lock_visibility": true,
  "unliked": true,
  "liked": true,
  "marked": true
}

POST

v4beta

model

Fish Audio Voice Cloning

curl --request POST \
  --url https://api.myrouter.ai/v4beta/model \
  --header 'Authorization: <authorization>' \
  --header 'Content-Type: <content-type>' \
  --data '
{
  "type": {},
  "title": "<string>",
  "train_mode": {},
  "voices": [
    null
  ],
  "visibility": {},
  "description": {},
  "cover_image": {},
  "texts": [
    "<string>"
  ],
  "tags": [
    "<string>"
  ],
  "enhance_audio_quality": true
}
'

{
  "_id": "<string>",
  "type": {},
  "title": "<string>",
  "description": "<string>",
  "cover_image": "<string>",
  "state": {},
  "tags": [
    "<string>"
  ],
  "created_at": {},
  "updated_at": {},
  "visibility": {},
  "like_count": 123,
  "mark_count": 123,
  "shared_count": 123,
  "task_count": 123,
  "author": {
    "_id": "<string>",
    "nickname": "<string>",
    "avatar": "<string>"
  },
  "train_mode": {},
  "samples": [
    {
      "title": "<string>",
      "text": "<string>",
      "task_id": "<string>",
      "audio": "<string>"
    }
  ],
  "languages": [
    "<string>"
  ],
  "lock_visibility": true,
  "unliked": true,
  "liked": true,
  "marked": true
}

Fish Audio API for creating voice models (voice cloning).

Request Headers

Content-Type

string

required

Enum: application/json

Authorization

string

required

Bearer authentication format: Bearer {{API Key}}.

Request Body

type

enum<string>

required

Model type. tts stands for text-to-speech.Possible values: ttsAllowed values: "tts"

title

string

required

Model title or name.

train_mode

enum<string>

required

Model training mode. For TTS models, fast means the model is available immediately after creation.Possible values: fastAllowed values: "fast"

voices

file[]

required

Upload voice files for model fine-tuning.

visibility

enum<string>

default:"public"

Model visibility. public will display on the discovery page, unlist allows anyone with the link to access, private is visible only to the creator.Possible values: public, unlist, private

description

string | null

Model description.

cover_image

file | null

Model cover image. Required if the model is public.

texts

string[]

Text corresponding to the voices. If not specified, ASR (Automatic Speech Recognition) will be performed on the voices.

Response

_id

string

required

Unique identifier of the created model.

type

enum<string>

required

Model type.Possible values: svc, tts

title

string

required

Model title or name.

description

string

required

Model description.

cover_image

string

required

URL of the model cover image.

state

enum<string>

required

Current state of the model.Possible values: created, training, trained, failed

​Request Headers

​Request Body

​Response

Request Headers

Request Body

Response