Skip to main content
POST
/
v4beta
/
model
Fish Audio Voice Cloning
curl --request POST \
  --url https://api.myrouter.ai/v4beta/model \
  --header 'Authorization: <authorization>' \
  --header 'Content-Type: <content-type>' \
  --data '
{
  "type": {},
  "title": "<string>",
  "train_mode": {},
  "voices": [
    null
  ],
  "visibility": {},
  "description": {},
  "cover_image": {},
  "texts": [
    "<string>"
  ],
  "tags": [
    "<string>"
  ],
  "enhance_audio_quality": true
}
'
{
  "_id": "<string>",
  "type": {},
  "title": "<string>",
  "description": "<string>",
  "cover_image": "<string>",
  "state": {},
  "tags": [
    "<string>"
  ],
  "created_at": {},
  "updated_at": {},
  "visibility": {},
  "like_count": 123,
  "mark_count": 123,
  "shared_count": 123,
  "task_count": 123,
  "author": {
    "_id": "<string>",
    "nickname": "<string>",
    "avatar": "<string>"
  },
  "train_mode": {},
  "samples": [
    {
      "title": "<string>",
      "text": "<string>",
      "task_id": "<string>",
      "audio": "<string>"
    }
  ],
  "languages": [
    "<string>"
  ],
  "lock_visibility": true,
  "unliked": true,
  "liked": true,
  "marked": true
}
Fish Audio API for creating voice models (voice cloning).

Request Headers

Content-Type
string
required
Enum: application/json
Authorization
string
required
Bearer authentication format: Bearer {{API Key}}.

Request Body

type
enum<string>
required
Model type. tts stands for text-to-speech.Possible values: ttsAllowed values: "tts"
title
string
required
Model title or name.
train_mode
enum<string>
required
Model training mode. For TTS models, fast means the model is available immediately after creation.Possible values: fastAllowed values: "fast"
voices
file[]
required
Upload voice files for model fine-tuning.
visibility
enum<string>
default:"public"
Model visibility. public will display on the discovery page, unlist allows anyone with the link to access, private is visible only to the creator.Possible values: public, unlist, private
description
string | null
Model description.
cover_image
file | null
Model cover image. Required if the model is public.
texts
string[]
Text corresponding to the voices. If not specified, ASR (Automatic Speech Recognition) will be performed on the voices.
tags
string[]
Model tags.
enhance_audio_quality
boolean
default:false
Enhance audio quality.

Response

_id
string
required
Unique identifier of the created model.
type
enum<string>
required
Model type.Possible values: svc, tts
title
string
required
Model title or name.
description
string
required
Model description.
cover_image
string
required
URL of the model cover image.
state
enum<string>
required
Current state of the model.Possible values: created, training, trained, failed
tags
string[]
required
Model tags.
created_at
string<date-time>
required
Timestamp when the model was created.
updated_at
string<date-time>
required
Timestamp when the model was last updated.
visibility
enum<string>
required
Model visibility setting.Possible values: public, unlist, private
like_count
integer
required
Number of likes the model has received.
mark_count
integer
required
Number of bookmarks the model has received.
shared_count
integer
required
Number of times the model has been shared.
task_count
integer
required
Number of tasks associated with the model.
author
AuthorEntity · object
required
Information about the model author.
train_mode
enum<string>
default:"full"
Training mode used by the model.Possible values: fast, full
samples
SampleEntity · object[]
Sample data associated with the model.
languages
string[]
Languages supported by the model.
lock_visibility
boolean
default:false
Whether the visibility setting is locked.
unliked
boolean
default:false
Whether the current user has unliked this model.
liked
boolean
default:false
Whether the current user has liked this model.
marked
boolean
default:false
Whether the current user has bookmarked this model.