POST /model/infer/whisper
curl --request POST \
  --url https://http.whisper.proxy.prod.s9t.link/model/infer/whisper \
  --header 'Authorization: Bearer <token>' \
  --header 'Content-Type: application/json' \
  --data '{
  "audio_data": "base64_encoded_audio_content",
  "language": "en",
  "task": "transcribe",
  "word_timestamps": true,
  "diarization": false,
  "streaming": false,
  "batch_size": 24,
  "length_penalty": 1,
  "patience": 1,
  "vad_onset": 0.5,
  "vad_offset": 0.363
}'
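The same request can be issued from Python's standard library. This is a sketch mirroring the curl example above; the endpoint URL and field names come from this reference, while the token and audio payload are placeholders:

```python
import json
import urllib.request

API_URL = "https://http.whisper.proxy.prod.s9t.link/model/infer/whisper"

def build_request(token: str, payload: dict) -> urllib.request.Request:
    """Build a POST request matching the curl example above."""
    body = json.dumps(payload).encode("utf-8")
    return urllib.request.Request(
        API_URL,
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {token}",
            "Content-Type": "application/json",
        },
    )

payload = {
    "audio_data": "base64_encoded_audio_content",  # see the audio_data field below
    "language": "en",
    "task": "transcribe",
    "word_timestamps": True,
}
req = build_request("<token>", payload)
# resp = urllib.request.urlopen(req)   # uncomment to actually send
# result = json.load(resp)
```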
{
  "transcription": [
    "Hello, this is a test.",
    "The audio quality is good."
  ],
  "segments": [
    {
      "start": 0,
      "end": 2.5,
      "text": "Hello, this is a test.",
      "words": [
        {
          "word": "Hello",
          "start": 0,
          "end": 0.5
        }
      ]
    }
  ],
  "request_time": 2.5,
  "language": "en"
}
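When word_timestamps is enabled, the segments array nests per-word timings inside each segment. A minimal sketch that flattens the sample response above into (word, start, end) tuples:

```python
def word_timings(response: dict) -> list[tuple[str, float, float]]:
    """Collect (word, start, end) tuples from every segment of a response."""
    timings = []
    for segment in response.get("segments", []):
        for w in segment.get("words", []):
            timings.append((w["word"], w["start"], w["end"]))
    return timings

# The sample response shown above:
sample = {
    "transcription": ["Hello, this is a test.", "The audio quality is good."],
    "segments": [
        {
            "start": 0,
            "end": 2.5,
            "text": "Hello, this is a test.",
            "words": [{"word": "Hello", "start": 0, "end": 0.5}],
        }
    ],
    "request_time": 2.5,
    "language": "en",
}
print(word_timings(sample))  # [('Hello', 0, 0.5)]
```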

Authorizations

Authorization
string
header
required

JWT token for authentication

Body

application/json
audio_data
string
required

Note: this is a string field, not a file upload. Convert your audio file (.mp3, .wav, or .flac) to base64 first, then pass the resulting string here.

Example:

"base64_encoded_audio_content"
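Producing that base64 string from a local file takes one standard-library call (the file path is illustrative):

```python
import base64

def encode_audio(path: str) -> str:
    """Read an audio file and return its base64 string for the audio_data field."""
    with open(path, "rb") as f:
        return base64.b64encode(f.read()).decode("ascii")

# payload = {"audio_data": encode_audio("meeting.wav"), "language": "en", "task": "transcribe"}
```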

language
string
required

Language code (e.g. 'en' for English)

Example:

"en"

task
enum<string>
required

Task type: transcribe in the source language, or translate the audio to English

Available options:
transcribe,
translate
Example:

"transcribe"

initial_prompt
string

Optional starting text prompt for context

Example:

"Meeting transcript between John and Sarah:"

beam_size
integer
default:5

Number of parallel sequences evaluated

Required range: 1 <= x <= 5
best_of
integer
default:5

Number of best sequences considered

Required range: 1 <= x <= 5
word_timestamps
boolean
default:false

Include word-level timestamps

diarization
boolean
default:false

Enable speaker diarization

vad_filter
boolean
default:true

Enable voice activity detection filter

without_timestamps
boolean
default:false

Exclude timestamps from output

streaming
boolean
default:false

Enable streaming output

min_speakers
integer
default:0

Minimum number of speakers to detect (0 for automatic)

Required range: x >= 0
max_speakers
integer
default:0

Maximum number of speakers to detect (0 for automatic)

Required range: x >= 0
batch_size
integer
default:24

Number of audio samples processed in one batch

Required range: 0 <= x <= 24
length_penalty
number
default:1

Penalty for longer sequences (1.0 means no penalty)

Required range: x >= 0
patience
number
default:1

Beam search patience factor

Required range: 0 <= x <= 1
min_duration_off
number
default:0

Minimum duration of silence for a break (seconds)

Required range: x >= 0
min_duration_on
number
default:0

Minimum duration for speech detection (seconds)

Required range: x >= 0
vad_onset
number
default:0.5

Voice activity detection onset threshold

Required range: 0 <= x <= 1
vad_offset
number
default:0.363

Voice activity detection offset threshold

Required range: 0 <= x <= 1
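This reference does not spell out how the two thresholds interact. The usual reading (an assumption here, matching common VAD hysteresis schemes) is that a frame enters the speech state when the voice probability rises above vad_onset and leaves it only once the probability falls below vad_offset, which suppresses flicker between the two thresholds:

```python
def hysteresis_vad(probs: list[float],
                   onset: float = 0.5,
                   offset: float = 0.363) -> list[bool]:
    """Mark frames as speech using onset/offset hysteresis (assumed semantics)."""
    speaking = False
    out = []
    for p in probs:
        if not speaking and p >= onset:
            speaking = True        # probability crossed the onset threshold
        elif speaking and p < offset:
            speaking = False       # stay in speech until dropping below offset
        out.append(speaking)
    return out

# 0.4 sits between the thresholds, so the segment is held open until 0.2:
print(hysteresis_vad([0.1, 0.6, 0.4, 0.2]))  # [False, True, True, False]
```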
pad_offset
number
default:0

Additional padding at segment end (seconds)

Required range: x >= 0
pad_onset
number
default:0

Additional padding at segment start (seconds)

Required range: x >= 0
max_duration
number
default:30

Maximum duration to process (seconds)

Required range: x >= 0
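The numeric constraints above can be checked client-side before sending a request. A small sketch that validates a payload against the documented "Required range" values (only a subset of fields shown; the ranges are copied from this reference):

```python
# (min, max) pairs from the documented "Required range" constraints; None = unbounded.
RANGES = {
    "beam_size": (1, 5),
    "best_of": (1, 5),
    "batch_size": (0, 24),
    "patience": (0, 1),
    "vad_onset": (0, 1),
    "vad_offset": (0, 1),
    "length_penalty": (0, None),
    "max_duration": (0, None),
}

def validate(payload: dict) -> list[str]:
    """Return human-readable range violations (empty list when valid)."""
    errors = []
    for field, (lo, hi) in RANGES.items():
        if field not in payload:
            continue
        v = payload[field]
        if v < lo or (hi is not None and v > hi):
            errors.append(f"{field}={v} outside [{lo}, {hi}]")
    return errors

print(validate({"batch_size": 24, "patience": 1}))  # []
print(validate({"beam_size": 9}))  # ['beam_size=9 outside [1, 5]']
```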

Response

200
application/json
Successful transcription
transcription
string[]
required

Array of transcribed text segments

Example:
[
  "Hello, this is a test.",
  "The audio quality is good."
]
request_time
number
required

Total processing time in seconds

Example:

2.5

language
string
required

Detected or specified language

Example:

"en"

segments
object[]