Speech to Text: start task

POST /api/technology/speech-to-text-whisper-enhanced

Start an Enhanced Speech to Text Built on Whisper task for a media file.

Enhanced Speech to Text Built on Whisper features

Multi-channel audio files are supported.
Channel id is included in individual transcription segments.
Language for transcription can be specified as a query parameter.
Language switching can be activated via a query parameter. It is a feature of the Enhanced Speech to Text built on Whisper technology, which identifies the predominant language spoken within each thirty-second interval of audio. The identified language is then utilized for transcribing that particular section.
To use a specific language as the source, it must be licensed. Otherwise, an error is raised.
If you use auto-detect or language switching, only licensed languages are considered as the source for the translation. In case the actually detected language is not licensed, the closest licensed language is used instead.

Request

Responses

Speech to Text task was accepted. Follow the Location header to poll for the task state.

Response Headers

X-Location

⚠️ Deprecated - use Location header instead.

Example: /api/technology/speech-to-text-whisper-enhanced/123e4567-e89b-12d3-a456-426614174000

Location

A URL the client should poll for task state and result.

Example: /api/technology/speech-to-text-whisper-enhanced/123e4567-e89b-12d3-a456-426614174000

Request rate limit exceeded.

The request may be retried after a while. The following response headers may be checked for details: retry-after, x-ratelimit-limit, x-ratelimit-remaining, x-ratelimit-reset.

Response Headers

retry-after

Header indicates how long the user agent should wait before making a follow-up request.

x-ratelimit-limit

Size of the current rate limiting window.

x-ratelimit-remaining

Remaining number of requests in the current rate limiting window.

x-ratelimit-reset

Time at which the current rate limiting window resets (in UTC epoch).

Speech to Text: start task

/api/technology/speech-to-text-whisper-enhanced

Enhanced Speech to Text Built on Whisper features​

Request​

Responses​

Enhanced Speech to Text Built on Whisper features

Request

Responses