Skip to main content

Speech Translation: start task

POST 

/api/technology/speech-translation-whisper-enhanced

Start an Enhanced Speech Translation Built on Whisper task for a media file.

Enhanced Speech Translation Built on Whisper features

  • Multi-channel audio files are supported.
  • Channel id is included in individual translation segments.
  • Source language for translation of the whole audio can be specified as a query parameter. It is mutually exclusive with the language_switching parameter.
  • Language switching can be activated via a query parameter. With this feature, the technology automatically identifies the predominant language spoken within each thirty-second segment of audio and uses it as the source language for translating that particular segment. It is mutually exclusive with the source_language parameter.
  • To use a specific language as the source, it must be licensed. Otherwise, an error is raised.
  • If you use auto-detect or language switching, only licensed languages are considered as the source for the translation. In case the actually detected language is not licensed, the closest licensed language is used instead.

Request

Responses

Speech translation task was accepted. Follow the X-Location header to poll for the task state.

Response Headers
    X-Location

    A URL the client should poll for task state and result.

    Example: /api/technology/speech-translation-whisper-enhanced/123e4567-e89b-12d3-a456-426614174000