Version 3.0.0

  • The Speech to Text Whisper Enhanced technology has been renamed to Enhanced Speech to Text Built on Whisper, and a new Language switching feature has been added. The feature identifies the predominant language spoken in each thirty-second interval of audio and uses that language to transcribe the interval.

  • For the Speech to Text Phonexia and Time Analysis of Speech technologies, the number of tasks processed in parallel is now configurable through the parallelism parameter in the corresponding sections of the Virtual Appliance configuration file.
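
The Language switching behaviour described in the first item above can be illustrated with a minimal sketch. This is not the product's implementation: the function names, the `detect` and `transcribe` callables, and the score dictionary are hypothetical stand-ins chosen for illustration. Only the thirty-second interval and the idea of transcribing each interval in its predominant language come from the release note.

```python
from typing import Callable, Dict, List, Tuple

WINDOW_SECONDS = 30.0  # interval length stated in the release note

Window = Tuple[float, float]


def split_into_windows(duration: float, window: float = WINDOW_SECONDS) -> List[Window]:
    """Return (start, end) pairs that cover [0, duration] in fixed-size windows."""
    windows: List[Window] = []
    start = 0.0
    while start < duration:
        end = min(start + window, duration)  # the last window may be shorter
        windows.append((start, end))
        start = end
    return windows


def predominant_language(scores: Dict[str, float]) -> str:
    """Pick the language with the highest score for one window."""
    return max(scores, key=scores.get)


def transcribe_with_switching(
    duration: float,
    detect: Callable[[float, float], Dict[str, float]],  # hypothetical detector
    transcribe: Callable[[float, float, str], str],      # hypothetical transcriber
) -> List[Tuple[float, float, str, str]]:
    """Detect the language of each window, then transcribe that window with it."""
    results = []
    for start, end in split_into_windows(duration):
        language = predominant_language(detect(start, end))
        results.append((start, end, language, transcribe(start, end, language)))
    return results


if __name__ == "__main__":
    # Stub callables standing in for real language detection and transcription.
    def fake_detect(start: float, end: float) -> Dict[str, float]:
        return {"en": 0.8, "cs": 0.2} if start < 30 else {"en": 0.1, "cs": 0.9}

    def fake_transcribe(start: float, end: float, language: str) -> str:
        return f"<{language} text>"

    for row in transcribe_with_switching(70.0, fake_detect, fake_transcribe):
        print(row)
```

With the stub callables, a 70-second recording is split into three windows (two of thirty seconds and one of ten), and the language used for transcription changes from the first window to the second.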

Included Components
  • Enhanced Speech to Text Built on Whisper 1.2.2
  • Speech to Text Phonexia 6th Generation 3.61.0
  • Time Analysis of Speech 3.61.0
  • Voiceprint Comparison 1.0.0
  • Voiceprint Extraction 1.2.0

Version 2.1.0

  • Maintenance release with only configuration- and administration-related changes:
    • Models for the Speech to Text Phonexia and Time Analysis of Speech technologies are now loaded from the data disk, not from the image.
    • Speech to Text Phonexia and Time Analysis of Speech technologies updated to version 3.61.0.
    • Added extra environment variables for Speech to Text Whisper Enhanced.
    • Added maximum upload file size specification for filebrowser.
    • Moved Prometheus storage to data disk.
Included Components
  • Voiceprint Extraction 1.2.0
  • Voiceprint Comparison 1.0.0
  • Speech to Text Whisper Enhanced 1.1.0
  • Speech to Text Phonexia 6th Generation 3.61.0
  • Time Analysis of Speech 3.61.0

Version 2.0.0

  • Added Time Analysis of Speech technology (high-level description), available via REST API only (no GUI).
  • Configuration and administration changes:
    • Added options to change tmpdir volume for speech-platform API and media-conversion.
    • Added options to configure UI limits.
    • Added option to change API log level.
    • Models are now stored on data disk separately for each microservice.
Included Components
  • Speech to Text Phonexia 6th Generation 3.60.1
  • Speech to Text Whisper Enhanced 1.1.0
  • Time Analysis of Speech 3.60.1
  • Voiceprint Comparison 1.0.0
  • Voiceprint Extraction 1.2.0

Version 1.1.0

  • Initial release with Speaker Identification ([high-level description](/products/speech-platform-4/technologies/speaker-identification)) and Speech to Text (high-level description) technologies, available via REST API and in the GUI. Speech to Text Whisper Enhanced supports automatic detection of the spoken language.
Included Components
  • Speech to Text Phonexia 6th Generation 3.60.1
  • Speech to Text Whisper Enhanced 1.0.1
  • Voiceprint Comparison 1.0.0
  • Voiceprint Extraction 1.0.0