Skip to main content

Version 3.3.0

· One min read
  • Added Speech Translation technology preview in GUI.
  • Added Speaker Diarization technology in (high-level description) REST API (still only preview in GUI).
  • Added option to speed up Enhanced Speech to Text Built on Whisper via beamSize parameter in Virtual Appliance configuration file - smaller beamSize means faster processing (up to ~30% with large_v2 model and beamSize=1) at the expense of slightly lower accuracy.
  • Speech to Text Phonexia and Time Analysis of Speech technologies updated to version 3.62.0.
  • Configuration and administration changes:
    • Added support for importing Virtual Appliance to Microsoft Hyper-V.
    • Both system and data disk now automatically resize according to size set in virtualization software.
    • Customers can now use cloud-init with Virtual Appliance.
    • Added diagnostic script for collecting logs for troubleshooting.
Included Components
  • Enhanced Speech to Text Built on Whisper 1.5.0
  • Language Identification 1.3.1
  • Speaker Diarization 1.3.0
  • Speech to Text Phonexia 6th Generation 3.62.0
  • Time Analysis of Speech 3.62.0
  • Voiceprint Comparison 1.1.0
  • Voiceprint Extraction 1.4.0