Version 3.3.0
- Added Speech Translation technology preview in GUI.
- Added Speaker Diarization technology to the REST API (still only a preview in the GUI).
- Added option to speed up Enhanced Speech to Text Built on Whisper via the `beamSize` parameter in the Virtual Appliance configuration file. A smaller `beamSize` means faster processing (up to ~30% with the large_v2 model and `beamSize=1`) at the expense of slightly lower accuracy.
- Speech to Text Phonexia and Time Analysis of Speech technologies updated to version 3.62.0.
- Configuration and administration changes:
  - Added support for importing the Virtual Appliance to Microsoft Hyper-V.
  - Both the system disk and the data disk now resize automatically according to the size set in the virtualization software.
  - Customers can now use cloud-init with the Virtual Appliance.
  - Added a diagnostic script that collects logs for troubleshooting.
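
The speed and accuracy trade-off behind the `beamSize` parameter can be illustrated with a generic beam search. The sketch below is not the Virtual Appliance's or Whisper's actual decoder: `TOY_MODEL` and `beam_search` are hypothetical names, and the probabilities are made up for illustration. It only shows why a beam of 1 does less work per step but can miss a better overall hypothesis.

```python
import math

# Hypothetical toy "model": next-token probabilities given the prefix so far.
# The numbers are invented purely to illustrate the trade-off.
TOY_MODEL = {
    (): {"A": 0.6, "B": 0.4},
    ("A",): {"X": 0.55, "Y": 0.45},
    ("B",): {"X": 0.9, "Y": 0.1},
}


def beam_search(beam_size, steps=2):
    """Return (best_sequence, probability, prefixes_expanded)."""
    beams = [((), 0.0)]  # (prefix, cumulative log-probability)
    expanded = 0
    for _ in range(steps):
        candidates = []
        for prefix, score in beams:
            expanded += 1  # one "model call" per surviving hypothesis
            for token, p in TOY_MODEL[prefix].items():
                candidates.append((prefix + (token,), score + math.log(p)))
        # Keep only the beam_size highest-scoring hypotheses.
        candidates.sort(key=lambda c: c[1], reverse=True)
        beams = candidates[:beam_size]
    best_prefix, best_score = beams[0]
    return best_prefix, math.exp(best_score), expanded


if __name__ == "__main__":
    for size in (1, 2):
        seq, prob, work = beam_search(size)
        print(f"beam_size={size}: sequence={seq}, p={prob:.2f}, expansions={work}")
```

In this toy setup, a beam of 1 commits to the locally best first token and expands fewer prefixes, while a beam of 2 does more work and finds a sequence with higher overall probability. Real decoders behave analogously, although the actual speed-up depends on the model and hardware.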
Included Components
- Enhanced Speech to Text Built on Whisper 1.5.0
- Language Identification 1.3.1
- Speaker Diarization 1.3.0
- Speech to Text Phonexia 6th Generation 3.62.0
- Time Analysis of Speech 3.62.0
- Voiceprint Comparison 1.1.0
- Voiceprint Extraction 1.4.0