Version 3.4.0
· One min read
- Added Audio Quality Estimation technology (high-level description) in REST API.
- Voice Activity Detection technology (high-level description) is available in both REST API and GUI.
- Speech Translation technology is now fully working in GUI.
- Speaker Diarization technology (high-level description) is now fully working in GUI.
- Updated Speech to Text Phonexia with the ability to use Preferred phrases.
- Updated the Speaker Identification model to
xl-5.1.0
, which is capable of carrying out automatic adaptation to various input audio sources (YouTube, Skype, WhatsApp, VoLTE, AMBE).
Included Components
- Audio Quality Estimation 3.62.0
- Enhanced Speech to Text Built on Whisper 1.7.0
- Language Identification 1.5.0
- Speaker Diarization 1.4.1
- Speech to Text Phonexia 6th Generation 3.62.0
- Time Analysis of Speech 3.62.0
- Voice Activity Detection 1.0.1
- Voiceprint Comparison 1.3.0
- Voiceprint Extraction 1.5.2