Version 3.4.0

November 21, 2024

Added Audio Quality Estimation technology (high-level description) to the REST API.
Voice Activity Detection technology (high-level description) is now available in both the REST API and the GUI.
Speech Translation technology is now fully working in the GUI.
Speaker Diarization technology (high-level description) is now fully working in the GUI.
Updated Speech to Text Phonexia to support Preferred phrases.
Updated the Speaker Identification model to xl-5.1.0, which automatically adapts to various input audio sources (YouTube, Skype, WhatsApp, VoLTE, AMBE).

Included Components