Skip to main content

Version 3.4.0

· One min read
  • Added Audio Quality Estimation technology (high-level description) in REST API.
  • Voice Activity Detection technology (high-level description) is available in both REST API and GUI.
  • Speech Translation technology is now fully working in GUI.
  • Speaker Diarization technology (high-level description) is now fully working in GUI.
  • Updated Speech to Text Phonexia with the ability to use Preferred phrases.
  • Updated the Speaker Identification model to xl-5.1.0, which is capable of carrying out automatic adaptation to various input audio sources (YouTube, Skype, WhatsApp, VoLTE, AMBE).
Included Components
  • Audio Quality Estimation 3.62.0
  • Enhanced Speech to Text Built on Whisper 1.7.0
  • Language Identification 1.5.0
  • Speaker Diarization 1.4.1
  • Speech to Text Phonexia 6th Generation 3.62.0
  • Time Analysis of Speech 3.62.0
  • Voice Activity Detection 1.0.1
  • Voiceprint Comparison 1.3.0
  • Voiceprint Extraction 1.5.2