Skip to main content

Version 4.0.0

· 2 min read
  • Added Keyword Spotting technology (high-level description) in REST API and GUI
  • Added Age Estimation technology (high-level description) in REST API and GUI
  • The Authenticity Verification technology has been improved with
    • further improved Deepfake Detection model 2.2.0 with significantly reduced number of false positive detections
    • new Deepfake Detection scores output as Log Likelihood Ratio (LLR)
    • added Audio Manipulation Detection subtechnology (high-level description)
    • added Replay Attach Detection subtechnology (high-level description)
  • Further improvements of Virtual Appliance documentation:
    • more details about HW and SW requirements
    • more detailed deployment instructions for commonly used hypervisors

This release includes internal changes in licensing.
These changes are backwards compatible and should not affect existing users, with one exception - existing users of older Deepfake Detection model need to update to the new model and license to continue using Deepfake Detection in 4.0.0 release.

REST API changes

  • Increased API default file size limit to 500 MB (limit can be changed manually in configuration file, see documentation)
  • Added validation of voiceprint size in the REST API to prevent potential abuse via unusually large payloads masquerading as valid voiceprints
  • Introduced a new Location header; the previous X-Location header is now deprecated
  • Fixed missing X-Location header in some REST API responses (bug introduced in VA 3.7.0)
  • Fixed an issue where the GET /api/task/:task_id endpoint sometimes returned HTTP 404 even though the requested task existed
  • Various API documentation fixes and improvements, e.g. some query parameters minimum values are now documented correctly

GUI (web application) changes

  • Increased GUI default file size limit to 100 MB (Limit can be changed manually in configuration file, see documentation)
  • Fixed score format in Gender Identification exports: score is now exported correctly as percentage, not as decimal number
  • Minor changes in homepage tile order and style
  • Minor visual changes in the file table component (all technologies except Speaker Identification)
Included Components
  • Age Estimation 1.1.0
  • Audio Manipulation Detection 1.0.0
  • Audio Quality Estimation 3.62.0
  • Deepfake Detection 2.2.0
  • Emotion Recognition 1.2.1
  • Enhanced Speech to Text Built on Whisper 1.10.0
  • Gender Identification 1.4.0
  • Keyword Spotting 1.1.0
  • Language Identification 1.7.0
  • Replay Attack Detection 1.0.0
  • Speaker Diarization 1.6.0
  • Speech to Text Phonexia 6th Generation 3.62.0
  • Time Analysis of Speech 3.62.0
  • Voice Activity Detection 1.2.0
  • Voiceprint Comparison 1.4.0
  • Voiceprint Extraction 1.6.0