Version 4.0.0

June 30, 2025

Added Keyword Spotting technology (high-level description) in REST API and GUI
Added Age Estimation technology (high-level description) in REST API and GUI
The Authenticity Verification technology has been improved with:
- further improved Deepfake Detection model 2.2.0 with a significantly reduced number of false positive detections
- new Deepfake Detection scores output as Log Likelihood Ratio (LLR)
- added Audio Manipulation Detection subtechnology (high-level description)
- added Replay Attack Detection subtechnology (high-level description)
Further improvements of Virtual Appliance documentation:
- more details about HW and SW requirements
- more detailed deployment instructions for commonly used hypervisors
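
The Log Likelihood Ratio (LLR) scores now output by Deepfake Detection can be turned into a probability once a prior is chosen. The sketch below is only an illustration under stated assumptions: it assumes the LLR uses the natural logarithm and that positive values favor the "deepfake" hypothesis, neither of which is stated in these notes. The function name `llr_to_posterior` and the `prior` parameter are hypothetical and not part of the product API.

```python
import math


def llr_to_posterior(llr: float, prior: float = 0.5) -> float:
    """Convert a log likelihood ratio into a posterior probability.

    Assumptions (not confirmed by the release notes): the LLR is a natural
    logarithm, and `prior` is the caller's own estimate of how likely the
    target hypothesis is before seeing the score.
    """
    if not 0.0 < prior < 1.0:
        raise ValueError("prior must be strictly between 0 and 1")

    # Posterior log-odds = LLR + prior log-odds (Bayes' rule in log form).
    log_odds = llr + math.log(prior / (1.0 - prior))

    # Numerically stable logistic function, safe for very large |log_odds|.
    if log_odds >= 0:
        return 1.0 / (1.0 + math.exp(-log_odds))
    e = math.exp(log_odds)
    return e / (1.0 + e)
```

With a prior of 0.5, an LLR of 0 maps to a probability of 0.5, so the score carries no evidence either way. Any decision threshold should be calibrated on your own data.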

This release includes internal changes in licensing.
These changes are backwards compatible and should not affect existing users, with one exception: existing users of an older Deepfake Detection model need to update to the new model and license to continue using Deepfake Detection in the 4.0.0 release.

REST API changes

Increased the default API file size limit to 500 MB (the limit can be changed manually in the configuration file, see documentation)
Added validation of voiceprint size in the REST API to prevent potential abuse via unusually large payloads masquerading as valid voiceprints
Introduced a new Location header; the previous X-Location header is now deprecated
Fixed missing X-Location header in some REST API responses (bug introduced in VA 3.7.0)
Fixed an issue where the GET /api/task/:task_id endpoint sometimes returned HTTP 404 even though the requested task existed
Various API documentation fixes and improvements, e.g. the minimum values of some query parameters are now documented correctly
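
Because X-Location is deprecated in favor of Location, client code that reads the task URL from response headers may want to prefer the new header and fall back to the old one while both are still sent. The following is a minimal sketch of that idea. The helper name `task_location` is hypothetical, the header values in the usage note are made-up examples, and it works on any plain mapping of header names to values rather than on a specific HTTP library.

```python
from typing import Mapping, Optional


def task_location(headers: Mapping[str, str]) -> Optional[str]:
    """Return the task URL from response headers, or None if absent.

    Prefers the standard `Location` header and falls back to the deprecated
    `X-Location`. Header names are compared case-insensitively, as HTTP
    header names are not case-sensitive.
    """
    lowered = {name.lower(): value for name, value in headers.items()}
    return lowered.get("location") or lowered.get("x-location")
```

For example, `task_location({"X-Location": "/api/task/123"})` returns `"/api/task/123"`, and when both headers are present the value of `Location` is used.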

GUI (web application) changes

Increased the default GUI file size limit to 100 MB (the limit can be changed manually in the configuration file, see documentation)
Fixed score format in Gender Identification exports: the score is now exported correctly as a percentage, not as a decimal number
Minor changes in homepage tile order and style
Minor visual changes in the file table component (all technologies except Speaker Identification)

Included Components

Age Estimation 1.1.0
Audio Manipulation Detection 1.0.0
Audio Quality Estimation 3.62.0
Deepfake Detection 2.2.0
Emotion Recognition 1.2.1
Enhanced Speech to Text Built on Whisper 1.10.0
Gender Identification 1.4.0
Keyword Spotting 1.1.0
Language Identification 1.7.0
Replay Attack Detection 1.0.0
Speaker Diarization 1.6.0
Speech to Text Phonexia 6th Generation 3.62.0
Time Analysis of Speech 3.62.0
Voice Activity Detection 1.2.0
Voiceprint Comparison 1.4.0
Voiceprint Extraction 1.6.0