Version 4.0.0
· 2 min read
- Added Keyword Spotting technology (high-level description) in REST API and GUI
- Added Age Estimation technology (high-level description) in REST API and GUI
- The Authenticity Verification technology has been improved with
- further improved Deepfake Detection model 2.2.0 with significantly reduced number of false positive detections
- new Deepfake Detection scores output as Log Likelihood Ratio (LLR)
- added Audio Manipulation Detection subtechnology (high-level description)
- added Replay Attach Detection subtechnology (high-level description)
- Further improvements of Virtual Appliance documentation:
- more details about HW and SW requirements
- more detailed deployment instructions for commonly used hypervisors
This release includes internal changes in licensing.
These changes are backwards compatible and should not affect existing users,
with one exception - existing users of older Deepfake Detection model need to
update to the new model and license to continue using Deepfake Detection in
4.0.0 release.
REST API changes
- Increased API default file size limit to 500 MB (limit can be changed manually in configuration file, see documentation)
- Added validation of voiceprint size in the REST API to prevent potential abuse via unusually large payloads masquerading as valid voiceprints
- Introduced a new
Location
header; the previousX-Location
header is now deprecated - Fixed missing
X-Location
header in some REST API responses (bug introduced in VA 3.7.0) - Fixed an issue where the
GET /api/task/:task_id
endpoint sometimes returned HTTP 404 even though the requested task existed - Various API documentation fixes and improvements, e.g. some query parameters minimum values are now documented correctly
GUI (web application) changes
- Increased GUI default file size limit to 100 MB (Limit can be changed manually in configuration file, see documentation)
- Fixed score format in Gender Identification exports: score is now exported correctly as percentage, not as decimal number
- Minor changes in homepage tile order and style
- Minor visual changes in the file table component (all technologies except Speaker Identification)
Included Components
- Age Estimation 1.1.0
- Audio Manipulation Detection 1.0.0
- Audio Quality Estimation 3.62.0
- Deepfake Detection 2.2.0
- Emotion Recognition 1.2.1
- Enhanced Speech to Text Built on Whisper 1.10.0
- Gender Identification 1.4.0
- Keyword Spotting 1.1.0
- Language Identification 1.7.0
- Replay Attack Detection 1.0.0
- Speaker Diarization 1.6.0
- Speech to Text Phonexia 6th Generation 3.62.0
- Time Analysis of Speech 3.62.0
- Voice Activity Detection 1.2.0
- Voiceprint Comparison 1.4.0
- Voiceprint Extraction 1.6.0