Phonexia Enhanced Speech to Text Built on Whisper changelog
Version: 3.4.0
- [PLAT-743] Upgraded CTranslate2 to v4
- [PLAT-936] Crash when using negative values in time range
- [PLAT-888] Added PHX_KEEPALIVE_TIME_S and PHX_KEEPALIVE_TIMEOUT_S options for connection timeout specification
- [PLAT-801] If set, start_time is added to all segment timestamps
- [PLAT-838] Possibility to set beam size on start
- [PLAT-813] Language tags are now case-insensitive
- [PLAT-772] Support for sending RAW audio data
- [PLAT-781] Support for license extensions
- [PLAT-758] Device is logged as a number rather than a string
- [PLAT-644] Support for distilled whisper v3 models
- [PLAT-788] Missing entrypoint in GPU docker image
- [PLAT-756] Inconsistent transcription behavior between autodetect and forced language mode
- [PLAT-766] Misleading warning message in Python client
- [PLAT-765] Python client is not compatible with Python 3.8
- [PLAT-774] Optimized docker images
- [PLAT-751] Crash when a second request arrives while processing is running
- [PLAT-748] Some licenses may be refused by the service
- [PLAT-309] Machine translation
- [PLAT-669] Support for custom log tags in metadata and logs
- [VOX-667] Renamed microservice to enhanced-speech-to-text-built-on-whisper
- [PLAT-465] Parameters for selecting the part of the audio to process in TranscribeRequest (audio.time_range)
- [PLAT-581] Parameter for enabling language switching during the transcription in TranscribeRequest (config.enable_language_switching)
- [PLAT-603] Premature processing cancellation if connection is canceled by client
- [PLAT-625] Additional logging when initializing service
- [PLAT-614] Inconsistent license logging
- [PLAT-609] Memory access violations
- [PLAT-521] Support for large-v3 Whisper model
- [VOX-396] Limiting of supported languages for Whisper models
- [PLAT-520] Correlation ID to log messages
- [PLAT-439] Optimization to model loading
- [PLAT-296] Phonexia Voice Activity Detection