Skip to main content

Recommended OS and HW

Required HW resources depend on set of technologies (i.e. SPE configuration) and the load that should be processed per day (or during a peak hour). Additionally, your own application built on top of SPE (including eventual external dependencies like databases, storage, etc.) would require additional resources. Therefore you should always perform a proper load test using your entire system to determine the actual HW requirements.

Recommendations for typical configurations:

Voice Biometrics, basic 100 hours/day package (***) files processing

  • CPU: 8 physical cores, 1x Intel® Xeon E5-2640 v4 or similar or 10th Gen Intel® Core Processor
  • RAM: 16 GB
  • Storage: 100 GB (depends on audio retention policy)
    SSD strongly recommended for superior performance over HDD
  • Configuration includes: SID4 XL4, GID XL4, LID L4, AGE L4, VAD, SQE

Transcription System, basic 100 hours/day package (***) files processing

  • CPU: 8 physical cores, 1x Intel® Xeon E5-2640 v4 or similar or 10th Gen Intel® Core Processor
  • RAM: 16 GB
  • Storage: 100 GB (depends on your audio retention policy)
    SSD strongly recommended for superior performance over HDD
  • Configuration includes: STT 6th generation – 2 languages (half load each), KWS 6th generation - 2 languages, LID L4, VAD, SQE

Voice Biometrics + Transcription System, basic 100 hours/day package (***) files processing

  • CPU: 14 physical cores, 1x Intel® Xeon Gold 5120 or similar or 10th Gen Intel® Core Processor
  • RAM: 32 GB
  • Storage: 500 GB (depends on your audio retention policy)
    SSD strongly recommended for superior performance over HDD
  • Configuration includes: SID4 XL4, GID XL4, LID L4, AGE L4, STT 6th generation - 2 languages (half load each), KWS 6th generation - 2 languages, VAD, SQE

(***) The amount of hours/day refers to the Phonexia pricing package, it does NOT mean maximum throughput of such configuration. In other words, this is a recommended configuration, not a minimal configuration.

  • Windows 64-bit – Windows Server 2019(*), latest version of Windows 10 (*)
  • Linux 64-bit – latest version of RHEL/CentOS 7 (*) since the version 3.62: the latest version of RHEL/Rocky is Linux/Alma Linux 8

Compatible Operating Systems

  • 64-bit Windows 8.1, Windows Server 2016, and newer
  • 64-bit Linux with glibc >= 2.17, e.g. Ubuntu 20.04, Mint 19.3, RHEL/CentOS 8.2, .. since version 3.62: 64-bit Linux with glibc >= 2.28, e.g. Ubuntu 20.04, Linux Mint 4-LMDE, RHEL/Rocky Linux/Alma Linux 8

(*) Speech Platform components (e.g. Speech Engine) are tested by Phonexia on these systems.
(**) Speech Platform components (e.g. Speech Engine) are known to be successfully deployed on these systems.

LINUX:

Library 'libasound' is a prerequisite for running Speech Platform 3. Please ensure that it is installed and after it is, ensure that dependencies are correctly met with command 'ldd phxspe' in SPE installation folder.