Diarization
Diarization distinguishes between the different speakers in each channel of a mono or stereo recording and provides precise timestamps indicating when each speaker is active. You can then isolate and listen to individual speakers, or process specific speakers further using other technologies.
Additionally, if the total number of speakers in the recording is unknown, the diarization technology can automatically detect and provide this information.
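To make the idea of per-speaker timestamps concrete, here is a minimal sketch of how a diarization result could be represented and summarized. The `Segment` shape, the speaker labels, and both helper functions are assumptions made for illustration; they are not the product's actual output format.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Segment:
    """One stretch of speech attributed to one speaker (hypothetical shape)."""
    channel: int   # 0 for mono; 0 or 1 for stereo
    speaker: str   # illustrative label such as "S1"
    start: float   # seconds from the beginning of the recording
    end: float     # seconds; greater than start


def speakers_per_channel(segments):
    """Return {channel: sorted list of speaker labels found in that channel}."""
    found = {}
    for seg in segments:
        found.setdefault(seg.channel, set()).add(seg.speaker)
    return {channel: sorted(labels) for channel, labels in found.items()}


def speaking_time(segments, channel, speaker):
    """Total seconds during which the given speaker is active in a channel."""
    return sum(seg.end - seg.start for seg in segments
               if seg.channel == channel and seg.speaker == speaker)


# Made-up example data: two speakers in channel 0, one in channel 1.
segments = [
    Segment(0, "S1", 0.0, 4.5),
    Segment(0, "S2", 4.5, 9.0),
    Segment(0, "S1", 9.0, 12.0),
    Segment(1, "S1", 1.0, 3.0),
]
```

With this data, `speakers_per_channel(segments)` reports two speakers in channel 0 and one in channel 1, and `speaking_time(segments, 0, "S1")` adds up the two channel-0 segments of `S1`.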
How it works
Number of speakers
The first step is optional. By clicking on 'Speaker Settings,' you can provide additional information to help the system deliver more accurate results. If you have the necessary details, there are two parameters you can specify:
- Total number of speakers. If you know the exact number of speakers in the audio, you can input this value to isolate their individual speech segments with corresponding timestamps.
  Warning: Use the Total number of speakers parameter only if you are absolutely certain of the exact number. The diarization technology will strictly adhere to this input, treating it as the definitive count of speakers in the audio. It will not attempt to verify or cross-check this number, so ensure its accuracy before proceeding.
- Max number of speakers. If you're uncertain about the exact number of speakers, you can specify the maximum number that might be present in the audio.
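The difference between the two parameters can be sketched as the range of speaker counts a result is allowed to contain. The function name, its arguments, and the rule that the exact total takes precedence when both values are supplied are assumptions of this sketch, not documented behavior.

```python
def speaker_count_bounds(total_speakers=None, max_speakers=None):
    """Return (lowest, highest) speaker counts a result may contain.

    highest is None when no limit was given, i.e. fully automatic detection.
    Hypothetical helper: names and precedence rule are assumptions.
    """
    for name, value in (("total_speakers", total_speakers),
                        ("max_speakers", max_speakers)):
        if value is not None and value < 1:
            raise ValueError(f"{name} must be at least 1, got {value}")

    if total_speakers is not None:
        # Exact count: treated as definitive and never cross-checked.
        return (total_speakers, total_speakers)
    if max_speakers is not None:
        # Upper limit only: the actual count is still detected automatically.
        return (1, max_speakers)
    # Nothing specified: the number of speakers is detected automatically.
    return (1, None)
```

For example, `speaker_count_bounds(total_speakers=3)` pins the result to exactly three speakers, while `speaker_count_bounds(max_speakers=5)` leaves the count open between one and five.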
Uploading files
Upload your files or create your own recordings by using the built-in recording feature. If you don't have your own files, you can use the provided Phonexia examples to explore how diarization works.
Read more about uploading files here.
Results
After uploading, your recordings will appear in the left panel. Once processing is complete, the results for each recording will be displayed in the right panel. In the upper right corner, you'll see the total number of speakers found in the recording (whether it's mono or stereo). Below the main player, you'll find multiple waveforms, each representing an individual speaker.
Further actions
After reviewing your results in the right panel, you can perform several actions:
- Edit the name of each speaker identified in the recording.
- Mute individual speakers or entire channels to play only the selected speakers or channels.
- Download a version of the recording that contains only the chosen speakers.
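Conceptually, playing or downloading only selected speakers amounts to silencing every part of a channel that falls outside their segments. The following is a minimal sketch of that idea for a single channel, assuming the audio is a plain list of samples and the segments are `(speaker, start_seconds, end_seconds)` tuples. The function and data layout are hypothetical and do not describe how the product implements this feature.

```python
def keep_selected_speakers(samples, sample_rate, segments, selected):
    """Return a copy of samples with audio outside the selected speakers silenced.

    samples:     list of numeric samples for one channel
    sample_rate: samples per second
    segments:    iterable of (speaker, start_seconds, end_seconds)
    selected:    set of speaker labels to keep
    """
    output = [0] * len(samples)  # start from silence
    for speaker, start, end in segments:
        if speaker not in selected:
            continue
        # Convert times to sample indices, clamped to the recording length.
        first = max(0, int(round(start * sample_rate)))
        last = min(len(samples), int(round(end * sample_rate)))
        output[first:last] = samples[first:last]
    return output


# Made-up example: 4 seconds of audio at 2 samples per second.
samples = [1, 2, 3, 4, 5, 6, 7, 8]
segments = [("S1", 0.0, 1.0), ("S2", 1.0, 3.0), ("S1", 3.0, 4.0)]
only_s1 = keep_selected_speakers(samples, 2, segments, {"S1"})
```

In this sketch, samples where speakers overlap are kept whenever at least one selected speaker is active; separating overlapping voices would require more than masking.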