Silence Detection

Analysing audio only makes sense when the input actually contains an audio signal. For that reason, all our models work together with a Volume detector, which detects silence and near-silence so that only snippets containing sound are processed.

Configuration

Volume detection depends heavily on the properties of the audio capture device. You can therefore set the sensitivity of the silence detection by adjusting the volume_threshold in the audio processing functions. The threshold can vary between 0 (no sound is treated as silence) and 1 (all sound is treated as silence). We suggest starting with 0.05, which is also the default value; it should exclude very quiet audio chunks from the analysis. If too much of the audio is tagged as silence, we suggest dividing the threshold by 10 repeatedly until a suitable silence threshold is found.

Parameter          Range     Default value
volume_threshold   <0, 1>    0.05
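
The tuning advice above can be sketched in a few lines. This is only an illustration and not the Volume detector's actual implementation: it assumes samples normalized to [-1.0, 1.0] and uses the RMS level of a chunk as its "volume". The helper names (rms, is_silence, tune_threshold) and the max_silent_fraction parameter are hypothetical.

```python
import math

def rms(chunk):
    """Root-mean-square level of samples assumed to be normalized to [-1.0, 1.0]."""
    if not chunk:
        return 0.0
    return math.sqrt(sum(s * s for s in chunk) / len(chunk))

def is_silence(chunk, volume_threshold=0.05):
    """Assumption: a chunk counts as silence when its RMS level is below the threshold."""
    return rms(chunk) < volume_threshold

def tune_threshold(chunks, start=0.05, max_silent_fraction=0.5, min_threshold=1e-6):
    """Divide the threshold by 10 until at most max_silent_fraction of chunks are silent."""
    if not chunks:
        return start
    threshold = start
    while threshold > min_threshold:
        silent = sum(is_silence(c, threshold) for c in chunks)
        if silent / len(chunks) <= max_silent_fraction:
            break
        threshold /= 10
    return threshold
```

With a quiet capture device, most chunks may fall below the default of 0.05; calling tune_threshold on a sample of recorded chunks would then suggest a lower value to pass as volume_threshold.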

Audio fragments that were treated as silence by the Volume detector will be marked as "silence" and the results of any other requested models will not be returned.
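
When post-processing results, it can help to separate silent fragments from analysed ones before reading any model output. The sketch below assumes a hypothetical result shape, a list of per-fragment dictionaries with a "status" key that equals "silence" for silent fragments; the real response format may differ, so check it against the file processing or stream processing sections.

```python
def split_by_silence(results):
    """Split per-fragment results into (silent, analysed) lists.

    Assumption: each result is a dict, and silent fragments carry
    the hypothetical entry status == "silence" with no model outputs.
    """
    silent = [r for r in results if r.get("status") == "silence"]
    analysed = [r for r in results if r.get("status") != "silence"]
    return silent, analysed
```

Only the fragments in the second list can be expected to carry results from the other requested models.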

For examples, see the file processing or stream processing sections.