Voxseg
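Voxseg is a Python package for voice activity detection (VAD), used to segment audio into speech and non-speech. It provides a complete VAD pipeline, including a pretrained VAD model, and is based on the work presented in the paper cited below.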
Use of this VAD may be cited as follows:
@inproceedings{cnnbilstm_vad,
title = {A hybrid {CNN-BiLSTM} voice activity detector},
author = {Wilkinson, N. and Niesler, T.},
booktitle = {Proc. IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP)},
year = {2021},
address = {Toronto, Cana