MATLAB code for extracting mean signal features - PIT-LSTM-Speech-Separation: a TensorFlow implementation of PIT for speech separation

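Two-speaker speech separation with LSTM/BLSTM-based PIT (permutation invariant training)

Progress in multi-talker mixed speech separation and recognition, often called the "cocktail party problem", has not been especially impressive. Although human listeners can easily perceive the different sources in a mixture of sounds, the same task appears to be extremely difficult for computers, especially when only a single microphone records the mixed speech.

1. Running performance

Note: the training and validation sets contain two-speaker mixtures generated by randomly selecting speakers and utterances from the WSJ0 set and mixing them at signal-to-noise ratios (SNRs) chosen uniformly between -2.5 dB and 2.5 dB.

Results were reported separately for LSTM and for BLSTM on mixed audio with different gender combinations. They show that mixed-gender audio is separated better than same-gender audio, and that BLSTM outperforms LSTM.

2. Evaluation criteria

SDR: signal-to-distortion ratio
SAR: signal-to-artifacts ratio
SIR: signal-to-interference ratio
STOI: short-time objective intelligibility measure
ESTOI: extended short-time objective intelligibility measure
PESQ: perceptual evaluation of speech quality

3. Dependencies

MATLAB (my test version: R2016b, 64-bit)
TensorFlow (my test version: 1.4.0)
anac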
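The mixing step described in the note above (two utterances combined at an SNR drawn uniformly between -2.5 dB and 2.5 dB) can be illustrated with a minimal NumPy sketch. This is not the repository's code: the archive ships MATLAB scripts for mixture creation, and the function names `mix_at_snr` and `measured_snr_db`, the use of plain mean power as the level measure, the truncation to the shorter signal, and the white-noise stand-ins for speech are all assumptions made for illustration.

```python
import numpy as np


def mix_at_snr(s1, s2, snr_db):
    """Scale s2 so that power(s1) / power(scaled s2) equals snr_db, then sum.

    Hypothetical helper: levels are measured as mean power over the whole
    signal, and both signals are truncated to the shorter length.
    """
    n = min(len(s1), len(s2))
    s1 = np.asarray(s1[:n], dtype=np.float64)
    s2 = np.asarray(s2[:n], dtype=np.float64)
    p1 = np.mean(s1 ** 2)
    p2 = np.mean(s2 ** 2)
    # Choose gain g so that 10 * log10(p1 / (g**2 * p2)) == snr_db.
    gain = np.sqrt(p1 / (p2 * 10.0 ** (snr_db / 10.0)))
    s2_scaled = gain * s2
    return s1 + s2_scaled, s1, s2_scaled


def measured_snr_db(target, interferer):
    """Power ratio of target to interferer, in dB."""
    return 10.0 * np.log10(np.mean(target ** 2) / np.mean(interferer ** 2))


rng = np.random.default_rng(0)
# White-noise stand-ins for two utterances of different length and level.
s1 = rng.standard_normal(16000)
s2 = 0.3 * rng.standard_normal(12000)

# Draw the SNR uniformly from [-2.5, 2.5) dB, as in the note above.
snr_db = rng.uniform(-2.5, 2.5)
mixture, ref1, ref2 = mix_at_snr(s1, s2, snr_db)
```

The function returns the scaled sources along with the mixture because a separation model needs them as training targets; `measured_snr_db(ref1, ref2)` can be used to check that the requested SNR was actually reached.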

Resource details

PIT-LSTM-Speech-Separation-master/ (63 files, 5.37MB)
    run_lstm.py  16.23KB
    tfrecords_io.py  5.06KB
    signal_processing.py  8.02KB
    gen_tfrecords.py  4.20KB
    make_wav_list.py  1015B
    wsj0-train-spkrinfo.txt  864B
    6. separated_result_LSTM/
        two_women_2.wav  83.79KB
        two_women_1.wav  83.79KB
        two_men_2.wav  96.79KB
        one_man_one_woman_2.wav  86.79KB
        one_man_one_woman_1.wav  86.79KB
        two_men_1.wav  96.79KB
    4. introduction_to_mask/
        masks.png  21.02KB
        SA2.wav  95.24KB
        SA1.wav  104.64KB
        recoverd2.png  19.83KB
        Introduction to Ideal Binary Mask.ipynb  185.85KB
        recoverd1.png  18.90KB
        mixed.wav  125.04KB
        MPM14-Time-Frequency-Masking.pdf  1.08MB
        mixturesignals.png  24.97KB
        spectrograms.png  52.45KB
    utils.py  4.14KB
    README.md  5.99KB
    blstm.py  8.89KB
    run.sh  5.23KB
    2. create-speaker-mixtures-V2/
        mix_2_spk_tt.txt  230.49KB
        mix_3_spk_cv.txt  530.56KB
        mix_2_spk_cv.txt  374.36KB
        mix_3_spk_tt.txt  327.25KB
        create_wav_2speakers.m  8.95KB
        activlev.m  16.29KB
        maxfilt.m  4.70KB
        create_wav_3speakers.m  9.25KB
        mix_2_spk_tr.txt  1.46MB
        readme.txt  781B
        mix_3_spk_tr.txt  2.07MB
    3. SPHFile2Wav/
        SA1.WAV  110.20KB
        SPH2Wav.py  435B
        README.md  366B
        converted.wav  109.24KB
    8.Result picture/
        BLSTM-result.png  47.89KB
        spectrogram.PNG  821.59KB
        LSTM-result.png  46.13KB
    7. separated_result_BLSTM/
        two_women_2.wav  83.79KB
        two_women_1.wav  83.79KB
        two_men_2.wav  96.79KB
        one_man_one_woman_2.wav  86.79KB
        one_man_one_woman_1.wav  86.79KB
        two_men_1.wav  96.79KB
    5. step_to_CASA_DL/
        train_test_model.py  7.83KB
        evaluation_metric.py  2.19KB
        ProjectReport-Speech Separation in Supervised Setting.pdf  158.17KB
        speech_preprocess.py  13.39KB
    1. create-speaker-mixtures-V1/
        mix_2_spk_tt.txt  230.49KB
        mix_3_spk_cv.txt  530.56KB
        mix_2_spk_cv.txt  374.36KB
        mix_3_spk_tt.txt  327.25KB
        create_wav_2speakers.m  7.51KB
        create_wav_3speakers.m  9.25KB
        mix_2_spk_tr.txt  1.46MB
        readme.txt  672B
        mix_3_spk_tr.txt  2.07MB
