MATLAB code for frequency-domain to time-domain conversion - speaker-recognition: Speaker Recognition

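Uploader: 38739101 | Upload time: 2022-06-01 21:48:16 | File size: 2.8MB | File type: ZIP

EEC-201 [Speaker Recognition]

♪ All around me are familiar faces... but now they are familiar voices... ♫

Team: Dedicated Engineers

This project was completed jointly by Aakansha and Sadia. Its goal is to implement speaker recognition using MFCC, VQ, and the LBG algorithm. Sadia worked on preprocessing and MFCC. Aakansha worked on LBG, noise addition, and notch filtering. Training, testing, and the analysis write-up were done together.

Introduction

In today's world of pandemic and quarantine, our voices have, quite literally, become more important. With communication limited to the virtual, authentication in person or by fingerprint has become obsolete. But just as our faces and fingerprints are unique, our voices also carry distinct and distinguishable features. As our project shows, a computer program can recognize these features better than the human ear.

We implement the speaker recognition system using pattern recognition, or feature matching, in which the sequence of acoustic vectors extracted from the input speech signal is classified into individual speaker IDs. Specifically, our system is an implementation of supervised pattern recognition: the database consists of known patterns from the training set, and these patterns are compared against the test set to evaluate our classification algorithm.

There are two approaches to speaker recognition: text-dependent and text-independent. A text-dependent speaker recognition strategy requires the speaker …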
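The feature-matching step described above can be illustrated with a minimal sketch. It is written in Python rather than MATLAB and is not the repository's code: the function names, the toy two-dimensional vectors (standing in for MFCC frames), and the hand-written codebooks (standing in for codebooks trained with LBG) are all assumptions made for illustration. Each test vector is matched to its nearest codeword, and the speaker whose codebook gives the lowest average distortion is selected.

```python
import math


def nearest_codeword_distance(vector, codebook):
    """Euclidean distance from one feature vector to its closest codeword."""
    return min(math.dist(vector, codeword) for codeword in codebook)


def average_distortion(vectors, codebook):
    """Mean nearest-codeword distance over all feature vectors of an utterance."""
    return sum(nearest_codeword_distance(v, codebook) for v in vectors) / len(vectors)


def identify_speaker(vectors, codebooks):
    """Return the speaker ID whose codebook yields the smallest average distortion."""
    return min(codebooks, key=lambda sid: average_distortion(vectors, codebooks[sid]))


# Hypothetical codebooks: in a real system these would come from training
# (for example, LBG clustering of each speaker's MFCC vectors).
codebooks = {
    "s1": [(0.0, 0.0), (1.0, 1.0)],
    "s2": [(5.0, 5.0), (6.0, 6.0)],
}

# Hypothetical test utterance: a short sequence of 2-D feature vectors.
test_vectors = [(0.1, -0.1), (0.9, 1.2), (0.2, 0.1)]

print(identify_speaker(test_vectors, codebooks))  # prints "s1"
```

Averaging the distortion over all frames, instead of voting frame by frame, keeps the decision stable when a few frames happen to fall close to another speaker's codewords.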

Resource details

(71 sub-files, 2.8MB) MATLAB code for frequency-domain to time-domain conversion - speaker-recognition: Speaker Recognition

speaker-recognition-main/
    Source Code/
        main.m  (1.92KB)
        mfcc.m  (3.39KB)
        LBG.m  (1.78KB)
        train.m  (889B)
        kmeans.m  (1.06KB)
        test.m  (2.29KB)
        disteu.m  (771B)
        add_noise.m  (601B)
        preProcressing.m  (1.01KB)
        melfb.m  (1.31KB)
    README.md  (20.89KB)
    Images/
        normalized_speech_signal_speaker_1.PNG  (27.15KB)
        MFCC Speaker1_2.PNG  (40.88KB)
        sig1 MFCC cluster1.PNG  (16.00KB)
        sig1 stft_3.PNG  (66.12KB)
        sig1 stft_2.PNG  (68.04KB)
        sig1 MFCC.PNG  (25.79KB)
        mel scale equation.jpeg  (8.77KB)
        speech_signal_speaker_1.PNG  (26.53KB)
        sig1 MelFreqWrap.PNG  (26.21KB)
        sig1 sig2 MFCC clusters.PNG  (21.09KB)
        results.PNG  (7.07KB)
        Normalized_Silence removed sig1.PNG  (56.06KB)
        sig1 stft_1.PNG  (66.80KB)
        20 mel filter banks.PNG  (39.29KB)
        stft sig1_before Mel.PNG  (142.79KB)
        sig1 N_512.PNG  (87.18KB)
        sig1 N_256.PNG  (89.46KB)
        noiseaddedsig1.PNG  (34.43KB)
        sig1 N_128.PNG  (90.10KB)
        VQ acoustic vector codeblocks.PNG  (21.50KB)
        mfcc5_6 speaker1_2.PNG  (22.72KB)
        noisechart.PNG  (447.52KB)
    Audio/
        Training/
            s8.wav  (28.54KB)
            s1.wav  (25.54KB)
            s4.wav  (29.04KB)
            s3.wav  (26.04KB)
            s9.wav  (98.68KB)
            s6.wav  (29.04KB)
            s11.wav  (80.61KB)
            s7.wav  (28.04KB)
            s2.wav  (26.54KB)
            s10.wav  (108.44KB)
            __  (1B)
            s5.wav  (35.54KB)
        Self Audio/
            s13_test.wav  (280.11KB)
            s13.wav  (403.86KB)
            s12.wav  (147.95KB)
            s12_train.wav  (137.95KB)
        Test/
            s8.wav  (28.54KB)
            s1.wav  (25.54KB)
            s1_edited.wav  (25.54KB)
            s4.wav  (29.04KB)
            s1_edited4.wav  (25.54KB)
            s1_edited5.wav  (25.54KB)
            s3.wav  (26.04KB)
            s1_notch1.wav  (25.54KB)
            s9.wav  (98.68KB)
            s1_notch3.wav  (25.54KB)
            s6.wav  (29.04KB)
            s11.wav  (80.61KB)
            s7.wav  (28.04KB)
            s13_slow.wav  (403.86KB)
            s13_tapping.m4a  (49.86KB)
            s1_edited3.wav  (25.54KB)
            s2.wav  (26.54KB)
            s10.wav  (108.44KB)
            s1_notch2.wav  (25.54KB)
            s5.wav  (35.54KB)
            s1_edited2.wav  (25.54KB)
            s1_notch4.wav  (25.54KB)
