用matlab实现的语音识别算法

上传者: aileenwei | 上传时间: 2019-12-21 19:23:33 | 文件大小: 745KB | 文件类型: rar
在本文中,我们将深入探讨如何使用MATLAB实现语音识别算法,这一技术在现代通信、智能家居、自动驾驶等领域有着广泛应用。MATLAB作为一个强大的数值计算和数据可视化平台,为语音处理提供了丰富的工具箱和函数,使得开发这样的系统变得相对容易。 我们要理解语音识别的基本流程。它通常包括以下几个步骤: 1. 预处理:这部分涉及到语音信号的采集,噪声去除,预加重(Preemphasis)等。预加重是一种提升高频成分的技术,有助于消除人声的频率响应不平坦现象。 2. 分帧与窗函数:将连续的语音信号切分成固定长度的帧,并应用窗函数(如汉明窗或海明窗)以减少帧间干扰。 3. 声谱分析:通过快速傅里叶变换(FFT)对每帧进行分析,得到频域表示,即梅尔频率倒谱系数(MFCC)。MFCC是一种模拟人类听觉特性的特征提取方法,能有效压缩频谱信息。 4. 特征提取:从声谱图中提取关键特征,如MFCC系数、delta和delta-delta系数,这些特征将作为识别模型的输入。 5. 训练模型:利用机器学习算法(如隐马尔可夫模型HMM,支持向量机SVM,深度神经网络DNN等)训练模型,建立语音到文本的映射关系。 6. 匹配与识别:将待识别语音的特征与训练好的模型进行比较,找出最匹配的标签,完成识别过程。 在MATLAB中,可以使用Signal Processing Toolbox和Audio Toolbox来完成上述任务。例如,`audioread`用于读取音频文件,`preemph`进行预加重,`hamming`生成窗函数,`melFilterBank`计算MFCC,`hmmtrain`训练HMM模型,`viterbi`进行HMM的维特比解码。 在提供的压缩包中,"speech recognition"可能包含了以下文件: - `training.m`: 这个文件可能包含了训练部分的代码,用于构建识别模型。可能涵盖了特征提取和模型训练的过程。 - `matching.m`: 匹配部分的代码,用于将新语音样本与训练好的模型进行匹配,输出识别结果。 - `demo.m`: 演示部分,通常会调用上述两个函数,用户可以通过运行此文件来体验整个语音识别系统。 通过理解并实践这些MATLAB代码,你可以深入掌握语音识别的核心算法和技术,为自己的项目或研究打下坚实的基础。不过,需要注意的是,实际应用中的语音识别可能还需要考虑到更多的因素,如发音多样性、背景噪声、说话速度等,这需要在模型训练和参数调整中不断优化。

文件下载

资源详情

[{"title":"( 76 个子文件 745KB ) 用matlab实现的语音识别算法","children":[{"title":"speech recongition","children":[{"title":"gpl.txt <span style='color:#111;'> 17.90KB </span>","children":null,"spread":false},{"title":"speechann.p <span style='color:#111;'> 23.83KB </span>","children":null,"spread":false},{"title":"readme.m <span style='color:#111;'> 3.20KB </span>","children":null,"spread":false},{"title":"amintest","children":[{"title":"s2.wav <span style='color:#111;'> 108.94KB </span>","children":null,"spread":false},{"title":"s3.wav <span style='color:#111;'> 97.28KB </span>","children":null,"spread":false},{"title":"s1.wav <span style='color:#111;'> 73.54KB </span>","children":null,"spread":false}],"spread":true},{"title":"train.p <span style='color:#111;'> 952B </span>","children":null,"spread":false},{"title":"matlab.mat <span style='color:#111;'> 128B </span>","children":null,"spread":false},{"title":"test","children":[{"title":"s8.wav <span style='color:#111;'> 28.04KB </span>","children":null,"spread":false},{"title":"s6.wav <span style='color:#111;'> 33.04KB </span>","children":null,"spread":false},{"title":"s7.wav <span style='color:#111;'> 28.54KB </span>","children":null,"spread":false},{"title":"s2.wav <span style='color:#111;'> 27.54KB </span>","children":null,"spread":false},{"title":"s5.wav <span style='color:#111;'> 32.04KB </span>","children":null,"spread":false},{"title":"s3.wav <span style='color:#111;'> 37.04KB </span>","children":null,"spread":false},{"title":"s4.wav <span style='color:#111;'> 31.04KB </span>","children":null,"spread":false},{"title":"s1.wav <span style='color:#111;'> 24.54KB </span>","children":null,"spread":false}],"spread":true},{"title":"mfcc.p <span style='color:#111;'> 2.12KB </span>","children":null,"spread":false},{"title":"demo.m <span style='color:#111;'> 3.56KB </span>","children":null,"spread":false},{"title":"rfft.p <span style='color:#111;'> 2.01KB </span>","children":null,"spread":false},{"title":"train","children":[{"title":"s8.wav <span style='color:#111;'> 28.54KB </span>","children":null,"spread":false},{"title":"s6.wav <span style='color:#111;'> 29.04KB </span>","children":null,"spread":false},{"title":"s7.wav <span style='color:#111;'> 28.04KB </span>","children":null,"spread":false},{"title":"s2.wav <span style='color:#111;'> 26.54KB </span>","children":null,"spread":false},{"title":"s5.wav <span style='color:#111;'> 35.54KB </span>","children":null,"spread":false},{"title":"s3.wav <span style='color:#111;'> 26.04KB </span>","children":null,"spread":false},{"title":"s4.wav <span style='color:#111;'> 29.04KB </span>","children":null,"spread":false},{"title":"s1.wav <span style='color:#111;'> 25.54KB </span>","children":null,"spread":false}],"spread":true},{"title":"amintrain","children":[{"title":"s2.wav <span style='color:#111;'> 102.35KB </span>","children":null,"spread":false},{"title":"s3.wav <span style='color:#111;'> 97.50KB </span>","children":null,"spread":false},{"title":"s1.wav <span style='color:#111;'> 129.26KB </span>","children":null,"spread":false}],"spread":true},{"title":"blockFrames.p <span style='color:#111;'> 1.27KB </span>","children":null,"spread":false},{"title":"explanation.txt <span style='color:#111;'> 399B </span>","children":null,"spread":false},{"title":"melbankm.p <span style='color:#111;'> 5.92KB </span>","children":null,"spread":false},{"title":"sourcecode.asv <span style='color:#111;'> 1.62KB </span>","children":null,"spread":false},{"title":"sourcecode.m <span style='color:#111;'> 1.66KB </span>","children":null,"spread":false},{"title":"MyWords","children":[{"title":"MALE3DESKTOP1.wav <span style='color:#111;'> 28.85KB </span>","children":null,"spread":false},{"title":"FEMALE1COMPUTER1.wav <span style='color:#111;'> 24.98KB </span>","children":null,"spread":false},{"title":"FEMALE1DESKTOP2.wav <span style='color:#111;'> 21.54KB </span>","children":null,"spread":false},{"title":"MALE2COMPUTER1.wav <span style='color:#111;'> 27.34KB </span>","children":null,"spread":false},{"title":"MALE1COMPUTER2.wav <span style='color:#111;'> 19.61KB </span>","children":null,"spread":false},{"title":"FEMALE1ONE2.wav <span style='color:#111;'> 22.83KB </span>","children":null,"spread":false},{"title":"MALE1COMPUTER.wav <span style='color:#111;'> 22.40KB </span>","children":null,"spread":false},{"title":"MALE3COMUTER2.wav <span style='color:#111;'> 30.99KB </span>","children":null,"spread":false},{"title":"MALE1ONE1.wav <span style='color:#111;'> 18.96KB </span>","children":null,"spread":false},{"title":"MALE2ONE1.wav <span style='color:#111;'> 28.42KB </span>","children":null,"spread":false},{"title":"MALE1COMPUTER1.wav <span style='color:#111;'> 20.04KB </span>","children":null,"spread":false},{"title":"MALE3DESKTOP2.wav <span style='color:#111;'> 26.27KB </span>","children":null,"spread":false},{"title":"FEMALE1COMUTER2.wav <span style='color:#111;'> 24.98KB </span>","children":null,"spread":false},{"title":"MALE2ONE2.wav <span style='color:#111;'> 27.99KB </span>","children":null,"spread":false},{"title":"FELMALE1ONE1.wav <span style='color:#111;'> 20.04KB </span>","children":null,"spread":false},{"title":"MALE1ZERO1.wav <span style='color:#111;'> 23.26KB </span>","children":null,"spread":false},{"title":"MALE3ZERO1.wav <span style='color:#111;'> 26.05KB </span>","children":null,"spread":false},{"title":"MALE3ZERO2.wav <span style='color:#111;'> 30.35KB </span>","children":null,"spread":false},{"title":"MALE1ZERO2.wav <span style='color:#111;'> 17.03KB </span>","children":null,"spread":false},{"title":"MALE1ONE2.wav <span style='color:#111;'> 21.76KB </span>","children":null,"spread":false},{"title":"MALE2DESKTOP1.wav <span style='color:#111;'> 30.78KB </span>","children":null,"spread":false},{"title":"MALE2ZERO1.wav <span style='color:#111;'> 24.98KB </span>","children":null,"spread":false},{"title":"MALE2COMPUTER2.wav <span style='color:#111;'> 27.77KB </span>","children":null,"spread":false},{"title":"MALE2ZERO2.wav <span style='color:#111;'> 28.42KB </span>","children":null,"spread":false},{"title":"FEMALE1ZERO2.wav <span style='color:#111;'> 20.04KB </span>","children":null,"spread":false},{"title":"MALE3ONE2.wav <span style='color:#111;'> 24.76KB </span>","children":null,"spread":false},{"title":"MALE3ONE1.wav <span style='color:#111;'> 26.91KB </span>","children":null,"spread":false},{"title":"FEMALE1DESKTOP1.wav <span style='color:#111;'> 30.78KB </span>","children":null,"spread":false},{"title":"MALE3COMPUTER1.wav <span style='color:#111;'> 30.56KB </span>","children":null,"spread":false},{"title":"MALE2DESKTOP2.wav <span style='color:#111;'> 28.63KB </span>","children":null,"spread":false},{"title":"FEMALE1ZERO1.wav <span style='color:#111;'> 20.68KB </span>","children":null,"spread":false}],"spread":false},{"title":"vqlbg.p <span style='color:#111;'> 2.06KB </span>","children":null,"spread":false},{"title":"rdct.p <span style='color:#111;'> 2.59KB </span>","children":null,"spread":false},{"title":"enframe.p <span style='color:#111;'> 1.79KB </span>","children":null,"spread":false},{"title":"melfb.p <span style='color:#111;'> 2.65KB </span>","children":null,"spread":false},{"title":"disteu.p <span style='color:#111;'> 1.76KB </span>","children":null,"spread":false},{"title":"melcepst.p <span style='color:#111;'> 9.09KB </span>","children":null,"spread":false},{"title":"demo.asv <span style='color:#111;'> 3.79KB </span>","children":null,"spread":false},{"title":"www.pudn.com.txt <span style='color:#111;'> 218B </span>","children":null,"spread":false},{"title":"Read Me.txt <span style='color:#111;'> 327B </span>","children":null,"spread":false},{"title":"test.p <span style='color:#111;'> 1.72KB </span>","children":null,"spread":false}],"spread":false}],"spread":true}]

评论信息

  • YSU.ISE :
    都别下了,浪费。该 P 代码文件 C:\Users\jj\Desktop\speech recongition\blockFrames.p 是在 MATLAB 7.5 版(R2007b)之前生成的,且
    2019-06-05
  • wuwuwukkkkk :
    该代码是加密形式的,没办法看到具体代码,用于学习不太适合。
    2018-07-10
  • weixin_39380815 :
    还不错,应该有帮助
    2018-06-03
  • Deron_Kong :
    很不错的东西 值得学习
    2018-03-27
  • 霜冻召唤师 :
    觉得还不错,应该有帮助
    2018-01-06

免责申明

【只为小站】的资源来自网友分享,仅供学习研究,请务必在下载后24小时内给予删除,不得用于其他任何用途,否则后果自负。基于互联网的特殊性,【只为小站】 无法对用户传输的作品、信息、内容的权属或合法性、合规性、真实性、科学性、完整权、有效性等进行实质审查;无论 【只为小站】 经营者是否已进行审查,用户均应自行承担因其传输的作品、信息、内容而可能或已经产生的侵权或权属纠纷等法律责任。
本站所有资源不代表本站的观点或立场,基于网友分享,根据中国法律《信息网络传播权保护条例》第二十二条之规定,若资源存在侵权或相关问题请联系本站客服人员,zhiweidada#qq.com,请把#换成@,本站将给予最大的支持与配合,做到及时反馈和处理。关于更多版权及免责申明参见 版权及免责申明