深度学习基于卷积神经网络的语音识别系统源代码.zip

上传者: 55305220 | 上传时间: 2022-06-12 09:08:04 | 文件大小: 36.73MB | 文件类型: ZIP
深度学习基于全卷积神经网络的语音识别系统源代码。 本项目使用基于卷积神经网络实现。 通过下载复制以后,需要将datalist目录下的文件全部拷贝到dataset目录下,也就是将其跟数据集放在一起。 $ cp -rf datalist/* dataset/ 目前可用的模型有24、25和251 本项目开始训练请执行: $ python3 train_mspeech.py 本项目开始测试请执行: $ python3 test_mspeech.py iters_num (这里的iters_num为迭代的step数,可以在生成的step_dfcnn.txt文件里查看) 测试之前,请确保代码中填写的模型文件路径存在。 ASRT API服务器启动请执行: $ python3 asrserver.py Model 模型 Speech Model 语音模型 CNN + LSTM/GRU + CTC Language Model 语言模型 基于概率图的最大熵隐马尔可夫模型 About Accuracy 关于准确率

文件下载

资源详情

[{"title":"( 98 个子文件 36.73MB ) 深度学习基于卷积神经网络的语音识别系统源代码.zip","children":[{"title":"DFCNN-master-master","children":[{"title":"step_dfcnn.txt <span style='color:#111;'> 54B </span>","children":null,"spread":false},{"title":"README.md <span style='color:#111;'> 2.38KB </span>","children":null,"spread":false},{"title":"model_speech","children":[{"title":"m_DFCNN","children":[{"title":"speech_model_dfcnn_e_0_step_60000.model.base <span style='color:#111;'> 5.82MB </span>","children":null,"spread":false},{"title":"speech_model_dfcnn_e_0_step_60000.model <span style='color:#111;'> 5.80MB </span>","children":null,"spread":false}],"spread":true}],"spread":true},{"title":"datalist","children":[{"title":"aishell","children":[{"title":"train.wav.lst <span style='color:#111;'> 7.67MB </span>","children":null,"spread":false},{"title":"test.wav.lst <span style='color:#111;'> 462.52KB </span>","children":null,"spread":false},{"title":"train.syllabel.txt <span style='color:#111;'> 10.30MB </span>","children":null,"spread":false},{"title":"dev.syllabel.txt <span style='color:#111;'> 1.22MB </span>","children":null,"spread":false},{"title":"dev.wav.lst <span style='color:#111;'> 909.37KB </span>","children":null,"spread":false},{"title":"test.syllabel.txt <span style='color:#111;'> 637.71KB </span>","children":null,"spread":false}],"spread":true},{"title":"st-cmds","children":[{"title":"dev.wav.txt <span style='color:#111;'> 38.67KB </span>","children":null,"spread":false},{"title":"train.syllabel.txt <span style='color:#111;'> 7.06MB </span>","children":null,"spread":false},{"title":"dev.syllabel.txt <span style='color:#111;'> 43.71KB </span>","children":null,"spread":false},{"title":"test.wav.txt <span style='color:#111;'> 128.91KB </span>","children":null,"spread":false},{"title":"train.wav.txt <span style='color:#111;'> 6.29MB </span>","children":null,"spread":false},{"title":"test.syllabel.txt <span style='color:#111;'> 145.12KB </span>","children":null,"spread":false}],"spread":true},{"title":"thchs30","children":[{"title":"train.wav.lst <span style='color:#111;'> 371.09KB </span>","children":null,"spread":false},{"title":"test.wav.lst <span style='color:#111;'> 90.64KB </span>","children":null,"spread":false},{"title":"train.syllabel.txt <span style='color:#111;'> 1.65MB </span>","children":null,"spread":false},{"title":"dev.syllabel.txt <span style='color:#111;'> 151.37KB </span>","children":null,"spread":false},{"title":"dev.wav.lst <span style='color:#111;'> 31.47KB </span>","children":null,"spread":false},{"title":"test.syllabel.txt <span style='color:#111;'> 422.74KB </span>","children":null,"spread":false}],"spread":true}],"spread":true},{"title":"dataset","children":[{"title":"aishell","children":[{"title":"train.wav.lst <span style='color:#111;'> 7.67MB </span>","children":null,"spread":false},{"title":"test.wav.lst <span style='color:#111;'> 462.52KB </span>","children":null,"spread":false},{"title":"train.syllabel.txt <span style='color:#111;'> 10.30MB </span>","children":null,"spread":false},{"title":"dev.syllabel.txt <span style='color:#111;'> 1.22MB </span>","children":null,"spread":false},{"title":"dev.wav.lst <span style='color:#111;'> 909.37KB </span>","children":null,"spread":false},{"title":"test.syllabel.txt <span style='color:#111;'> 637.71KB </span>","children":null,"spread":false}],"spread":true},{"title":"st-cmds","children":[{"title":"dev.wav.txt <span style='color:#111;'> 38.67KB </span>","children":null,"spread":false},{"title":"train.syllabel.txt <span style='color:#111;'> 7.06MB </span>","children":null,"spread":false},{"title":"dev.syllabel.txt <span style='color:#111;'> 43.71KB </span>","children":null,"spread":false},{"title":"test.wav.txt <span style='color:#111;'> 128.91KB </span>","children":null,"spread":false},{"title":"train.wav.txt <span style='color:#111;'> 6.29MB </span>","children":null,"spread":false},{"title":"test.syllabel.txt <span style='color:#111;'> 145.12KB </span>","children":null,"spread":false}],"spread":true},{"title":"thchs30","children":[{"title":"train.wav.lst <span style='color:#111;'> 371.09KB </span>","children":null,"spread":false},{"title":"test.wav.lst <span style='color:#111;'> 90.64KB </span>","children":null,"spread":false},{"title":"train.syllabel.txt <span style='color:#111;'> 1.65MB </span>","children":null,"spread":false},{"title":"dev.syllabel.txt <span style='color:#111;'> 151.37KB </span>","children":null,"spread":false},{"title":"dev.wav.lst <span style='color:#111;'> 31.47KB </span>","children":null,"spread":false},{"title":"test.syllabel.txt <span style='color:#111;'> 422.74KB </span>","children":null,"spread":false}],"spread":true}],"spread":true},{"title":"asrserver.py <span style='color:#111;'> 2.99KB </span>","children":null,"spread":false},{"title":"model_language","children":[{"title":"language_model1.txt <span style='color:#111;'> 47.01KB </span>","children":null,"spread":false},{"title":"dic_pinyin.txt <span style='color:#111;'> 1.94MB </span>","children":null,"spread":false},{"title":"language_model2.txt <span style='color:#111;'> 4.97MB </span>","children":null,"spread":false}],"spread":true},{"title":"test.py <span style='color:#111;'> 3.19KB </span>","children":null,"spread":false},{"title":"testClient.py <span style='color:#111;'> 435B </span>","children":null,"spread":false},{"title":"log","children":[{"title":"20180907.log <span style='color:#111;'> 15.64KB </span>","children":null,"spread":false},{"title":"20180829.log <span style='color:#111;'> 15.29KB </span>","children":null,"spread":false},{"title":"20180903.log <span style='color:#111;'> 5.66MB </span>","children":null,"spread":false},{"title":"20180831.log <span style='color:#111;'> 9.77KB </span>","children":null,"spread":false},{"title":"20180905_2.log <span style='color:#111;'> 577.06KB </span>","children":null,"spread":false},{"title":"20180905.log <span style='color:#111;'> 369.23KB </span>","children":null,"spread":false},{"title":"20180830.log <span style='color:#111;'> 9.77KB </span>","children":null,"spread":false},{"title":"20180906.log <span style='color:#111;'> 2.73MB </span>","children":null,"spread":false},{"title":"20180904.log <span style='color:#111;'> 160.95KB </span>","children":null,"spread":false},{"title":"20180824.log <span style='color:#111;'> 112.75KB </span>","children":null,"spread":false}],"spread":true},{"title":"general_function","children":[{"title":"muti_gpu.py <span style='color:#111;'> 3.84KB </span>","children":null,"spread":false},{"title":"file_dict.py <span style='color:#111;'> 11.63KB </span>","children":null,"spread":false},{"title":"gen_func.py <span style='color:#111;'> 514B </span>","children":null,"spread":false},{"title":"file_wav.py <span style='color:#111;'> 7.90KB </span>","children":null,"spread":false},{"title":"__pycache__","children":[{"title":"__init__.cpython-36.pyc <span style='color:#111;'> 311B </span>","children":null,"spread":false},{"title":"file_wav.cpython-36.pyc <span style='color:#111;'> 4.74KB </span>","children":null,"spread":false},{"title":"gen_func.cpython-36.pyc <span style='color:#111;'> 555B </span>","children":null,"spread":false},{"title":"muti_gpu.cpython-36.pyc <span style='color:#111;'> 3.79KB </span>","children":null,"spread":false},{"title":"file_dict.cpython-36.pyc <span style='color:#111;'> 14.14KB </span>","children":null,"spread":false}],"spread":false},{"title":"__init__.py <span style='color:#111;'> 166B </span>","children":null,"spread":false}],"spread":true},{"title":"LanguageModel.py <span style='color:#111;'> 7.46KB </span>","children":null,"spread":false},{"title":"test_mspeech.py <span style='color:#111;'> 2.24KB </span>","children":null,"spread":false},{"title":"kill_PID.py <span style='color:#111;'> 254B </span>","children":null,"spread":false},{"title":"read_data_aishell.py <span style='color:#111;'> 21.82KB </span>","children":null,"spread":false},{"title":"gen_aishell_data","children":[{"title":"datalist","children":[{"title":"aishell","children":[{"title":"train.wav.lst <span style='color:#111;'> 7.67MB </span>","children":null,"spread":false},{"title":"test.wav.lst <span style='color:#111;'> 462.52KB </span>","children":null,"spread":false},{"title":"dev.syllable.txt <span style='color:#111;'> 1.22MB </span>","children":null,"spread":false},{"title":"train.syllable.txt <span style='color:#111;'> 10.30MB </span>","children":null,"spread":false},{"title":"dev.wav.lst <span style='color:#111;'> 909.37KB </span>","children":null,"spread":false},{"title":"test.syllable.txt <span style='color:#111;'> 637.71KB </span>","children":null,"spread":false}],"spread":false},{"title":"st-cmds","children":[{"title":"dev.wav.txt <span style='color:#111;'> 38.67KB </span>","children":null,"spread":false},{"title":"dev.syllable.txt <span style='color:#111;'> 43.71KB </span>","children":null,"spread":false},{"title":"test.wav.txt <span style='color:#111;'> 128.91KB </span>","children":null,"spread":false},{"title":"train.syllable.txt <span style='color:#111;'> 7.06MB </span>","children":null,"spread":false},{"title":"train.wav.txt <span style='color:#111;'> 6.29MB </span>","children":null,"spread":false},{"title":"test.syllable.txt <span style='color:#111;'> 145.12KB </span>","children":null,"spread":false}],"spread":false},{"title":".st-cmds.swp <span style='color:#111;'> 12.00KB </span>","children":null,"spread":false},{"title":"thchs30","children":[{"title":"cv.wav.lst <span style='color:#111;'> 31.47KB </span>","children":null,"spread":false},{"title":"train.wav.lst <span style='color:#111;'> 371.09KB </span>","children":null,"spread":false},{"title":"test.wav.lst <span style='color:#111;'> 90.64KB </span>","children":null,"spread":false},{"title":"cv.syllable.txt <span style='color:#111;'> 151.37KB </span>","children":null,"spread":false},{"title":"train.syllable.txt <span style='color:#111;'> 1.65MB </span>","children":null,"spread":false},{"title":"test.syllable.txt <span style='color:#111;'> 422.74KB </span>","children":null,"spread":false}],"spread":false}],"spread":false},{"title":"read_data_aishell.py <span style='color:#111;'> 21.82KB </span>","children":null,"spread":false},{"title":"dict.txt <span style='color:#111;'> 32.09KB </span>","children":null,"spread":false},{"title":"aishell_pre.py <span style='color:#111;'> 4.78KB </span>","children":null,"spread":false},{"title":"gen_dict.py <span style='color:#111;'> 13.04KB </span>","children":null,"spread":false}],"spread":false},{"title":"dict_2.txt <span style='color:#111;'> 32.19KB </span>","children":null,"spread":false},{"title":"read_data.py <span style='color:#111;'> 10.67KB </span>","children":null,"spread":false},{"title":"SpeechModel_DFCNN.py <span style='color:#111;'> 16.91KB </span>","children":null,"spread":false},{"title":"train_mspeech.py <span style='color:#111;'> 1.65KB </span>","children":null,"spread":false},{"title":"gpu_condition.py <span style='color:#111;'> 60B </span>","children":null,"spread":false}],"spread":false}],"spread":true}]

评论信息

免责申明

【只为小站】的资源来自网友分享,仅供学习研究,请务必在下载后24小时内给予删除,不得用于其他任何用途,否则后果自负。基于互联网的特殊性,【只为小站】 无法对用户传输的作品、信息、内容的权属或合法性、合规性、真实性、科学性、完整权、有效性等进行实质审查;无论 【只为小站】 经营者是否已进行审查,用户均应自行承担因其传输的作品、信息、内容而可能或已经产生的侵权或权属纠纷等法律责任。
本站所有资源不代表本站的观点或立场,基于网友分享,根据中国法律《信息网络传播权保护条例》第二十二条之规定,若资源存在侵权或相关问题请联系本站客服人员,zhiweidada#qq.com,请把#换成@,本站将给予最大的支持与配合,做到及时反馈和处理。关于更多版权及免责申明参见 版权及免责申明