LaboroTVSpeech

上传者: 42134769 | 上传时间: 2022-05-07 19:08:24 | 文件大小: 58KB | 文件类型: ZIP
LaboroTVSpeech 电视录音中的大型日语语音语料库 关于语料库 LaboroTVSpeech是一种大型的日语语音语料库,由广播的电视录音及其字幕构建而成。 我们当前的版本包含超过2,000个小时的演讲时间。 细节 所有音频样本均基于原始字幕的音域进行了细分,采样率为16 KHz。 使用和作为字典,将每个语音段标记为单词序列。 每个单词标记都包含一个简单的语素标记,例如名词(名词)或动词(动词),它们是通过预处理原始字幕获得的。 从原始的电视音频和字幕中,我们提取了语音段,从而能够以较高的置信度对齐音频和字幕段。 我们反复使用和 。 所有语音片段均随机洗牌。 子集 火车 开发者 音频长度(小时) 2036.2 13.7 #音频片段 160万 12 K #单词(令牌) 2 200万 147千 预防 某些单词的发音或语素标记可能不正确,尤其是对于随意单词。 例如「

文件下载

资源详情

[{"title":"( 62 个子文件 58KB ) LaboroTVSpeech","children":[{"title":"LaboroTVSpeech-main","children":[{"title":"kaldi","children":[{"title":"laborotv","children":[{"title":"s5","children":[{"title":"conf","children":[{"title":"fbank.conf <span style='color:#111;'> 394B </span>","children":null,"spread":false},{"title":"online_pitch.conf <span style='color:#111;'> 2.99KB </span>","children":null,"spread":false},{"title":"config_opt <span style='color:#111;'> 589B </span>","children":null,"spread":false},{"title":"fbank_40.conf <span style='color:#111;'> 18B </span>","children":null,"spread":false},{"title":"decode_dnn.config <span style='color:#111;'> 122B </span>","children":null,"spread":false},{"title":"topo.proto <span style='color:#111;'> 960B </span>","children":null,"spread":false},{"title":"mfcc_hires.conf <span style='color:#111;'> 616B </span>","children":null,"spread":false},{"title":"decode_tandem.config <span style='color:#111;'> 70B </span>","children":null,"spread":false},{"title":"plp.conf <span style='color:#111;'> 25B </span>","children":null,"spread":false},{"title":"pitch.conf <span style='color:#111;'> 25B </span>","children":null,"spread":false},{"title":"online_cmvn.conf <span style='color:#111;'> 108B </span>","children":null,"spread":false},{"title":"mfcc.conf <span style='color:#111;'> 100B </span>","children":null,"spread":false},{"title":"decode.config <span style='color:#111;'> 109B </span>","children":null,"spread":false}],"spread":false},{"title":"local","children":[{"title":"score.sh <span style='color:#111;'> 35B </span>","children":null,"spread":false},{"title":"nnet3","children":[{"title":"run_ivector_common.sh <span style='color:#111;'> 5.65KB </span>","children":null,"spread":false}],"spread":true},{"title":"chain","children":[{"title":"run_tdnn.sh <span style='color:#111;'> 8.25KB </span>","children":null,"spread":false}],"spread":true},{"title":"wer_output_filter <span style='color:#111;'> 270B </span>","children":null,"spread":false},{"title":"tedx-jp-10k_data_prep.sh <span style='color:#111;'> 1.13KB </span>","children":null,"spread":false},{"title":"lm","children":[{"title":"prepare_lang_from_arpa.sh <span style='color:#111;'> 1.13KB </span>","children":null,"spread":false},{"title":"prepare_dict.sh <span style='color:#111;'> 844B </span>","children":null,"spread":false},{"title":"interpolate_best_mix.sh <span style='color:#111;'> 2.59KB </span>","children":null,"spread":false}],"spread":true},{"title":"laborotv_data_prep.sh <span style='color:#111;'> 939B </span>","children":null,"spread":false},{"title":"laborotv_train_lms.sh <span style='color:#111;'> 2.58KB </span>","children":null,"spread":false}],"spread":true},{"title":"utils <span style='color:#111;'> 18B </span>","children":null,"spread":false},{"title":"run.sh <span style='color:#111;'> 8.58KB </span>","children":null,"spread":false},{"title":"steps <span style='color:#111;'> 18B </span>","children":null,"spread":false},{"title":"cmd.sh <span style='color:#111;'> 1.00KB </span>","children":null,"spread":false},{"title":"path.sh <span style='color:#111;'> 536B </span>","children":null,"spread":false}],"spread":true}],"spread":true},{"title":"laborotv_csj","children":[{"title":"s5","children":[{"title":"conf","children":[{"title":"fbank.conf <span style='color:#111;'> 394B </span>","children":null,"spread":false},{"title":"online_pitch.conf <span style='color:#111;'> 2.99KB </span>","children":null,"spread":false},{"title":"config_opt <span style='color:#111;'> 589B </span>","children":null,"spread":false},{"title":"fbank_40.conf <span style='color:#111;'> 18B </span>","children":null,"spread":false},{"title":"decode_dnn.config <span style='color:#111;'> 122B </span>","children":null,"spread":false},{"title":"topo.proto <span style='color:#111;'> 960B </span>","children":null,"spread":false},{"title":"mfcc_hires.conf <span style='color:#111;'> 616B </span>","children":null,"spread":false},{"title":"decode_tandem.config <span style='color:#111;'> 70B </span>","children":null,"spread":false},{"title":"plp.conf <span style='color:#111;'> 25B </span>","children":null,"spread":false},{"title":"pitch.conf <span style='color:#111;'> 25B </span>","children":null,"spread":false},{"title":"online_cmvn.conf <span style='color:#111;'> 108B </span>","children":null,"spread":false},{"title":"mfcc.conf <span style='color:#111;'> 100B </span>","children":null,"spread":false},{"title":"decode.config <span style='color:#111;'> 109B </span>","children":null,"spread":false}],"spread":false},{"title":"local","children":[{"title":"score.sh <span style='color:#111;'> 35B </span>","children":null,"spread":false},{"title":"csj_prep_all.sh <span style='color:#111;'> 5.64KB </span>","children":null,"spread":false},{"title":"csj_symlink_scripts.sh <span style='color:#111;'> 765B </span>","children":null,"spread":false},{"title":"nnet3","children":[{"title":"run_ivector_common.sh <span style='color:#111;'> 6.09KB </span>","children":null,"spread":false}],"spread":true},{"title":"chain","children":[{"title":"run_tdnn.sh <span style='color:#111;'> 8.25KB </span>","children":null,"spread":false}],"spread":true},{"title":"wer_output_filter <span style='color:#111;'> 270B </span>","children":null,"spread":false},{"title":"tedx-jp-10k_data_prep.sh <span style='color:#111;'> 1.13KB </span>","children":null,"spread":false},{"title":"laborotv_prep_all.sh <span style='color:#111;'> 2.00KB </span>","children":null,"spread":false},{"title":"lm","children":[{"title":"prepare_lang_from_arpa.sh <span style='color:#111;'> 1.13KB </span>","children":null,"spread":false},{"title":"prepare_dict.sh <span style='color:#111;'> 844B </span>","children":null,"spread":false},{"title":"interpolate_best_mix.sh <span style='color:#111;'> 2.53KB </span>","children":null,"spread":false}],"spread":false},{"title":"laborotv_data_prep.sh <span style='color:#111;'> 1.17KB </span>","children":null,"spread":false},{"title":"laborotv_train_lms.sh <span style='color:#111;'> 2.78KB </span>","children":null,"spread":false}],"spread":false},{"title":"utils <span style='color:#111;'> 18B </span>","children":null,"spread":false},{"title":"run.sh <span style='color:#111;'> 10.99KB </span>","children":null,"spread":false},{"title":".gitignore <span style='color:#111;'> 103B </span>","children":null,"spread":false},{"title":"steps <span style='color:#111;'> 18B </span>","children":null,"spread":false},{"title":"cmd.sh <span style='color:#111;'> 1.00KB </span>","children":null,"spread":false},{"title":"path.sh <span style='color:#111;'> 536B </span>","children":null,"spread":false}],"spread":true}],"spread":true}],"spread":true},{"title":"LICENSE <span style='color:#111;'> 11.09KB </span>","children":null,"spread":false},{"title":"README.md <span style='color:#111;'> 5.10KB </span>","children":null,"spread":false}],"spread":true}],"spread":true}]

评论信息

免责申明

【只为小站】的资源来自网友分享,仅供学习研究,请务必在下载后24小时内给予删除,不得用于其他任何用途,否则后果自负。基于互联网的特殊性,【只为小站】 无法对用户传输的作品、信息、内容的权属或合法性、合规性、真实性、科学性、完整权、有效性等进行实质审查;无论 【只为小站】 经营者是否已进行审查,用户均应自行承担因其传输的作品、信息、内容而可能或已经产生的侵权或权属纠纷等法律责任。
本站所有资源不代表本站的观点或立场,基于网友分享,根据中国法律《信息网络传播权保护条例》第二十二条之规定,若资源存在侵权或相关问题请联系本站客服人员,zhiweidada#qq.com,请把#换成@,本站将给予最大的支持与配合,做到及时反馈和处理。关于更多版权及免责申明参见 版权及免责申明