Lip2Wav: the repository containing the code for our CVPR 2020 paper, "Learning Individual Speaking Styles for Accurate Lip to Speech Synthesis"

Uploader: 42117150 | Uploaded: 2023-01-26 10:49:42 | File size: 3.83MB | File type: ZIP
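Update: If you are looking for Wav2Lip, note that it is a separate project from Lip2Wav.

Lip2Wav generates high-quality speech from lip movements alone. This code accompanies the CVPR'20 paper "Learning Individual Speaking Styles for Accurate Lip to Speech Synthesis".

Recent updates:
The dataset and pre-trained models for all speakers have been released.
A pre-trained multi-speaker, word-level Lip2Wav model trained on the LRW dataset has been released (branch).

Highlights:
First work to generate intelligible speech from lip movements alone in unconstrained settings.
Sequence-to-sequence modelling of the problem.
A dataset of 5 speakers containing more than 100 hours of video data is provided.
Complete training code and pre-trained models are provided.
Inference code generates results from the pre-trained models.
Code to compute the metrics reported in the paper is also provided.

You might also be interested in:
Lip-syncing talking-face videos to any speech using Wav2Lip.

Prerequisites:
Python 3.7.4 (the code has been tested with this version)
ffmpeg: sudo apt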
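The two prerequisites named above can be checked before running anything from the archive. The following is a minimal sketch, not part of the repository: the function name `check_prerequisites` and its parameters are hypothetical, and it assumes that "ffmpeg is installed" simply means an `ffmpeg` executable is reachable on `PATH`.

```python
import shutil
import sys

# Version the description says the code was tested with.
TESTED_PYTHON = (3, 7, 4)


def check_prerequisites(version_info=None, which=shutil.which):
    """Return a list of problems; an empty list means both checks passed.

    `version_info` and `which` are injectable (hypothetical parameters)
    so the check can be exercised without changing the environment.
    """
    if version_info is None:
        version_info = sys.version_info
    problems = []
    found = tuple(version_info[:3])
    if found != TESTED_PYTHON:
        problems.append(
            "Python {}.{}.{} found; the code was tested with {}.{}.{}".format(
                *found, *TESTED_PYTHON
            )
        )
    # shutil.which returns None when no executable of that name is on PATH.
    if which("ffmpeg") is None:
        problems.append("ffmpeg not found on PATH")
    return problems


if __name__ == "__main__":
    for problem in check_prerequisites():
        print("warning:", problem)
```

A version mismatch is reported as a warning rather than an error here, since the description only states which version was tested, not that other versions fail.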

Resource details

Lip2Wav (69 files, 3.83MB)

Lip2Wav-master/
  images/
    banner.gif (3.77MB)
  face_detection/
    models.py (8.42KB)
    detection/
      sfd/
        sfd_detector.py (1.75KB)
        net_s3fd.py (5.17KB)
        detect.py (3.68KB)
        __init__.py (53B)
        bbox.py (4.18KB)
      core.py (4.75KB)
      __init__.py (30B)
    utils.py (11.53KB)
    __init__.py (183B)
    api.py (2.27KB)
    README.md (209B)
  License.md (1.04KB)
  train.py (2.98KB)
  synthesizer/
    feeder.py (11.72KB)
    models/
      architecture_wrappers.py (8.41KB)
      attention.py (8.86KB)
      tacotron.py (29.68KB)
      helpers.py (5.83KB)
      __init__.py (174B)
      custom_decoder.py (5.32KB)
      modules.py (24.29KB)
    synthesize.py (3.70KB)
    audio.py (7.60KB)
    inference.py (5.24KB)
    train.py (20.18KB)
    presets/
      hs.json (518B)
      chess.json (518B)
      dl.json (514B)
      chem.json (514B)
      eh.json (544B)
    __init__.py (1B)
    infolog.py (1.23KB)
    hparams.py (18.53KB)
    utils/
      cleaners.py (2.36KB)
      text.py (2.11KB)
      symbols.py (633B)
      __init__.py (444B)
      _cmudict.py (1.88KB)
      numbers.py (2.07KB)
      plot.py (2.15KB)
    tacotron2.py (12.36KB)
  download_speaker.sh (682B)
  preprocess.py (4.10KB)
  requirements.txt (433B)
  .gitignore (80B)
  complete_test_generate.py (4.65KB)
  score.py (1.35KB)
  README.md (6.17KB)
  utils/
    logmmse.py (8.97KB)
    profiler.py (1.44KB)
    __init__.py (0B)
    argutils.py (1.14KB)
  Dataset/
    chem/
      val.txt (204B)
      test.txt (192B)
      train.txt (3.67KB)
    hs/
      val.txt (24B)
      test.txt (24B)
      train.txt (684B)
    eh/
      val.txt (11B)
      test.txt (23B)
      train.txt (383B)
    chess/
      val.txt (12B)
      test.txt (37B)
      train.txt (2.39KB)
    dl/
      val.txt (24B)
      test.txt (24B)
      train.txt (1.32KB)
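To work with an archive listing like the one above programmatically, for example to print the full path of every file, a short recursive walk is enough. This is a minimal sketch under stated assumptions: each node is a dict with a `title` key holding a bare name and a `children` key that is `None` for a file and a list for a directory. The function name `iter_files` and the small `example` tree are hypothetical and cover only a few entries of the listing.

```python
def iter_files(node, prefix=""):
    """Yield slash-joined paths for every file (leaf) under a listing node.

    Assumption: a node whose "children" is None is a file; a node whose
    "children" is a list is a directory.
    """
    path = prefix + node["title"]
    children = node.get("children")
    if children is None:
        yield path
    else:
        for child in children:
            yield from iter_files(child, path + "/")


# A hand-written excerpt of the listing, with sizes omitted.
example = {
    "title": "Lip2Wav-master",
    "children": [
        {
            "title": "utils",
            "children": [
                {"title": "logmmse.py", "children": None},
                {"title": "profiler.py", "children": None},
            ],
        },
        {"title": "score.py", "children": None},
    ],
}

if __name__ == "__main__":
    for file_path in iter_files(example):
        print(file_path)
```

Using a generator keeps the walk lazy, so the same function works on the full 69-file listing without building an intermediate list.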
