Talking-Face_PC-AVS:通过隐式模块化视听表示生成姿势可控制的说话人脸的代码(CVPR 2021)

上传者: 42170790 | 上传时间: 2022-05-18 16:34:32 | 文件大小: 28.48MB | 文件类型: ZIP
通过隐式模块化视听表示进行姿势可控的会说话脸生成(CVPR 2021) ,孙亚生,,,和。 | | 我们提出了可姿势控制的视听系统(PC-AVS) ,该系统可在驱动带有声音的任意说话的面部时实现自由姿势控制。 代替从音频中学习姿势运动,我们利用另一个姿势源视频来仅补偿头部运动。 关键是设计一种隐含的低维姿势代码,该代码不包含嘴巴形状或身份信息。 通过这种方式,将视听表示模块化为三个关键因素的空间:语音内容,头部姿势和身份信息。 要求 使用Python 3.6和 1.3.0。 基本要求在“ requirements.txt”中列出。 pip install -r requirements.txt 快速入门:生成演示结果 下载预训练的检查点。 创建默认的文件夹./checkpoints并解压demo.zip在./checkpoints/demo 。 其中应该有5 pth 。 解压缩mi

文件下载

资源详情

[{"title":"( 86 个子文件 28.48MB ) Talking-Face_PC-AVS:通过隐式模块化视听表示生成姿势可控制的说话人脸的代码(CVPR 2021)","children":[{"title":"Talking-Face_PC-AVS-main","children":[{"title":".gitignore <span style='color:#111;'> 1.30KB </span>","children":null,"spread":false},{"title":"requirements.txt <span style='color:#111;'> 153B </span>","children":null,"spread":false},{"title":"data","children":[{"title":"__init__.py <span style='color:#111;'> 2.65KB </span>","children":null,"spread":false},{"title":"base_dataset.py <span style='color:#111;'> 4.18KB </span>","children":null,"spread":false},{"title":"voxtest_dataset.py <span style='color:#111;'> 7.61KB </span>","children":null,"spread":false}],"spread":true},{"title":"options","children":[{"title":"train_options.py <span style='color:#111;'> 4.83KB </span>","children":null,"spread":false},{"title":"test_options.py <span style='color:#111;'> 2.20KB </span>","children":null,"spread":false},{"title":"__init__.py <span style='color:#111;'> 0B </span>","children":null,"spread":false},{"title":"base_options.py <span style='color:#111;'> 15.86KB </span>","children":null,"spread":false}],"spread":true},{"title":"experiments","children":[{"title":"demo_vox.sh <span style='color:#111;'> 681B </span>","children":null,"spread":false}],"spread":true},{"title":"models","children":[{"title":"av_model.py <span style='color:#111;'> 35.20KB </span>","children":null,"spread":false},{"title":"__init__.py <span style='color:#111;'> 1.15KB </span>","children":null,"spread":false},{"title":"networks","children":[{"title":"__init__.py <span style='color:#111;'> 2.73KB </span>","children":null,"spread":false},{"title":"base_network.py <span style='color:#111;'> 2.24KB </span>","children":null,"spread":false},{"title":"discriminator.py <span style='color:#111;'> 7.67KB </span>","children":null,"spread":false},{"title":"architecture.py <span style='color:#111;'> 5.04KB </span>","children":null,"spread":false},{"title":"sync_batchnorm","children":[{"title":"replicate.py <span style='color:#111;'> 4.28KB </span>","children":null,"spread":false},{"title":"__init__.py <span style='color:#111;'> 232B </span>","children":null,"spread":false},{"title":"unittest.py <span style='color:#111;'> 471B </span>","children":null,"spread":false},{"title":"__pycache__","children":[{"title":"scatter_gather.cpython-36.pyc <span style='color:#111;'> 1.80KB </span>","children":null,"spread":false},{"title":"scatter_gather.cpython-37.pyc <span style='color:#111;'> 1.80KB </span>","children":null,"spread":false},{"title":"comm.cpython-37.pyc <span style='color:#111;'> 4.66KB </span>","children":null,"spread":false},{"title":"batchnorm.cpython-37.pyc <span style='color:#111;'> 14.53KB </span>","children":null,"spread":false},{"title":"__init__.cpython-36.pyc <span style='color:#111;'> 466B </span>","children":null,"spread":false},{"title":"__init__.cpython-37.pyc <span style='color:#111;'> 458B </span>","children":null,"spread":false},{"title":"replicate.cpython-36.pyc <span style='color:#111;'> 4.46KB </span>","children":null,"spread":false},{"title":"comm.cpython-36.pyc <span style='color:#111;'> 4.67KB </span>","children":null,"spread":false},{"title":"replicate.cpython-37.pyc <span style='color:#111;'> 4.46KB </span>","children":null,"spread":false},{"title":"batchnorm.cpython-36.pyc <span style='color:#111;'> 14.55KB </span>","children":null,"spread":false}],"spread":false},{"title":"scatter_gather.py <span style='color:#111;'> 1.80KB </span>","children":null,"spread":false},{"title":"batchnorm_reimpl.py <span style='color:#111;'> 2.06KB </span>","children":null,"spread":false},{"title":"comm.py <span style='color:#111;'> 4.08KB </span>","children":null,"spread":false},{"title":"batchnorm.py <span style='color:#111;'> 15.19KB </span>","children":null,"spread":false}],"spread":false},{"title":"vision_network.py <span style='color:#111;'> 1.89KB </span>","children":null,"spread":false},{"title":"__pycache__","children":[{"title":"loss.cpython-37.pyc <span style='color:#111;'> 6.70KB </span>","children":null,"spread":false},{"title":"stylegan2.cpython-37.pyc <span style='color:#111;'> 16.02KB </span>","children":null,"spread":false},{"title":"stylegan2.cpython-36.pyc <span style='color:#111;'> 17.70KB </span>","children":null,"spread":false},{"title":"generator.cpython-36.pyc <span style='color:#111;'> 6.95KB </span>","children":null,"spread":false},{"title":"base_network.cpython-36.pyc <span style='color:#111;'> 2.32KB </span>","children":null,"spread":false},{"title":"FAN_feature_extractor.cpython-36.pyc <span style='color:#111;'> 4.50KB </span>","children":null,"spread":false},{"title":"loss.cpython-36.pyc <span style='color:#111;'> 5.82KB </span>","children":null,"spread":false},{"title":"audio_network.cpython-37.pyc <span style='color:#111;'> 5.41KB </span>","children":null,"spread":false},{"title":"discriminator.cpython-37.pyc <span style='color:#111;'> 7.16KB </span>","children":null,"spread":false},{"title":"normalization.cpython-36.pyc <span style='color:#111;'> 4.29KB </span>","children":null,"spread":false},{"title":"FAN_feature_extractor.cpython-37.pyc <span style='color:#111;'> 4.49KB </span>","children":null,"spread":false},{"title":"normalization.cpython-37.pyc <span style='color:#111;'> 4.29KB </span>","children":null,"spread":false},{"title":"generator.cpython-37.pyc <span style='color:#111;'> 16.25KB </span>","children":null,"spread":false},{"title":"architecture.cpython-37.pyc <span style='color:#111;'> 4.03KB </span>","children":null,"spread":false},{"title":"architecture.cpython-36.pyc <span style='color:#111;'> 7.08KB </span>","children":null,"spread":false},{"title":"base_network.cpython-37.pyc <span style='color:#111;'> 2.30KB </span>","children":null,"spread":false},{"title":"audio_architecture.cpython-36.pyc <span style='color:#111;'> 8.52KB </span>","children":null,"spread":false},{"title":"audio_architecture.cpython-37.pyc <span style='color:#111;'> 8.34KB </span>","children":null,"spread":false},{"title":"encoder.cpython-37.pyc <span style='color:#111;'> 4.16KB </span>","children":null,"spread":false},{"title":"__init__.cpython-36.pyc <span style='color:#111;'> 2.80KB </span>","children":null,"spread":false},{"title":"Voxceleb_model.cpython-36.pyc <span style='color:#111;'> 5.52KB </span>","children":null,"spread":false},{"title":"__init__.cpython-37.pyc <span style='color:#111;'> 2.90KB </span>","children":null,"spread":false},{"title":"discriminator.cpython-36.pyc <span style='color:#111;'> 11.35KB </span>","children":null,"spread":false},{"title":"Voxceleb_model.cpython-37.pyc <span style='color:#111;'> 5.49KB </span>","children":null,"spread":false},{"title":"encoder.cpython-36.pyc <span style='color:#111;'> 8.94KB </span>","children":null,"spread":false},{"title":"vision_network.cpython-36.pyc <span style='color:#111;'> 3.51KB </span>","children":null,"spread":false},{"title":"vision_network.cpython-37.pyc <span style='color:#111;'> 2.13KB </span>","children":null,"spread":false}],"spread":false},{"title":"util.py <span style='color:#111;'> 4.92KB </span>","children":null,"spread":false},{"title":"loss.py <span style='color:#111;'> 7.03KB </span>","children":null,"spread":false},{"title":"FAN_feature_extractor.py <span style='color:#111;'> 5.17KB </span>","children":null,"spread":false},{"title":"audio_network.py <span style='color:#111;'> 6.24KB </span>","children":null,"spread":false},{"title":"generator.py <span style='color:#111;'> 18.75KB </span>","children":null,"spread":false},{"title":"encoder.py <span style='color:#111;'> 3.16KB </span>","children":null,"spread":false}],"spread":false}],"spread":true},{"title":"LICENSE <span style='color:#111;'> 17.68KB </span>","children":null,"spread":false},{"title":"inference.py <span style='color:#111;'> 4.17KB </span>","children":null,"spread":false},{"title":"util","children":[{"title":"visualizer.py <span style='color:#111;'> 7.81KB </span>","children":null,"spread":false},{"title":"__init__.py <span style='color:#111;'> 1B </span>","children":null,"spread":false},{"title":"html.py <span style='color:#111;'> 2.22KB </span>","children":null,"spread":false},{"title":"iter_counter.py <span style='color:#111;'> 2.88KB </span>","children":null,"spread":false},{"title":"util.py <span style='color:#111;'> 8.24KB </span>","children":null,"spread":false}],"spread":true},{"title":"scripts","children":[{"title":"align_68.py <span style='color:#111;'> 3.11KB </span>","children":null,"spread":false},{"title":"prepare_testing_files.py <span style='color:#111;'> 5.54KB </span>","children":null,"spread":false}],"spread":true},{"title":"README.md <span style='color:#111;'> 5.64KB </span>","children":null,"spread":false},{"title":"config","children":[{"title":"AudioConfig.py <span style='color:#111;'> 6.17KB </span>","children":null,"spread":false}],"spread":true},{"title":"misc","children":[{"title":"method.png <span style='color:#111;'> 1.46MB </span>","children":null,"spread":false},{"title":"Audio_Source.zip <span style='color:#111;'> 1.57MB </span>","children":null,"spread":false},{"title":"Mouth_Source.zip <span style='color:#111;'> 2.29MB </span>","children":null,"spread":false},{"title":"Pose_Source.zip <span style='color:#111;'> 2.10MB </span>","children":null,"spread":false},{"title":"output.gif <span style='color:#111;'> 8.33MB </span>","children":null,"spread":false},{"title":"demo.csv <span style='color:#111;'> 125B </span>","children":null,"spread":false},{"title":"demo.gif <span style='color:#111;'> 12.49MB </span>","children":null,"spread":false},{"title":"Input.zip <span style='color:#111;'> 604.32KB </span>","children":null,"spread":false}],"spread":true}],"spread":false}],"spread":true}]

评论信息

免责申明

【只为小站】的资源来自网友分享,仅供学习研究,请务必在下载后24小时内给予删除,不得用于其他任何用途,否则后果自负。基于互联网的特殊性,【只为小站】 无法对用户传输的作品、信息、内容的权属或合法性、合规性、真实性、科学性、完整权、有效性等进行实质审查;无论 【只为小站】 经营者是否已进行审查,用户均应自行承担因其传输的作品、信息、内容而可能或已经产生的侵权或权属纠纷等法律责任。
本站所有资源不代表本站的观点或立场,基于网友分享,根据中国法律《信息网络传播权保护条例》第二十二条之规定,若资源存在侵权或相关问题请联系本站客服人员,zhiweidada#qq.com,请把#换成@,本站将给予最大的支持与配合,做到及时反馈和处理。关于更多版权及免责申明参见 版权及免责申明