evaluating-rewards: a library for comparing and evaluating reward functions

Python

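Evaluating Rewards

evaluating_rewards is a library for comparing and evaluating reward functions. The accompanying paper describes the methods implemented in this repository.

Getting Started

Installation

To install evaluating_rewards, clone the repository and run:

    pip install evaluating_rewards/

To install in developer mode, so that edits take effect immediately:

    pip install -e evaluating_rewards/

The package is compatible with Python 3.6 and later. Python 2 is not supported.

Computing EPIC Distance

The evaluating_rewards.analysis.dissimilarity_heatmaps.plot_epic_heatmap script provides a convenient front-end for generating heatmaps of the EPIC distance between reward models. For example, to reproduce Figure 2(a) from the paper, simply run:

    python -m evaluating_rewar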
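To make the notion of an EPIC distance between reward models more concrete, the following is a minimal pure-Python sketch for small tabular rewards. It assumes the usual description of EPIC as a canonicalization step followed by a Pearson distance, with states and actions drawn uniformly. The function names (`canonicalize`, `pearson_distance`, `epic_distance`) and the toy rewards are hypothetical and chosen for illustration only; they are not the API of evaluating_rewards, whose own implementation should be consulted for real use.

```python
import math


def canonicalize(reward, gamma):
    """Canonically shape a tabular reward R[s][a][s2] (uniform S, A, S' assumed)."""
    n_states = len(reward)
    n_actions = len(reward[0])
    # mean_from[s] = E[R(s, A, S')] with A and S' drawn uniformly.
    mean_from = [
        sum(sum(row) for row in reward[s]) / (n_actions * n_states)
        for s in range(n_states)
    ]
    # mean_all = E[R(S, A, S')] with S, A and S' all drawn uniformly.
    mean_all = sum(mean_from) / n_states
    return [
        [
            [
                reward[s][a][s2]
                + gamma * mean_from[s2]
                - mean_from[s]
                - gamma * mean_all
                for s2 in range(n_states)
            ]
            for a in range(n_actions)
        ]
        for s in range(n_states)
    ]


def pearson_distance(xs, ys):
    """Return sqrt((1 - rho) / 2), where rho is the Pearson correlation."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    rho = cov / math.sqrt(var_x * var_y)
    # Clamp to guard against rho marginally exceeding 1 through rounding.
    return math.sqrt(max(0.0, (1.0 - rho) / 2.0))


def flatten(reward):
    """Flatten R[s][a][s2] into a single list of values."""
    return [v for per_state in reward for per_action in per_state for v in per_action]


def epic_distance(reward_a, reward_b, gamma):
    """Pearson distance between the canonicalized versions of two rewards."""
    return pearson_distance(
        flatten(canonicalize(reward_a, gamma)),
        flatten(canonicalize(reward_b, gamma)),
    )


# Toy rewards (hypothetical): 3 states, 2 actions.
GAMMA = 0.9
N_STATES, N_ACTIONS = 3, 2
potential = [0.0, 5.0, -2.0]  # arbitrary potential function for shaping

reward_a = [
    [[float(a + s * s2) for s2 in range(N_STATES)] for a in range(N_ACTIONS)]
    for s in range(N_STATES)
]
# reward_b rescales reward_a and adds potential-based shaping.
reward_b = [
    [
        [
            2.0 * reward_a[s][a][s2] + GAMMA * potential[s2] - potential[s]
            for s2 in range(N_STATES)
        ]
        for a in range(N_ACTIONS)
    ]
    for s in range(N_STATES)
]
# reward_neg is the exact opposite of reward_a.
reward_neg = [[[-v for v in row] for row in per_state] for per_state in reward_a]

print(epic_distance(reward_a, reward_b, GAMMA))    # close to 0
print(epic_distance(reward_a, reward_neg, GAMMA))  # close to 1
```

In this sketch, a reward that differs only by positive scaling and potential-based shaping ends up at a distance near 0, while the negated reward ends up near the maximum distance of 1. A heatmap such as the one produced by the script above would display such pairwise distances for a set of reward models.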

Resource details

(114 files, 260KB) evaluating-rewards: a library for comparing and evaluating reward functions

  setup.cfg  540B
  Dockerfile  2.87KB
  .dockerignore  10B
  .gitignore  1.81KB
  epic_demo.ipynb  9.17KB
  LICENSE  11.09KB
  LICENSE  298B
  README.md  5.53KB
  CONTRIBUTING.md  1.08KB
  README.md  105B
  noto_running.pdf  18.26KB
  mozilla_cheetah.pdf  17.22KB
  noto_snail.pdf  7.29KB
  vexel_backflip.pdf  5.05KB
  aggregated.pkl  9.95KB
  combined_distances.py  38.42KB
  base.py  33.94KB
  tabular.py  20.92KB
  synthetic.py  17.40KB
  epic.py  17.10KB
  mujoco.py  16.26KB
  test_rewards.py  16.00KB
  preferences.py  15.95KB
  common.py  15.69KB
  datasets.py  14.10KB
  test_synthetic.py  13.29KB
  gridworld_reward_heatmap.py  13.23KB
  plot_heatmap.py  13.19KB
  point_mass.py  13.09KB
  epic_sample.py  12.87KB
  npec.py  12.02KB
  plot_gridworld_heatmap.py  9.35KB
  erc.py  9.33KB
  rl_common.py  8.71KB
  common_config.py  7.90KB
  npec.py  7.80KB
  results.py  7.76KB
  train_experts.py  7.75KB
  test_tabular.py  7.62KB
  point_mass.py  7.15KB
  rollout_return.py  6.67KB
  heatmaps.py  6.41KB
  serialize.py  6.12KB
  test_scripts.py  6.10KB
  test_epic_sample.py  6.10KB
  plot_pm_reward.py  5.95KB
  comparisons.py  5.81KB
  monte_carlo.py  5.23KB
  stylesheets.py  5.20KB
  mixture.py  4.92KB
  plot_gridworld_reward.py  4.90KB
  train_preferences.py  4.84KB
  train_regress.py  4.38KB
  script_utils.py  4.04KB
  transformations.py  4.00KB
  util.py  3.95KB
  regress_utils.py  3.70KB
  common.py  3.61KB
  reward_masks.py  3.48KB
  test_comparisons.py  3.46KB
  test_policies.py  3.34KB
  env_rewards.py  3.33KB
  __init__.py  2.35KB
  setup.py  2.31KB
  gridworld_rewards.py  2.10KB
  test_envs.py  2.04KB
  aggregated.py  1.96KB
  test_util.py  1.60KB
  train_adversarial.py  1.48KB
  visualize.py  1.34KB
  __init__.py  1.26KB
  env_rewards.py  1.24KB
  conftest.py  1.01KB
  eval_policy.py  959B
  __init__.py  804B
  __init__.py  657B
  __init__.py  644B
  __init__.py  638B
  __init__.py  634B
  __init__.py  631B
  __init__.py  629B
  __init__.py  629B
  __init__.py  613B
  __init__.py  606B
  version.py  103B
  __init__.py  0B
  __init__.py  0B
  pylintrc  1.14KB
  transfer_point_maze.sh  3.83KB
  transfer.sh  2.04KB
  common.sh  1.67KB
  hyper_sweep.sh  1.50KB
  visualize_pm_reward.sh  1.49KB
  train_preferences.sh  1.46KB
  train_regress.sh  1.43KB
  doubleblind.sh  1.28KB
  transfer_point_maze_checkpoints.sh  1.23KB
  launch_docker.sh  1.23KB
  train_irl.sh  1.21KB
  greedy_pm_hardcoded.sh  1.20KB
  ......

Too many files; not all are shown.
