[{"title":"( 42 个子文件 391KB ) DeepReinforcementLearning:深度RL实施。 在pytorch中实现的DQN,SAC,DDPG,TD3,PPO和VPG。 经过测试的环境:LunarLander-v2和Pendulum-v0-源码","children":[{"title":"DeepReinforcementLearning-main","children":[{"title":"td3.py <span style='color:#111;'> 4.28KB </span>","children":null,"spread":false},{"title":".ipynb_checkpoints","children":[{"title":"test_and_intial_Experimentation-checkpoint.ipynb <span style='color:#111;'> 72B </span>","children":null,"spread":false},{"title":"Policy Gradient Methods-checkpoint.ipynb <span style='color:#111;'> 13.08KB </span>","children":null,"spread":false}],"spread":true},{"title":"RLUtils","children":[{"title":"__init__.py <span style='color:#111;'> 21B </span>","children":null,"spread":false},{"title":"utils.py <span style='color:#111;'> 3.37KB </span>","children":null,"spread":false},{"title":"__pycache__","children":[{"title":"utils.cpython-37.pyc <span style='color:#111;'> 3.93KB </span>","children":null,"spread":false},{"title":"__init__.cpython-37.pyc <span style='color:#111;'> 179B </span>","children":null,"spread":false}],"spread":true}],"spread":true},{"title":"SoftActorCritic.py <span style='color:#111;'> 3.47KB </span>","children":null,"spread":false},{"title":"Policy Gradient Methods.ipynb <span style='color:#111;'> 13.07KB </span>","children":null,"spread":false},{"title":"Readme.md <span style='color:#111;'> 2.43KB </span>","children":null,"spread":false},{"title":".idea","children":[{"title":".gitignore <span style='color:#111;'> 47B </span>","children":null,"spread":false},{"title":"misc.xml <span style='color:#111;'> 292B </span>","children":null,"spread":false},{"title":"vcs.xml <span style='color:#111;'> 180B </span>","children":null,"spread":false},{"title":"inspectionProfiles","children":[{"title":"Project_Default.xml <span style='color:#111;'> 659B </span>","children":null,"spread":false},{"title":"profiles_settings.xml <span style='color:#111;'> 174B </span>","children":null,"spread":false}],"spread":true},{"title":"modules.xml <span style='color:#111;'> 294B </span>","children":null,"spread":false},{"title":"ReinforcementLearning.iml <span style='color:#111;'> 317B </span>","children":null,"spread":false}],"spread":true},{"title":"ppo_clip.py <span style='color:#111;'> 4.45KB </span>","children":null,"spread":false},{"title":"ddpg.py <span style='color:#111;'> 9.33KB </span>","children":null,"spread":false},{"title":"agents","children":[{"title":"__init__.py <span style='color:#111;'> 57B </span>","children":null,"spread":false},{"title":"agent.py <span style='color:#111;'> 125B </span>","children":null,"spread":false},{"title":"__pycache__","children":[{"title":"__init__.cpython-37.pyc <span style='color:#111;'> 225B </span>","children":null,"spread":false},{"title":"agent.cpython-37.pyc <span style='color:#111;'> 560B </span>","children":null,"spread":false}],"spread":true},{"title":"ActorCriticAgents","children":[{"title":"__init__.py <span style='color:#111;'> 63B </span>","children":null,"spread":false},{"title":"PPO_clip_agent.py <span style='color:#111;'> 11.10KB </span>","children":null,"spread":false},{"title":"td3_agent.py <span style='color:#111;'> 6.62KB </span>","children":null,"spread":false},{"title":"soft_Actor_critic_Agent.py <span style='color:#111;'> 7.01KB </span>","children":null,"spread":false},{"title":"__pycache__","children":[{"title":"soft_Actor_critic_Agent.cpython-37.pyc <span style='color:#111;'> 5.97KB </span>","children":null,"spread":false},{"title":"td3_agent.cpython-37.pyc <span style='color:#111;'> 5.67KB </span>","children":null,"spread":false},{"title":"PPO_clip_agent.cpython-37.pyc <span style='color:#111;'> 8.49KB </span>","children":null,"spread":false},{"title":"__init__.cpython-37.pyc <span style='color:#111;'> 235B </span>","children":null,"spread":false}],"spread":false}],"spread":false},{"title":"MLPAgent.py <span style='color:#111;'> 0B </span>","children":null,"spread":false}],"spread":true},{"title":"figures","children":[{"title":"PPO_MountainCarContinuous-v0_rewards.png <span style='color:#111;'> 22.09KB </span>","children":null,"spread":false},{"title":"DQN_Lunar_lander_losses.png <span style='color:#111;'> 38.10KB </span>","children":null,"spread":false},{"title":"VPG_LunarLander-v2_rewards.png <span style='color:#111;'> 37.66KB </span>","children":null,"spread":false},{"title":"SAC_Pendulum-v0_rewards.png <span style='color:#111;'> 50.90KB </span>","children":null,"spread":false},{"title":"DQN_Lunar_lander_rewards.png <span style='color:#111;'> 48.31KB </span>","children":null,"spread":false},{"title":"TD3_Pendulum_rewards.png <span style='color:#111;'> 61.74KB </span>","children":null,"spread":false},{"title":"DDPG_Pendulum-v0_rewards.png <span style='color:#111;'> 42.82KB </span>","children":null,"spread":false},{"title":"PPO_Pendulum-v0_rewards.png <span style='color:#111;'> 57.13KB </span>","children":null,"spread":false}],"spread":true},{"title":"vanilla_policy_gradient.py <span style='color:#111;'> 8.05KB </span>","children":null,"spread":false},{"title":"DQN.py <span style='color:#111;'> 19.12KB </span>","children":null,"spread":false}],"spread":false}],"spread":true}]