带有火炬的深度增强学习:DQN,AC,ACER,A2C,A3C,PG,DDPG,TRPO,PPO,SAC,TD3和PyTorch实施...-源码

上传者: 42131443 | 上传时间: 2021-06-09 21:34:59 | 文件大小: 5.82MB | 文件类型: ZIP
状态:活动(在活动开发中,可能会发生重大更改) 该存储库将实现经典且最新的深度强化学习算法。 该存储库的目的是为人们提供清晰的pytorch代码,以供他们学习深度强化学习算法。 将来,将添加更多最先进的算法,并且还将保留现有代码。 要求 python <= 3.6 张量板 体育馆> = 0.10 火炬> = 0.4 请注意,tensorflow不支持python3.7 安装 pip install -r requirements.txt 如果失败: 安装健身房 pip install gym 安装pytorch please go to official webisite to install it: https://pytorch.org/ Recommend use Anaconda Virtual Environment to manage your packages 安装tensorboardX pip install tensorboardX pip install tensorflow==1.12 测试 cd Char10\ TD3/ python TD3

文件下载

资源详情

[{"title":"( 60 个子文件 5.82MB ) 带有火炬的深度增强学习:DQN,AC,ACER,A2C,A3C,PG,DDPG,TRPO,PPO,SAC,TD3和PyTorch实施...-源码","children":[{"title":"Deep-reinforcement-learning-with-pytorch-master","children":[{"title":"figures","children":[{"title":"test.png <span style='color:#111;'> 11.38KB </span>","children":null,"spread":false}],"spread":true},{"title":"Char02 Policy Gradient","children":[{"title":"Run_Model.py <span style='color:#111;'> 2.32KB </span>","children":null,"spread":false},{"title":"pytorch_MountainCar-v0.py <span style='color:#111;'> 2.91KB </span>","children":null,"spread":false},{"title":"REINFORCE.py <span style='color:#111;'> 3.05KB </span>","children":null,"spread":false},{"title":"naive-policy-gradient.py <span style='color:#111;'> 3.78KB </span>","children":null,"spread":false},{"title":"PolicyGradient.py <span style='color:#111;'> 3.09KB </span>","children":null,"spread":false},{"title":"REINFORCE_with_Baseline.py <span style='color:#111;'> 3.96KB </span>","children":null,"spread":false}],"spread":true},{"title":"Char03 Actor-Critic","children":[{"title":"AC_MountainCar-v0.py <span style='color:#111;'> 3.60KB </span>","children":null,"spread":false},{"title":"AC_CartPole-v0.py <span style='color:#111;'> 3.26KB </span>","children":null,"spread":false}],"spread":true},{"title":"Char04 A2C","children":[{"title":"multiprocessing_env.py <span style='color:#111;'> 4.78KB </span>","children":null,"spread":false},{"title":"A2C.py <span style='color:#111;'> 3.85KB </span>","children":null,"spread":false}],"spread":true},{"title":"Char10 TD3","children":[{"title":"TD3_Pendulum-v0.png <span style='color:#111;'> 47.53KB </span>","children":null,"spread":false},{"title":"TD3_BipedalWalker-v2.py <span style='color:#111;'> 12.90KB </span>","children":null,"spread":false},{"title":"expTD3_BipedalWalker-v2.pyBipedalWalker-v2.","children":[{"title":"critic_1_target.pth <span style='color:#111;'> 517.48KB </span>","children":null,"spread":false},{"title":"critic_2.pth <span style='color:#111;'> 517.48KB </span>","children":null,"spread":false},{"title":"actor.pth <span style='color:#111;'> 514.76KB </span>","children":null,"spread":false},{"title":"critic_1.pth <span style='color:#111;'> 517.48KB </span>","children":null,"spread":false},{"title":"critic_2_target.pth <span style='color:#111;'> 517.48KB </span>","children":null,"spread":false},{"title":"actor_target.pth <span style='color:#111;'> 514.76KB </span>","children":null,"spread":false}],"spread":true},{"title":"expTD3.pyPendulum-v0.","children":[{"title":"critic_1_target.pth <span style='color:#111;'> 479.98KB </span>","children":null,"spread":false},{"title":"critic_2.pth <span style='color:#111;'> 479.98KB </span>","children":null,"spread":false},{"title":"actor.pth <span style='color:#111;'> 478.42KB </span>","children":null,"spread":false},{"title":"critic_1.pth <span style='color:#111;'> 479.98KB </span>","children":null,"spread":false},{"title":"critic_2_target.pth <span style='color:#111;'> 479.98KB </span>","children":null,"spread":false},{"title":"actor_target.pth <span style='color:#111;'> 478.42KB </span>","children":null,"spread":false}],"spread":true},{"title":"Episode_reward_TD3_BipedakWalker.png <span style='color:#111;'> 89.23KB </span>","children":null,"spread":false},{"title":"TD3.py <span style='color:#111;'> 12.71KB </span>","children":null,"spread":false}],"spread":true},{"title":"Char01 DQN","children":[{"title":"DQN","children":[{"title":"pic","children":[{"title":"finish_episode.jpg <span style='color:#111;'> 34.84KB </span>","children":null,"spread":false},{"title":"value_loss.jpg <span style='color:#111;'> 35.01KB </span>","children":null,"spread":false},{"title":"readme.md <span style='color:#111;'> 7B </span>","children":null,"spread":false}],"spread":true}],"spread":true},{"title":"DQN_CartPole-v0.py <span style='color:#111;'> 4.09KB </span>","children":null,"spread":false},{"title":"DQN.py <span style='color:#111;'> 4.77KB </span>","children":null,"spread":false},{"title":"DQN_MountainCar-v0.py <span style='color:#111;'> 4.09KB </span>","children":null,"spread":false},{"title":"DQN_mountain_car_v1.py <span style='color:#111;'> 4.21KB </span>","children":null,"spread":false},{"title":"readme.md <span style='color:#111;'> 997B </span>","children":null,"spread":false},{"title":"naiveDQN.py <span style='color:#111;'> 4.20KB </span>","children":null,"spread":false}],"spread":true},{"title":"requirements.txt <span style='color:#111;'> 70B </span>","children":null,"spread":false},{"title":"Char07 PPO","children":[{"title":"PPO_CartPole_v0.py <span style='color:#111;'> 6.11KB </span>","children":null,"spread":false},{"title":"PPO2.py <span style='color:#111;'> 6.64KB </span>","children":null,"spread":false},{"title":"PPO_pendulum.py <span style='color:#111;'> 6.26KB </span>","children":null,"spread":false},{"title":"PPO_MountainCar-v0.py <span style='color:#111;'> 6.14KB </span>","children":null,"spread":false},{"title":"readme.md <span style='color:#111;'> 185B </span>","children":null,"spread":false}],"spread":true},{"title":"LICENSE <span style='color:#111;'> 1.04KB </span>","children":null,"spread":false},{"title":"Char00 Conventional Algorithms","children":[{"title":"gridworld.py <span style='color:#111;'> 6.64KB </span>","children":null,"spread":false},{"title":"Sarsa.py <span style='color:#111;'> 2.89KB </span>","children":null,"spread":false},{"title":"Q-learning.py <span style='color:#111;'> 2.70KB </span>","children":null,"spread":false}],"spread":true},{"title":"Char08 ACER","children":[{"title":"readme.md <span style='color:#111;'> 44B </span>","children":null,"spread":false}],"spread":true},{"title":"Char09 SAC","children":[{"title":"SAC_ep_r_curve.png <span style='color:#111;'> 62.16KB </span>","children":null,"spread":false},{"title":"SAC_dual_Q_net.py <span style='color:#111;'> 11.69KB </span>","children":null,"spread":false},{"title":"SAC.py <span style='color:#111;'> 11.00KB </span>","children":null,"spread":false},{"title":"test_agent.py <span style='color:#111;'> 12.44KB </span>","children":null,"spread":false},{"title":"SAC_BipedalWalker-v2.py <span style='color:#111;'> 12.48KB </span>","children":null,"spread":false}],"spread":true},{"title":"Char05 DDPG","children":[{"title":"DDPG_exp.jpg <span style='color:#111;'> 61.21KB </span>","children":null,"spread":false},{"title":"README.md <span style='color:#111;'> 420B </span>","children":null,"spread":false},{"title":"DDPG.py <span style='color:#111;'> 10.02KB </span>","children":null,"spread":false}],"spread":true},{"title":"readme.md <span style='color:#111;'> 8.26KB </span>","children":null,"spread":false},{"title":"More","children":[{"title":"Application in real world","children":[{"title":"README.md <span style='color:#111;'> 274B </span>","children":null,"spread":false}],"spread":false},{"title":"MARL","children":[{"title":"README.md <span style='color:#111;'> 581B </span>","children":null,"spread":false}],"spread":false},{"title":"readme.md <span style='color:#111;'> 53B </span>","children":null,"spread":false},{"title":"plot.py <span style='color:#111;'> 1.40KB </span>","children":null,"spread":false}],"spread":false}],"spread":false}],"spread":true}]

评论信息

免责申明

【只为小站】的资源来自网友分享,仅供学习研究,请务必在下载后24小时内给予删除,不得用于其他任何用途,否则后果自负。基于互联网的特殊性,【只为小站】 无法对用户传输的作品、信息、内容的权属或合法性、合规性、真实性、科学性、完整权、有效性等进行实质审查;无论 【只为小站】 经营者是否已进行审查,用户均应自行承担因其传输的作品、信息、内容而可能或已经产生的侵权或权属纠纷等法律责任。
本站所有资源不代表本站的观点或立场,基于网友分享,根据中国法律《信息网络传播权保护条例》第二十二条之规定,若资源存在侵权或相关问题请联系本站客服人员,zhiweidada#qq.com,请把#换成@,本站将给予最大的支持与配合,做到及时反馈和处理。关于更多版权及免责申明参见 版权及免责申明