pytorch实现的在线强化学习11种常见算法代码

上传者: 45616285 | 上传时间: 2024-03-05 21:19:26 | 文件大小: 6.68MB | 文件类型: ZIP
这个资源是一个包含了使用 PyTorch 实现的11种常见在线强化学习算法的代码集合。每个算法都有独立的文件夹,可以单独运行并测试相应算法在不同环境中的性能。以下是资源中包含的算法: 1、Q-learning 2、SARSA 3、DQN (Deep Q-Network) 4、Double-DQN 5、Dueling-DQN 6、PG (Policy Gradient) 7、AC (Actor-Critic) 8、PPO (Proximal Policy Optimization) 9、DDPG (Deep Deterministic Policy Gradient) 10、TD3 (Twin Delayed DDPG) 11、SAC (Soft Actor-Critic)

文件下载

资源详情

[{"title":"( 110 个子文件 6.68MB ) pytorch实现的在线强化学习11种常见算法代码","children":[{"title":"ac_actor_best <span style='color:#111;'> 2.50KB </span>","children":null,"spread":false},{"title":"ac_actor_optimizer_best <span style='color:#111;'> 5.24KB </span>","children":null,"spread":false},{"title":"ac_critic_best <span style='color:#111;'> 2.45KB </span>","children":null,"spread":false},{"title":"ac_critic_optimizer_best <span style='color:#111;'> 5.13KB </span>","children":null,"spread":false},{"title":"ddpg_actor_30 <span style='color:#111;'> 69.57KB </span>","children":null,"spread":false},{"title":"ddpg_actor_60 <span style='color:#111;'> 69.57KB </span>","children":null,"spread":false},{"title":"ddpg_actor_optimizer_30 <span style='color:#111;'> 139.88KB </span>","children":null,"spread":false},{"title":"ddpg_actor_optimizer_60 <span style='color:#111;'> 139.88KB </span>","children":null,"spread":false},{"title":"ddpg_actor_target_30 <span style='color:#111;'> 69.63KB </span>","children":null,"spread":false},{"title":"ddpg_actor_target_60 <span style='color:#111;'> 69.63KB </span>","children":null,"spread":false},{"title":"ddpg_critic_30 <span style='color:#111;'> 70.08KB </span>","children":null,"spread":false},{"title":"ddpg_critic_60 <span style='color:#111;'> 70.08KB </span>","children":null,"spread":false},{"title":"ddpg_critic_optimizer_30 <span style='color:#111;'> 140.90KB </span>","children":null,"spread":false},{"title":"ddpg_critic_optimizer_60 <span style='color:#111;'> 140.90KB </span>","children":null,"spread":false},{"title":"ddpg_critic_target_30 <span style='color:#111;'> 70.14KB </span>","children":null,"spread":false},{"title":"ddpg_critic_target_60 <span style='color:#111;'> 70.14KB </span>","children":null,"spread":false},{"title":"ddqn_best <span style='color:#111;'> 7.53KB </span>","children":null,"spread":false},{"title":"ddqn_optimizer_best <span style='color:#111;'> 15.73KB </span>","children":null,"spread":false},{"title":"dqn_best <span style='color:#111;'> 7.52KB </span>","children":null,"spread":false},{"title":"dqn_optimizer_best <span style='color:#111;'> 15.71KB </span>","children":null,"spread":false},{"title":"dueling_dqn_best <span style='color:#111;'> 8.29KB </span>","children":null,"spread":false},{"title":"dueling_dqn_optimizer_best <span style='color:#111;'> 17.72KB </span>","children":null,"spread":false},{"title":".gitignore <span style='color:#111;'> 184B </span>","children":null,"spread":false},{"title":"online-RL.iml <span style='color:#111;'> 316B </span>","children":null,"spread":false},{"title":"README.md <span style='color:#111;'> 2.14KB </span>","children":null,"spread":false},{"title":"pg_best <span style='color:#111;'> 3.70KB </span>","children":null,"spread":false},{"title":"pg_optimizer_best <span style='color:#111;'> 8.68KB </span>","children":null,"spread":false},{"title":"ppo_actor_100 <span style='color:#111;'> 20.88KB </span>","children":null,"spread":false},{"title":"ppo_actor_200 <span style='color:#111;'> 20.88KB </span>","children":null,"spread":false},{"title":"ppo_actor_optimizer_100 <span style='color:#111;'> 42.89KB </span>","children":null,"spread":false},{"title":"ppo_actor_optimizer_200 <span style='color:#111;'> 42.89KB </span>","children":null,"spread":false},{"title":"ppo_critic_100 <span style='color:#111;'> 20.08KB </span>","children":null,"spread":false},{"title":"ppo_critic_200 <span style='color:#111;'> 20.08KB </span>","children":null,"spread":false},{"title":"ppo_critic_optimizer_100 <span style='color:#111;'> 40.90KB </span>","children":null,"spread":false},{"title":"ppo_critic_optimizer_200 <span style='color:#111;'> 40.90KB </span>","children":null,"spread":false},{"title":"SAC.py <span style='color:#111;'> 6.53KB </span>","children":null,"spread":false},{"title":"TD3.py <span style='color:#111;'> 5.78KB </span>","children":null,"spread":false},{"title":"PPO.py <span style='color:#111;'> 5.37KB </span>","children":null,"spread":false},{"title":"DDPG.py <span style='color:#111;'> 4.93KB </span>","children":null,"spread":false},{"title":"AC.py <span style='color:#111;'> 3.62KB </span>","children":null,"spread":false},{"title":"Dueling_DQN.py <span style='color:#111;'> 2.99KB </span>","children":null,"spread":false},{"title":"DDQN.py <span style='color:#111;'> 2.96KB </span>","children":null,"spread":false},{"title":"utils.py <span style='color:#111;'> 2.89KB </span>","children":null,"spread":false},{"title":"utils.py <span style='color:#111;'> 2.89KB </span>","children":null,"spread":false},{"title":"utils.py <span style='color:#111;'> 2.89KB </span>","children":null,"spread":false},{"title":"utils.py <span style='color:#111;'> 2.89KB </span>","children":null,"spread":false},{"title":"utils.py <span style='color:#111;'> 2.89KB </span>","children":null,"spread":false},{"title":"utils.py <span style='color:#111;'> 2.89KB </span>","children":null,"spread":false},{"title":"DQN.py <span style='color:#111;'> 2.73KB </span>","children":null,"spread":false},{"title":"SAC--Pendulum.py <span style='color:#111;'> 2.70KB </span>","children":null,"spread":false},{"title":"q_learning.py <span style='color:#111;'> 2.54KB </span>","children":null,"spread":false},{"title":"PG.py <span style='color:#111;'> 2.53KB </span>","children":null,"spread":false},{"title":"SARSA.py <span style='color:#111;'> 2.51KB </span>","children":null,"spread":false},{"title":"DDPG--Pendulum.py <span style='color:#111;'> 2.34KB </span>","children":null,"spread":false},{"title":"TD3--Pendulum.py <span style='color:#111;'> 2.33KB </span>","children":null,"spread":false},{"title":"Dueling_DQN--CartPole.py <span style='color:#111;'> 2.08KB </span>","children":null,"spread":false},{"title":"PPO--Pendulum.py <span style='color:#111;'> 2.06KB </span>","children":null,"spread":false},{"title":"DDQN--CartPole.py <span style='color:#111;'> 2.02KB </span>","children":null,"spread":false},{"title":"DQN--CartPole.py <span style='color:#111;'> 2.01KB </span>","children":null,"spread":false},{"title":"PG--CartPole.py <span style='color:#111;'> 1.83KB </span>","children":null,"spread":false},{"title":"AC--CartPole.py <span style='color:#111;'> 1.59KB </span>","children":null,"spread":false},{"title":"Pendulum.py <span style='color:#111;'> 669B </span>","children":null,"spread":false},{"title":"Pendulum.py <span style='color:#111;'> 668B </span>","children":null,"spread":false},{"title":"Pendulum.py <span style='color:#111;'> 664B </span>","children":null,"spread":false},{"title":"Pendulum.py <span style='color:#111;'> 664B </span>","children":null,"spread":false},{"title":"CartPole.py <span style='color:#111;'> 548B </span>","children":null,"spread":false},{"title":"CartPole.py <span style='color:#111;'> 518B </span>","children":null,"spread":false},{"title":"CartPole.py <span style='color:#111;'> 516B </span>","children":null,"spread":false},{"title":"CartPole.py <span style='color:#111;'> 511B </span>","children":null,"spread":false},{"title":"CartPole.py <span style='color:#111;'> 507B </span>","children":null,"spread":false},{"title":"SAC.cpython-310.pyc <span style='color:#111;'> 5.87KB </span>","children":null,"spread":false},{"title":"PPO.cpython-310.pyc <span style='color:#111;'> 4.81KB </span>","children":null,"spread":false},{"title":"TD3.cpython-310.pyc <span style='color:#111;'> 4.71KB </span>","children":null,"spread":false},{"title":"DDPG.cpython-310.pyc <span style='color:#111;'> 4.31KB </span>","children":null,"spread":false},{"title":"AC.cpython-310.pyc <span style='color:#111;'> 3.33KB </span>","children":null,"spread":false},{"title":"Dueling_DQN.cpython-310.pyc <span style='color:#111;'> 2.96KB </span>","children":null,"spread":false},{"title":"DDQN.cpython-310.pyc <span style='color:#111;'> 2.87KB </span>","children":null,"spread":false},{"title":"DQN.cpython-310.pyc <span style='color:#111;'> 2.79KB </span>","children":null,"spread":false},{"title":"PG.cpython-310.pyc <span style='color:#111;'> 2.54KB </span>","children":null,"spread":false},{"title":"utils.cpython-310.pyc <span style='color:#111;'> 2.28KB </span>","children":null,"spread":false},{"title":"utils.cpython-310.pyc <span style='color:#111;'> 2.28KB </span>","children":null,"spread":false},{"title":"utils.cpython-310.pyc <span style='color:#111;'> 2.27KB </span>","children":null,"spread":false},{"title":"utils.cpython-310.pyc <span style='color:#111;'> 2.27KB </span>","children":null,"spread":false},{"title":"utils.cpython-310.pyc <span style='color:#111;'> 2.27KB </span>","children":null,"spread":false},{"title":"utils.cpython-310.pyc <span style='color:#111;'> 2.27KB </span>","children":null,"spread":false},{"title":"sac_actor_30 <span style='color:#111;'> 266.12KB </span>","children":null,"spread":false},{"title":"sac_actor_60 <span style='color:#111;'> 266.12KB </span>","children":null,"spread":false},{"title":"sac_actor_optimizer_30 <span style='color:#111;'> 533.36KB </span>","children":null,"spread":false},{"title":"sac_actor_optimizer_60 <span style='color:#111;'> 533.36KB </span>","children":null,"spread":false},{"title":"sac_critic_30 <span style='color:#111;'> 530.21KB </span>","children":null,"spread":false},{"title":"sac_critic_60 <span style='color:#111;'> 530.21KB </span>","children":null,"spread":false},{"title":"sac_critic_optimizer_30 <span style='color:#111;'> 1.04MB </span>","children":null,"spread":false},{"title":"sac_critic_optimizer_60 <span style='color:#111;'> 1.04MB </span>","children":null,"spread":false},{"title":"td3_actor_30 <span style='color:#111;'> 69.56KB </span>","children":null,"spread":false},{"title":"td3_actor_60 <span style='color:#111;'> 69.56KB </span>","children":null,"spread":false},{"title":"td3_actor_optimizer_30 <span style='color:#111;'> 139.85KB </span>","children":null,"spread":false},{"title":"td3_actor_optimizer_60 <span style='color:#111;'> 139.85KB </span>","children":null,"spread":false},{"title":"td3_actor_target_30 <span style='color:#111;'> 69.63KB </span>","children":null,"spread":false},{"title":"td3_actor_target_60 <span style='color:#111;'> 69.63KB </span>","children":null,"spread":false},{"title":"td3_critic_30 <span style='color:#111;'> 139.21KB </span>","children":null,"spread":false},{"title":"......","children":null,"spread":false},{"title":"<span style='color:steelblue;'>文件过多,未全部展示</span>","children":null,"spread":false}],"spread":true}]

评论信息

免责申明

【只为小站】的资源来自网友分享,仅供学习研究,请务必在下载后24小时内给予删除,不得用于其他任何用途,否则后果自负。基于互联网的特殊性,【只为小站】 无法对用户传输的作品、信息、内容的权属或合法性、合规性、真实性、科学性、完整权、有效性等进行实质审查;无论 【只为小站】 经营者是否已进行审查,用户均应自行承担因其传输的作品、信息、内容而可能或已经产生的侵权或权属纠纷等法律责任。
本站所有资源不代表本站的观点或立场,基于网友分享,根据中国法律《信息网络传播权保护条例》第二十二条之规定,若资源存在侵权或相关问题请联系本站客服人员,zhiweidada#qq.com,请把#换成@,本站将给予最大的支持与配合,做到及时反馈和处理。关于更多版权及免责申明参见 版权及免责申明