keras实现REINFORCE算法强化学习

上传者: kkatnv | 上传时间: 2021-08-10 23:56:31 | 文件大小: 6.48MB | 文件类型: GZ
keras实现REINFORCE算法强化学习: # Policy Gradient Minimal implementation of Stochastic Policy Gradient Algorithm in Keras ## Pong Agent ![pg](./assets/pg.gif) This PG agent seems to get more frequent wins after about 8000 episodes. Below is the score graph.

文件下载

资源详情

[{"title":"( 56 个子文件 6.48MB ) keras实现REINFORCE算法强化学习","children":[{"title":"policy-gradient","children":[{"title":"pg.py <span style='color:#111;'> 3.76KB </span>","children":null,"spread":false},{"title":"LICENSE <span style='color:#111;'> 1.04KB </span>","children":null,"spread":false},{"title":"assets","children":[{"title":"pg.gif <span style='color:#111;'> 1.81MB </span>","children":null,"spread":false},{"title":"score.png <span style='color:#111;'> 13.18KB </span>","children":null,"spread":false}],"spread":true},{"title":"README.md <span style='color:#111;'> 262B </span>","children":null,"spread":false},{"title":".git","children":[{"title":"logs","children":[{"title":"HEAD <span style='color:#111;'> 201B </span>","children":null,"spread":false},{"title":"refs","children":[{"title":"heads","children":[{"title":"master <span style='color:#111;'> 201B </span>","children":null,"spread":false}],"spread":true},{"title":"remotes","children":[{"title":"origin","children":[{"title":"HEAD <span style='color:#111;'> 201B </span>","children":null,"spread":false}],"spread":true}],"spread":true}],"spread":true}],"spread":true},{"title":"packed-refs <span style='color:#111;'> 107B </span>","children":null,"spread":false},{"title":"info","children":[{"title":"exclude <span style='color:#111;'> 240B </span>","children":null,"spread":false}],"spread":true},{"title":"index <span style='color:#111;'> 624B </span>","children":null,"spread":false},{"title":"objects","children":[{"title":"pack","children":null,"spread":false},{"title":"ff","children":[{"title":"471c2997bf5e63bf17ff6119080e38916d81d4 <span style='color:#111;'> 156B </span>","children":null,"spread":false}],"spread":true},{"title":"03","children":[{"title":"2e1793c19e1aa36e3cec2cc0d17a5ed1dc6b1b <span style='color:#111;'> 1.45KB </span>","children":null,"spread":false}],"spread":true},{"title":"67","children":[{"title":"f01e2c6f7b6e0d122debfda4d6fae8cf7f39a0 <span style='color:#111;'> 1.04MB </span>","children":null,"spread":false}],"spread":false},{"title":"5c","children":[{"title":"61d8aff0d62095db0c61fbe6b90f3580e7e99e <span style='color:#111;'> 652B </span>","children":null,"spread":false}],"spread":false},{"title":"df","children":[{"title":"b62f87cf904442824ad1cc93327e86f75eb519 <span style='color:#111;'> 11.55KB </span>","children":null,"spread":false}],"spread":false},{"title":"info","children":null,"spread":false},{"title":"fb","children":[{"title":"eaab4e4e16a2540cc1e568f96eb36c257b436b <span style='color:#111;'> 97B </span>","children":null,"spread":false}],"spread":false},{"title":"00","children":[{"title":"b702c583208c03bda5cf28f5b4ecb2e523faf2 <span style='color:#111;'> 180B </span>","children":null,"spread":false}],"spread":false},{"title":"9a","children":[{"title":"f7275d91756bcf89c29ded2809e88e63eeab0c <span style='color:#111;'> 212B </span>","children":null,"spread":false},{"title":"1ac01dc0914116eb00ed07d69d23b05f5b08fa <span style='color:#111;'> 1.65MB </span>","children":null,"spread":false}],"spread":false},{"title":"61","children":[{"title":"76f4d30c5de266b2ba0d5d9dbd404d349e752a <span style='color:#111;'> 202B </span>","children":null,"spread":false}],"spread":false},{"title":"f3","children":[{"title":"e5a5acc487bb7735d03376b9eb2af6b0e54a4a <span style='color:#111;'> 1.04MB </span>","children":null,"spread":false}],"spread":false},{"title":"5e","children":[{"title":"baf423cc1def6e8807976ce08e470880d0eb28 <span style='color:#111;'> 179B </span>","children":null,"spread":false}],"spread":false},{"title":"81","children":[{"title":"450470a57fe96cbc143e31458ecff6c269a35d <span style='color:#111;'> 180B </span>","children":null,"spread":false}],"spread":false},{"title":"5d","children":[{"title":"41fd99009bbefd7119b6aef63ad26b0c12c159 <span style='color:#111;'> 127B </span>","children":null,"spread":false}],"spread":false},{"title":"b6","children":[{"title":"977145d79b62a9118f2760969f47b1ba39d22a <span style='color:#111;'> 1.40KB </span>","children":null,"spread":false}],"spread":false},{"title":"72","children":[{"title":"364f99fe4bf8d5262df3b19b33102aeaa791e5 <span style='color:#111;'> 615B </span>","children":null,"spread":false}],"spread":false},{"title":"76","children":[{"title":"4ea3297aadb5c5b60582eb7dab7c4b84138ead <span style='color:#111;'> 1.56KB </span>","children":null,"spread":false}],"spread":false},{"title":"c9","children":[{"title":"d46b8a5680cfde8f61f96662b44d2c2753abd1 <span style='color:#111;'> 1.45KB </span>","children":null,"spread":false}],"spread":false},{"title":"6e","children":[{"title":"0f2a6f7552e123ff21471f25a9e0295bc94e90 <span style='color:#111;'> 543.71KB </span>","children":null,"spread":false}],"spread":false},{"title":"84","children":[{"title":"c7b93b8e9d5d7164f42b8183166df6017a0b8a <span style='color:#111;'> 84B </span>","children":null,"spread":false}],"spread":false},{"title":"6b","children":[{"title":"ba0ed8a7ee7a9e7505e0e52d4c643c138455d3 <span style='color:#111;'> 180B </span>","children":null,"spread":false}],"spread":false},{"title":"4f","children":[{"title":"587e5735849d2e4a006ea804a736ac107d7d62 <span style='color:#111;'> 165B </span>","children":null,"spread":false}],"spread":false},{"title":"e2","children":[{"title":"e46b9063da1b2cd101cb5787cb97278813434a <span style='color:#111;'> 121B </span>","children":null,"spread":false},{"title":"de7f9d51e8f5eb8cbc8fa4d97d655411089139 <span style='color:#111;'> 158B </span>","children":null,"spread":false}],"spread":false},{"title":"01","children":[{"title":"1d140702420da646e2d3ad08a9bcce14d4808b <span style='color:#111;'> 150B </span>","children":null,"spread":false}],"spread":false},{"title":"05","children":[{"title":"5e531d422b6c299f1922b2e0cda16f12c3ab80 <span style='color:#111;'> 171B </span>","children":null,"spread":false}],"spread":false},{"title":"3f","children":[{"title":"3fdcb4de7c6d0c51604bf4f4a49f263ebddaad <span style='color:#111;'> 1.55KB </span>","children":null,"spread":false}],"spread":false},{"title":"42","children":[{"title":"528df9dbb9c2ecba5870f66816bf68d39c95aa <span style='color:#111;'> 162B </span>","children":null,"spread":false}],"spread":false},{"title":"f0","children":[{"title":"c343c4938b86fd68a5cb14fdcd0d24e8b9ae29 <span style='color:#111;'> 158B </span>","children":null,"spread":false}],"spread":false}],"spread":false},{"title":"HEAD <span style='color:#111;'> 23B </span>","children":null,"spread":false},{"title":"config <span style='color:#111;'> 264B </span>","children":null,"spread":false},{"title":"refs","children":[{"title":"heads","children":[{"title":"master <span style='color:#111;'> 41B </span>","children":null,"spread":false}],"spread":true},{"title":"tags","children":null,"spread":false},{"title":"remotes","children":[{"title":"origin","children":[{"title":"HEAD <span style='color:#111;'> 32B </span>","children":null,"spread":false}],"spread":false}],"spread":false}],"spread":true},{"title":"branches","children":null,"spread":false},{"title":"hooks","children":[{"title":"pre-applypatch.sample <span style='color:#111;'> 424B </span>","children":null,"spread":false},{"title":"pre-push.sample <span style='color:#111;'> 1.32KB </span>","children":null,"spread":false},{"title":"commit-msg.sample <span style='color:#111;'> 896B </span>","children":null,"spread":false},{"title":"pre-commit.sample <span style='color:#111;'> 1.60KB </span>","children":null,"spread":false},{"title":"applypatch-msg.sample <span style='color:#111;'> 478B </span>","children":null,"spread":false},{"title":"prepare-commit-msg.sample <span style='color:#111;'> 1.21KB </span>","children":null,"spread":false},{"title":"update.sample <span style='color:#111;'> 3.53KB </span>","children":null,"spread":false},{"title":"post-update.sample <span style='color:#111;'> 189B </span>","children":null,"spread":false},{"title":"pre-rebase.sample <span style='color:#111;'> 4.78KB </span>","children":null,"spread":false}],"spread":false},{"title":"description <span style='color:#111;'> 73B </span>","children":null,"spread":false}],"spread":false},{"title":".gitignore <span style='color:#111;'> 1.02KB </span>","children":null,"spread":false},{"title":"pong.h5 <span style='color:#111;'> 596.13KB </span>","children":null,"spread":false}],"spread":true}],"spread":true}]

评论信息

免责申明

【只为小站】的资源来自网友分享,仅供学习研究,请务必在下载后24小时内给予删除,不得用于其他任何用途,否则后果自负。基于互联网的特殊性,【只为小站】 无法对用户传输的作品、信息、内容的权属或合法性、合规性、真实性、科学性、完整权、有效性等进行实质审查;无论 【只为小站】 经营者是否已进行审查,用户均应自行承担因其传输的作品、信息、内容而可能或已经产生的侵权或权属纠纷等法律责任。
本站所有资源不代表本站的观点或立场,基于网友分享,根据中国法律《信息网络传播权保护条例》第二十二条之规定,若资源存在侵权或相关问题请联系本站客服人员,zhiweidada#qq.com,请把#换成@,本站将给予最大的支持与配合,做到及时反馈和处理。关于更多版权及免责申明参见 版权及免责申明