[{"title":"(157 files, 8.53 MB) PyTorch implementation of Advantage Actor-Critic (A2C), Proximal Policy Optimization (PPO), the scalable trust-region method for deep reinforcement learning using Kronecker-factored approximation (ACKTR), and Generative Adversarial Imitation Learning (GAIL). - Python development","children":[{"title":"0.monitor.csv <span style='color:#111;'> 75.61KB </span>","children":null,"spread":false},{"title":"0.monitor.csv <span style='color:#111;'> 93.61KB </span>","children":null,"spread":false},{"title":"0.monitor.csv <span style='color:#111;'> 111.19KB </span>","children":null,"spread":false},{"title":"0.monitor.csv <span style='color:#111;'> 103.93KB </span>","children":null,"spread":false},{"title":"0.monitor.csv <span style='color:#111;'> 73.83KB </span>","children":null,"spread":false},{"title":"......","children":null,"spread":false},{"title":"<span style='color:steelblue;'>Too many files; not all are shown</span>","children":null,"spread":false}],"spread":true}]