A collection of MATLAB reinforcement learning programs for the grid maze problem, solved with the Q-learning, Sarsa, and Sarsa-Lambda algorithms.
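For orientation, the three algorithms named here differ mainly in how they form the update target. The following is a minimal Python sketch of the standard tabular update rules, not the MATLAB code from the collection. The function names, the list-of-lists table layout, and the default values of `alpha`, `gamma`, and `lam` are all assumptions made for illustration, and terminal-state handling is left out.

```python
def q_learning_update(Q, s, a, r, s_next, alpha=0.1, gamma=0.9):
    """Off-policy update: bootstrap from the best action value in s_next."""
    target = r + gamma * max(Q[s_next])
    Q[s][a] += alpha * (target - Q[s][a])


def sarsa_update(Q, s, a, r, s_next, a_next, alpha=0.1, gamma=0.9):
    """On-policy update: bootstrap from the action actually chosen in s_next."""
    target = r + gamma * Q[s_next][a_next]
    Q[s][a] += alpha * (target - Q[s][a])


def sarsa_lambda_update(Q, E, s, a, r, s_next, a_next,
                        alpha=0.1, gamma=0.9, lam=0.8):
    """Sarsa with accumulating eligibility traces E (same shape as Q)."""
    delta = r + gamma * Q[s_next][a_next] - Q[s][a]
    E[s][a] += 1.0  # mark the visited state-action pair
    for i in range(len(Q)):
        for j in range(len(Q[i])):
            Q[i][j] += alpha * delta * E[i][j]  # credit recent pairs
            E[i][j] *= gamma * lam              # decay every trace
```

Here `Q[s][a]` holds the value estimate for taking action `a` in grid cell `s`. In an episodic maze the trace table `E` would normally be reset to zeros at the start of each episode.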
A collection of MATLAB reinforcement learning programs for the multi-armed bandit problem, solved with the ε-greedy, softmax, and time-varying ε-greedy strategies.
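The three bandit strategies differ only in how an arm is picked from the current value estimates. Below is a minimal Python sketch of each selection rule, not the MATLAB code from the collection. The function names, the temperature parameter `tau`, and in particular the `1/sqrt(t)` decay schedule are assumptions chosen for illustration; the collection may use a different schedule.

```python
import math
import random


def epsilon_greedy(q_values, epsilon, rng=random):
    """With probability epsilon explore a random arm, otherwise exploit."""
    if rng.random() < epsilon:
        return rng.randrange(len(q_values))
    return max(range(len(q_values)), key=lambda i: q_values[i])


def softmax_probs(q_values, tau=1.0):
    """Boltzmann probabilities; subtracting the max avoids overflow."""
    m = max(q_values)
    exps = [math.exp((q - m) / tau) for q in q_values]
    total = sum(exps)
    return [e / total for e in exps]


def softmax_action(q_values, tau=1.0, rng=random):
    """Sample an arm in proportion to its softmax probability."""
    arms = range(len(q_values))
    return rng.choices(arms, weights=softmax_probs(q_values, tau))[0]


def decaying_epsilon(t, eps0=1.0):
    """Hypothetical time-varying schedule: eps0 / sqrt(t), with t >= 1."""
    return min(1.0, eps0 / math.sqrt(t))
```

A time-varying ε-greedy agent would call `epsilon_greedy(q_values, decaying_epsilon(t))` at step `t`, so that exploration shrinks as the estimates improve.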