Resource search results
2007-Transfer learning via inter-task mappings for temporal difference learning
- Transfer learning via inter-task mappings for temporal difference learning (2007), another important paper in the transfer learning literature.
Othello
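- An Othello game with a GUI, written in Java. The search algorithm is a classic game tree with extensive optimizations on top. The evaluation function is the one from Simon M. Lucas and Thomas P. Runarsson's joint paper "Temporal Difference Learning Versus Co-Evolution for Acquiring Othello Position Evaluation", obtained by comparing temporal difference learning (TDL) and co-evolution (CEL, Co-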
Othello
- An Othello (Reversi) game with a GUI, written in Java. The search algorithm is a classic game tree with extensive optimizations on top. The evaluation function is the one Simon M. Lucas and Thomas P. Runarsson published in their Temporal
maze_TD_lamda
- Uses the temporal difference learning algorithm TD(λ) to solve an N*N grid maze problem, computing the V value of each cell.
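
For orientation, here is a minimal sketch of what TD(λ) value estimation on an N*N grid can look like. It is not the code from the maze_TD_lamda package. The uniform random policy, the -1 step cost, the goal in the bottom-right corner, and all names and parameter values (`td_lambda_maze`, `alpha`, `gamma`, `lam`) are assumptions made for illustration.

```python
import random


def td_lambda_maze(n=4, episodes=500, alpha=0.1, gamma=0.95, lam=0.8, seed=0):
    """Estimate V for every cell of an n*n grid under a uniform random policy."""
    rng = random.Random(seed)
    goal = (n - 1, n - 1)  # assumed terminal cell
    V = {(r, c): 0.0 for r in range(n) for c in range(n)}
    moves = [(-1, 0), (1, 0), (0, -1), (0, 1)]
    for _ in range(episodes):
        state = (0, 0)
        trace = {s: 0.0 for s in V}  # eligibility traces, reset each episode
        while state != goal:
            dr, dc = rng.choice(moves)
            # Moving into the outer wall leaves the agent where it is.
            nxt = (min(max(state[0] + dr, 0), n - 1),
                   min(max(state[1] + dc, 0), n - 1))
            reward = -1.0  # assumed cost per step
            delta = reward + gamma * V[nxt] - V[state]  # TD error
            trace[state] += 1.0  # accumulating trace
            for s in V:
                V[s] += alpha * delta * trace[s]
                trace[s] *= gamma * lam
            state = nxt
    return V


if __name__ == "__main__":
    values = td_lambda_maze()
    for r in range(4):
        print(" ".join(f"{values[(r, c)]:7.2f}" for c in range(4)))
```

Cells closer to the goal should end up with less negative values than cells far from it, since fewer penalized steps are expected before the episode ends.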
MATLAB
- Temporal difference learning is an important reinforcement learning algorithm. This code provides a simulation of path planning using temporal difference learning.
Chapter 6 (Temporal Difference Learning)
- Explains Q-learning and Sarsa in reinforcement learning, and shows through concrete examples how to apply these methods.
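
As a rough illustration of how the two methods named in this entry differ, the sketch below shows the standard tabular update rules side by side. It is not taken from the listed material. The nested-dict Q-table, the state and action labels, and the helper `make_q` are hypothetical, and the step size and discount are arbitrary.

```python
def sarsa_update(Q, s, a, r, s2, a2, alpha=0.5, gamma=0.9):
    """On-policy: bootstrap from the action a2 actually chosen in s2."""
    Q[s][a] += alpha * (r + gamma * Q[s2][a2] - Q[s][a])


def q_learning_update(Q, s, a, r, s2, alpha=0.5, gamma=0.9):
    """Off-policy: bootstrap from the highest-valued action in s2."""
    Q[s][a] += alpha * (r + gamma * max(Q[s2].values()) - Q[s][a])


def make_q():
    """Hypothetical two-state Q-table used only for this comparison."""
    return {"A": {"left": 0.0, "right": 0.0},
            "B": {"left": 1.0, "right": 2.0}}


if __name__ == "__main__":
    q_sarsa = make_q()
    sarsa_update(q_sarsa, "A", "right", 0.0, "B", "left")
    q_learn = make_q()
    q_learning_update(q_learn, "A", "right", 0.0, "B")
    print(q_sarsa["A"]["right"], q_learn["A"]["right"])
```

On the same transition, Sarsa moves the estimate toward the value of the action the agent will actually take next ("left", worth 1.0 here), while Q-learning moves it toward the best available next action ("right", worth 2.0), so the Q-learning estimate ends up larger.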