File name: WindyGridWorldQLearning
Description:
Q-learning (Watkins, 1989) is a simple way for agents to learn how to act optimally in controlled Markovian
domains. It amounts to an incremental method for dynamic programming which imposes limited computational
demands. It works by successively improving its evaluations of the quality of particular actions at particular states.
This paper presents and proves in detail a convergence theorem for Q-learning based on that outlined in Watkins
(1989). We show that Q-learning converges to the optimum action-values with probability 1 so long as all actions
are repeatedly sampled in all states and the action-values are represented discretely. We also sketch extensions
to the cases of non-discounted, but absorbing, Markov environments, and where many Q values can be changed
each iteration, rather than just one.
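
The listed file is a MATLAB script, so the following is a minimal MATLAB sketch of tabular Q-learning on the windy grid world task (Sutton & Barto, Example 6.5), the problem the file name refers to. The grid layout, wind strengths, parameter values, and variable names below are illustrative assumptions; they are not taken from the downloaded WindyGridWorldQLearning.m.

    % Minimal sketch: tabular Q-learning on the windy grid world.
    % All names and values here are assumptions, not the downloaded code.
    nRows = 7; nCols = 10;                 % grid size
    wind  = [0 0 0 1 1 1 2 2 1 0];         % upward wind per column
    start = [4 1]; goal = [4 8];           % start and goal cells
    actions = [-1 0; 1 0; 0 -1; 0 1];      % up, down, left, right
    nA = size(actions, 1);

    alpha = 0.5; gamma = 1.0; epsilon = 0.1;   % step size, discount, exploration
    Q = zeros(nRows, nCols, nA);               % discrete action-value table

    for episode = 1:200
        s = start;
        while ~isequal(s, goal)
            % epsilon-greedy action selection keeps all actions sampled,
            % matching the convergence condition in the abstract above
            if rand < epsilon
                a = randi(nA);
            else
                [~, a] = max(squeeze(Q(s(1), s(2), :)));
            end
            % apply the action plus wind, clipped to the grid
            s2 = s + actions(a, :);
            s2(1) = s2(1) - wind(s(2));        % wind pushes the agent up
            s2(1) = min(max(s2(1), 1), nRows);
            s2(2) = min(max(s2(2), 1), nCols);
            r = -1;                            % -1 per step until the goal
            % Q-learning update: bootstrap from the greedy value at s2
            Q(s(1), s(2), a) = Q(s(1), s(2), a) + ...
                alpha * (r + gamma * max(squeeze(Q(s2(1), s2(2), :))) ...
                         - Q(s(1), s(2), a));
            s = s2;
        end
    end

Because the table is updated with the max over next-state values regardless of the action actually taken, this is the off-policy method described in the abstract; with every state-action pair visited infinitely often and a suitably decaying step size, the tabulated values converge to the optimal action-values.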
Download file list:
WindyGridWorldQLearning.m
