搜索资源 - policy iteration - 搜珍网

CDN加速镜像 | 设为首页 | 加入收藏夹

热门搜索： 源码 Android 整站插件识别 p2p OpenCV 网络编程游戏源码算法更多...

登陆 | 会员注册

当前位置：

搜索资源 - policy iteration

下载资源主分类

源码下载

Web源码

开发工具

文档下载

其它资源

资源分类

搜索资源列表

srcV0624

0下载：
这个代码是policy iteration算法关于强化学习的. 请您用winzip 解压缩-policy iteration algorithm for enhanced learning. Please use winzip decompress
所属分类：人工智能/神经网络/遗传算法
- 发布日期：2008-10-13
- 文件大小：18680
- 提供者：柳春

MDPtoolbox

0下载：
The MDP toolbox proposes functions related to the resolution of discrete-time Markov Decision Process : finite horizon, value iteration, policy iteration, linear programming algorithms with some variants. The functions (m-functions) were developpe
所属分类：3G开发
- 发布日期：2008-10-13
- 文件大小：2437606
- 提供者：劉德華

MDPtoolbox

1下载：
马尔科夫决策过程值迭代算法value iteration，策略迭代等函数代码，从国外网站下载，非常详细，有用。-Markov decision process value iteration algorithm value iteration, policy iteration and so the function code, from the foreign website, very detailed and useful.
所属分类：matlab
- 发布日期：2017-03-28
- 文件大小：236965
- 提供者：小小

PolicyItr

0下载：
This is policy iteration learning algorithm
所属分类：Other systems
- 发布日期：2017-04-11
- 文件大小：827
- 提供者：fati

policyi

0下载：
Policy iteration algorithm of Howard applied to linear regulator Also known as Newton s method -Policy iteration algorithm of Howard applied to linear regulator Also known as Newton s method
所属分类：matlab
- 发布日期：2017-03-26
- 文件大小：1146
- 提供者：katore vishal

MDP-model-of-MPNP

3下载：
在matlab平台上，针对多周期报童问题，采用值迭代算法、策略迭代算法和强化学习算法求解MDP模型的实例-This is an example presentting how to apply value-iteration algorithm，policy-iteration algorithm and reinforcement learning algorithm to MDP model, which aims to solve the multi-period newsboy prob
所属分类：matlab例程
- 发布日期：2013-04-30
- 文件大小：18013
- 提供者：yejunyu

mdpPI

0下载：
Contains excellent and exact implementation of Markov Dec. Processes using Policy Iteration and Value Itreration from the book Perter Norvig for AI.-Contains excellent and exact implementation of Markov Dec. Processes using Policy Iteration and Value
所属分类：matlab
- 发布日期：2017-11-10
- 文件大小：2722
- 提供者：ankindian

Optimality-of-the-NVI-Adaptive-Policy-for-a-Parti

0下载：
Paper on the optimality of a non-stationary value iteration adaptive policy for a Partially Observed Markov Decision Proce-Paper on the optimality of a non-stationary value iteration adaptive policy for a Partially Observed Markov Decision Process
所属分类：Development Research
- 发布日期：2017-11-05
- 文件大小：1529904
- 提供者：elsk

MachineLearningMazePolicyEvaluation

0下载：
Machine Learning Code maze policy iteration value iteration
所属分类：matlab
- 发布日期：2017-04-03
- 文件大小：213274
- 提供者：elmesias

CleanRobot

0下载：
清洁机器人，确定情况，随机情况，策略迭代，Q值计算，人工智能实验-Cleaning robot to determine the situation, the random case, policy iteration, Q value, artificial intelligence experiment
所属分类：AI-NN-PR
- 发布日期：2017-04-02
- 文件大小：234351
- 提供者：罗元国

Policy-iteration

0下载：
this code simulates the policy improvement iteration in a 3*3 grid game.
所属分类：matlab
- 发布日期：2017-04-07
- 文件大小：1075
- 提供者：Funmilayo

MDPgridworldExample

0下载：
世界是空格自由（0）或障碍物（1）。每转动机器人可以在8个方向移动，或者留在地方。奖励功能，给人一种自由空间，目标定位，高回报。所有其他空格自由具有小的损失，和障碍具有大的负的奖励。值迭代是用来学习的最佳“政策”，即指定一个控制输入到每一个可能的位置的功能。- The world is freespaces (0) or obstacles (1). Each turn the robot can move in 8 directions, or stay in place. A reward
所属分类：matlab
- 发布日期：2017-04-14
- 文件大小：3371
- 提供者：莫文杰

inverted-pendulum-control

4下载：
利用强化学习的自适应动态规划中的值迭代和策略迭代方法，神经网络控制方法，LQR状态调节器最优控制方法，实现了三维倒立摆在飞行器上的稳定控制。鲁棒性很强，进行了高斯白噪声的扰动实验。-Reinforcement learning adaptive dynamic programming in value iteration and policy iteration method, neural network control method, LQR state regulator optimal
所属分类：AI-NN-PR
- 发布日期：2017-05-04
- 文件大小：49770
- 提供者：边疆

MDP_pi.py

0下载：
Reinforcement Learning. Policy iteration algorithm. Original coded.
所属分类：其他
- 发布日期：2018-01-03
- 文件大小：2048
- 提供者：helen_ray

pi.py

0下载：
Reinforcement Learning policy iteration algorithm
所属分类：人工智能/神经网络/深度学习
- 发布日期：2018-01-03
- 文件大小：2048
- 提供者：helen_ray

Dynamic-Programming-master

1下载：
经典的基于策略迭代和值迭代法的动态规划matlab代码，实现机器人的最优运输(The classic dynamic programming matlab code based on policy iteration and value iteration method realizes the optimal transportation of the robot.)
所属分类：其他
- 发布日期：2020-04-20
- 文件大小：103424
- 提供者：纯白色morning

KLSPI论文

0下载：
Kernel-based least squares policy iteration for reinforcement learning. IEEE Transactions on Neural Networks, 2007, 18(4) 973-992
所属分类：报告论文
- 发布日期：2023-11-01
- 文件大小：807168
- 提供者：zhenliu

搜珍网 www.dssz.com

本网站为编程资源及源代码搜集、介绍的搜索网站，版权归原作者所有！　　粤ICP备11031372号

1999-2046 搜珍网 All Rights Reserved.