Courses:

Decision Making in Large Scale Systems >> Content Detail



Lecture Notes



Lecture Notes

LEC #TOPICSLECTURE NOTES
1Markov Decision Processes

Finite-Horizon Problems: Backwards Induction

Discounted-Cost Problems: Cost-to-Go Function, Bellman's Equation
(PDF)
2Value Iteration

Existence and Uniqueness of Bellman's Equation Solution

Gauss-Seidel Value Iteration
(PDF)
3Optimality of Policies derived from the Cost-to-Go Function

Policy Iteration

Asynchronous Policy Iteration
(PDF)
4Average-Cost Problems

Relationship with Discounted-Cost Problems

Bellman's Equation

Blackwell Optimality
(PDF)
5Average-Cost Problems

Computational Methods
(PDF)
6Application of Value Iteration to Optimization of Multiclass Queueing Networks

Introduction to Simulation-based Methods Real-Time Value Iteration
(PDF)
7Q-Learning

Stochastic Approximations
(PDF)
8Stochastic Approximations: Lyapunov Function Analysis

The ODE Method

Convergence of Q-Learning
(PDF)
9Exploration versus Exploitation: The Complexity of Reinforcement Learning(PDF)
10Introduction to Value Function Approximation

Curse of Dimensionality

Approximation Architectures
(PDF)
11Model Selection and Complexity(PDF)
12Introduction to Value Function Approximation Algorithms

Performance Bounds
(PDF)
13Temporal-Difference Learning with Value Function Approximation(PDF)
14Temporal-Difference Learning with Value Function Approximation (cont.)(PDF)
15Temporal-Difference Learning with Value Function Approximation (cont.)

Optimal Stopping Problems

General Control Problems
(PDF)
16Approximate Linear Programming(PDF)
17Approximate Linear Programming (cont.)(PDF)
18Efficient Solutions for Approximate Linear Programming(PDF)
19Efficient Solutions for Approximate Linear Programming: Factored MDPs(PDF)
20Policy Search Methods(PDF)
21Policy Search Methods (cont.)(PDF)
22Policy Search Methods for POMDPs

Application: Call Admission Control

Actor-Critic Methods
23Approximate POMDP Compression
24Policy Search Methods: PEGASUS

Application: Helicopter Control

 








© 2017 Coursepedia.com, by Higher Ed Media LLC. All Rights Reserved.