Constructing Features for Prediction

Prediction and Control with Function Approximation

Coursera
notes
rl
reinforcement learning
the k-armed bandit problem
bandit algorithms
exploration
explotation
epsilon greedy algorithm
sample avarage method
Author

Oren Bochman

Published

Tuesday, April 2, 2024

RL algorithms

RL algorithms

Introduction

Lesson 1: Feature Construction for Linear Methods

Learning Objectives

Lesson 2: Neural Networks

Learning Objectives

Lesson 3: Training Neural Networks

Learning Objectives
Discussion prompt

What properties of the representation are important for our online setting? This contrasts the offline, batch setting.

References

Sutton, R. S., and A. G. Barto. 2018. Reinforcement Learning, Second Edition: An Introduction. Adaptive Computation and Machine Learning Series. MIT Press. http://incompleteideas.net/book/RLbook2020.pdf.

Reuse

CC SA BY-NC-ND

Citation

BibTeX citation:
@online{bochman2024,
  author = {Bochman, Oren},
  title = {Constructing {Features} for {Prediction}},
  date = {2024-04-02},
  url = {https://orenbochman.github.io/notes/RL/c3-w2.html},
  langid = {en}
}
For attribution, please cite this work as:
Bochman, Oren. 2024. “Constructing Features for Prediction.” April 2, 2024. https://orenbochman.github.io/notes/RL/c3-w2.html.