Houjun Liu

supervised learning

Supervised learning (also known as behavioral cloning) if the agent is learning what to do in an observe-act cycle) is a type of decision making method.

constituents

  • input space: \(\mathcal{X}\)
  • output space: \(\mathcal{Y}\)
  • hypothesis/model/prediction: \(h : \mathcal{X} \to \mathcal{Y}\)

requirements

Our ultimate goal is to learn a good model \(h\) from the training set:

  • what “good” means is hard to define
  • we generally want to use the model on new data, not just the training set

continuous \(\mathcal{Y}\) is then called a regression problem; discrete \(\mathcal{Y}\) is called a classification problem.

That is, we want our hypothesis to behave as \(h_{\theta}\qty (x^{(i)}) \approx y^{(i)}\).

additional information

training set

The training set is a set of pairs:

\begin{equation} \qty {\qty(x^{(1)}, y^{(1)}) \dots \qty (x^{(n)}, y^{(n)})} \end{equation}

such that \(x^{(j)} \in \mathcal{X}, y^{(j)} \in \mathcal{Y}\).

We call \(n\) the training set size.

main procedure

  1. provide the agent with some examples
  2. use an automated learning algorithm to generalize from the example

This is good for typically representative situations, but if you are throwing an agent into a completely unfamiliar situation, supervised learning cannot perform better.

Disadvantages

  • the labeled data is finite
  • limited by the quality of performance in the training data
  • interpolation between states are finite

cost function

see cost function

evaluation

see machine learning evaluation