Notation
New Concepts
- supervised learning
- machine learning evaluation
- root-mean-square error (the square root of the mean squared error; averaging over examples and taking the root puts the error back in the units of the target, so “units work well” for test benchmarks)
- linear regression
- gradient descent
- normal equation
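
A minimal sketch tying the concepts above together, assuming NumPy and a synthetic one-feature dataset. The function names, learning rate, step count, and data are all made up for illustration and are not from the course material. It fits a linear regression two ways (normal equation and batch gradient descent on mean squared error) and scores the fit with root-mean-square error.

```python
import numpy as np


def rmse(y_true, y_pred):
    """Root-mean-square error: sqrt of the mean squared residual."""
    return float(np.sqrt(np.mean((y_true - y_pred) ** 2)))


def fit_normal_equation(X, y):
    """Solve (X^T X) theta = X^T y directly (assumes X^T X is invertible)."""
    return np.linalg.solve(X.T @ X, X.T @ y)


def fit_gradient_descent(X, y, lr=0.1, steps=5000):
    """Batch gradient descent on mean squared error.

    lr and steps are illustrative values chosen for this toy data.
    """
    theta = np.zeros(X.shape[1])
    n = len(y)
    for _ in range(steps):
        # Gradient of (1/n) * ||X theta - y||^2 with respect to theta.
        grad = (2.0 / n) * X.T @ (X @ theta - y)
        theta -= lr * grad
    return theta


# Synthetic data (an assumption): y = 2 + 3x plus Gaussian noise.
rng = np.random.default_rng(0)
x = rng.uniform(-1.0, 1.0, size=200)
X = np.column_stack([np.ones_like(x), x])  # column of ones for the intercept
y = 2.0 + 3.0 * x + rng.normal(0.0, 0.1, size=200)

theta_ne = fit_normal_equation(X, y)
theta_gd = fit_gradient_descent(X, y)

print("normal equation :", theta_ne)
print("gradient descent:", theta_gd)
print("RMSE:", rmse(y, X @ theta_ne))
```

On a well-conditioned problem like this one, both methods should land on essentially the same parameters; the normal equation gets there in one linear solve, while gradient descent needs a step size and an iteration budget.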
Important Results / Claims
Questions
- Why can’t we use root-mean-square error as the training objective? It seems like it’s just least-squares error with extra normalization…
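
A possible line of reasoning for the question above, sketched as a worked equation rather than a definitive answer. The symbols are assumptions, since the Notation section does not define any: $h_\theta$ is the model, $(x_i, y_i)$ are the $n$ training examples. Because the square root is strictly increasing on nonnegative numbers, both objectives share the same minimizer:

```latex
\arg\min_\theta \sqrt{\frac{1}{n}\sum_{i=1}^{n}\bigl(h_\theta(x_i)-y_i\bigr)^2}
\;=\;
\arg\min_\theta \frac{1}{n}\sum_{i=1}^{n}\bigl(h_\theta(x_i)-y_i\bigr)^2

% The gradients differ only by a positive scalar (where RMSE > 0):
\nabla_\theta \mathrm{RMSE}(\theta)
\;=\;
\frac{1}{2\,\mathrm{RMSE}(\theta)}\,\nabla_\theta \mathrm{MSE}(\theta)
```

Under these assumptions, training on root-mean-square error is not ruled out; it just buys nothing. The squared form stays quadratic in $\theta$ for linear regression, which is what makes the normal equation available, and its gradient avoids the division by $\mathrm{RMSE}(\theta)$, which is undefined at a perfect fit.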