Course - Optimisation for Data Science HT25

Created: January 20, 2025 | Updated: January 26, 2026 | Read markdown | About these notes | View in context | Study these flashcards

This course analyses optimisation methods suitable for large-scale data science problems, mainly by deriving results on the rate of convergence under increasing assumptions (smooth, convex, strongly convex) on the objective functions.

The course begins with some optimisation terminology and then covers gradient descent and the proximal method, which can be used to apply steepest descent techniques to regularised problems. Then it covers acceleration techniques such as the heavy ball method, and then moves onto stochastic gradient descent and accelerated techniques in that context. Finally, it covers coordinate descent methods.

Notes

Problem Sheets

To-Do List

Relevant reading

Courses HT25^U

(incoming)
Part B^U

(incoming)
University Notes^U

(incoming)
Course - Continuous Optimisation HT26^U

(incoming)
Course - Geometric Deep Learning HT26^U

(incoming)
Course - Theories of Deep Learning MT25^U

(incoming)
Lecture - Theories of Deep Learning MT25, VII, Stochastic gradient descent and its extensions^U

(incoming)
Lecture - Theories of Deep Learning MT25, VIII, Optimisation algorithms for training DNNs^U

(incoming)
Course - Machine Learning MT23^U

(sim: 0.653)
Lecture - Machine Learning MT23, IX^U

(sim: 0.69)
Notes - Machine Learning MT23, Optimisation^U

(incoming)
Notes - Optimisation for Data Science HT25, Coordinate descent^U

(incoming)
Notes - Optimisation for Data Science HT25, Motivation and examples^U

(incoming)
Notes - Optimisation for Data Science HT25, Nesterov's accelerated gradient method^U

(incoming)
Notes - Optimisation for Data Science HT25, Stochastic gradient descent^U

(incoming)
Notes - Optimisation for Data Science HT25, Subgradients^U

(incoming)
Notes - Optimisation for Data Science HT25, Steepest descent^U

(incoming)