Course - Optimisation for Data Science HT25
This course analyses optimisation methods suitable for large-scale data science problems, mainly by deriving results on the rate of convergence under increasing assumptions (smooth, convex, strongly convex) on the objective functions.
The course begins with some optimisation terminology and then covers gradient descent and the proximal method, which can be used to apply steepest descent techniques to regularised problems. Then it covers acceleration techniques such as the heavy ball method, and then moves onto stochastic gradient descent and accelerated techniques in that context. Finally, it covers coordinate descent methods.
- Course Webpage
- Lecture Notes
- From the previous year:
- Other courses this term: Courses HT25U
Notes
Notes - Optimisation for Data Science HT25, Overview of resultsU ⭐️
Notes - Optimisation for Data Science HT25, Motivation and examplesU
Notes - Optimisation for Data Science HT25, Steepest descentU
Notes - Optimisation for Data Science HT25, Steepest descent with inexact line searchU
Notes - Optimisation for Data Science HT25, Proximal methodsU
Notes - Optimisation for Data Science HT25, Accelerated methodsU
Notes - Optimisation for Data Science HT25, Nesterov’s accelerated gradient methodU
Notes - Optimisation for Data Science HT25, Stochastic gradient descentU
Notes - Optimisation for Data Science HT25, Stochastic variance reduction methodsU
Notes - Optimisation for Data Science HT25, Coordinate descentU
Problem Sheets
- redacted?
- Sheet 1, partial answers
- Sheet 2, partial answers
- Sheet 3, partial answers
- Sheet 4, partial answers
