CS 294: Algorithmic Aspects of Machine Learning

Fall 2024




In this graduate class, we will explore various facets of the question: What can theory contribute to our understanding of modern machine learning?

Our perspective will be algorithmic. We will introduce ubiquitous models like mixture models, graphical models, random graphs, linear dynamical systems, ReLU networks, Markov decision processes, diffusion models and large language models (if the instructor can figure out what to say about them!) and cover the state-of-the-art in terms of what is known in the way of algorithms and lower bounds. Through these examples, we will see powerful algorithmic methodologies in action like the method of moments, tensor decompositions, spectral methods and semi-definite programming.

In many ways, the theory of machine learning is also about how to frame problems so that we capture essential challenges in machine learning at the right level of abstraction. We will emphasize themes like beyond worst-case analysis, stability, robustness, identifiability and computational vs. statistical tradeoffs that serve as our guide for where and how to look for algorithmically interesting problems.

Announcement: Orr set up a Google groups for the class here which we can use for announcements

Announcement: Amaan set up a discord server for the class here which you can use for discussion, setting up study groups, etc.

Announcement: PSET 1 is posted here and due September 27th via gradescope

Course Information

Syllabus and Schedule

The syllabus and schedule of topics will vary based on student interests. Previous versions of the class taught at MIT can be found here. This section will be updated throughout the semester.

Instructor Notes