Christina Delimitrou

Assistant Professor, MIT

Welcome to my webpage! I am an assistant professor in the Electrical Engineering and Computer Science Department at MIT, where I lead the SAIL group. Previously, I was an Assistant Professor in the Electrical and Computer Engineering Department at Cornell University, a member of the Computer Systems Laboratory (CSL), and the John and Norma Balen Sesquicentennial Faculty Fellow. My main interests are in computer architecture and computer systems. Specifically, I work on improving the resource efficiency of large-scale datacenters through QoS-aware scheduling and resource management techniques. I am also interested in designing efficient server architectures, distributed performance debugging, and cloud security. Before joining Cornell, I earned a Ph.D. in Electrical Engineering at Stanford University, where I worked with Christos Kozyrakis. I had previously earned an M.S. in Electrical Engineering from Stanford (2011) and a Diploma in Electrical and Computer Engineering from the National Technical University of Athens (2009).

I have been fortunate to receive a Sloan Faculty Research Award, two Google Faculty Research Awards (2019 and 2020), a Microsoft Research Faculty Fellowship, the 2020 IEEE TCCA Young Computer Architect Award, an Intel Rising Star Award, a Google Research Award in Recognition of Technical Leadership and Achievements in Systems Research, a Facebook Faculty Research Award, the Cornell Excellence in Research Award, and the Cornell Excellence in Teaching Award. My work has also received six IEEE Micro TopPicks Awards, one TopPicks Honorable Mention, the ASPLOS'24 Influential Paper Award, and several best paper awards. You can find more information in my CV.

I am looking for motivated PhD, MS, and undergrad students to join my group! If you are excited about cloud computing, datacenters, and computer architecture and systems in general, send me an email with your CV and we can set up a meeting.

Contact Information
32-G738 Stata Center, CSAIL, Electrical Engineering and Computer Science, MIT
Cambridge, MA
E-mail: delimitrou@csail.mit.edu

Research

"I know but one freedom and that is the freedom of the mind"
Antoine de Saint-Exupéry

I work in computer architecture, cloud systems, and applied machine learning.

Current Projects

Older Projects

During my Ph.D., I worked extensively on improving resource efficiency in cloud systems. Given the low datacenter utilization at the time, caused in part by overprovisioned resource reservations, I built a number of systems that relied on machine learning (ML) to automate scheduling and resource management in the cloud.

Paragon---QoS-Aware Cloud Scheduling: Paragon is a QoS-aware datacenter scheduler that accounts for interference between co-scheduled workloads and for platform heterogeneity. The scheduler uses fast classification techniques to determine the interference and heterogeneity preferences of incoming applications while adding only minimal scheduling overhead. In large-scale cluster evaluations, Paragon significantly improved both performance and resource utilization compared to prior systems.
[ASPLOS'13 paper] [TopPicks'14 paper] [TOCS'13 paper]

Quasar---ML-Driven Cluster Management: Traditionally, users overprovision their resource reservations in the cloud to side-step performance unpredictability. Quasar is a cluster manager that introduces a different interface between system and users. Instead of specifying raw resources, the user only specifies a performance target a job must meet. Quasar then leverages efficient data mining techniques to determine the resource preferences of a new job, much like a movie recommendation system finds similarities between previous and new users to recommend movies that they are likely to enjoy. Quasar achieves both high cluster utilization and high per-application performance.
[ASPLOS'14 paper] [demo] [press]
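
The recommendation-system analogy above can be illustrated with a small sketch. This is not Quasar's actual algorithm: it substitutes a simple nearest-neighbor estimate, and the function names, configuration labels, and numbers are all hypothetical. The idea shown is only that a new job is profiled on a few configurations, its remaining behavior is estimated from a similar, previously seen job, and the cheapest configuration that meets the user's performance target is chosen.

```python
def estimate_missing(new_job, known_jobs):
    """Estimate unmeasured configurations of new_job from the most similar known job.

    new_job:    dict config -> measured performance (only a few configs profiled).
    known_jobs: dict job name -> dict config -> performance (fully profiled).
    Both structures are hypothetical and exist only for this illustration.
    """
    def distance(profile):
        # Mean squared difference over the configurations both jobs have data for.
        shared = [c for c in new_job if c in profile]
        if not shared:
            return float("inf")
        return sum((new_job[c] - profile[c]) ** 2 for c in shared) / len(shared)

    nearest = min(known_jobs, key=lambda name: distance(known_jobs[name]))
    estimate = dict(known_jobs[nearest])
    estimate.update(new_job)  # real measurements override borrowed estimates
    return nearest, estimate


def cheapest_config(estimate, cost, target):
    """Return the lowest-cost configuration whose estimated performance meets target."""
    feasible = [c for c in estimate if estimate[c] >= target]
    return min(feasible, key=lambda c: cost[c]) if feasible else None


known = {
    "web":   {"small": 10, "medium": 20, "large": 40},
    "batch": {"small": 5,  "medium": 6,  "large": 7},
}
cost = {"small": 1, "medium": 2, "large": 4}

nearest, estimate = estimate_missing({"small": 9}, known)
choice = cheapest_config(estimate, cost, target=15)
```

In this toy example the user supplies only the target (15); the sizing decision comes from the estimate rather than from a user-specified reservation.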

Tarcil---Low-Latency Distributed Scheduling: Tarcil is a scheduler that addresses the disparity between sophisticated but slow centralized schedulers and fast but low-quality distributed schedulers. Tarcil uses sampling to lower scheduling overheads, and it accounts for the resource preferences of new jobs to keep scheduling quality high. It improves performance for both short and long jobs compared to centralized and distributed schedulers.
[SOCC'15 paper]
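
As a rough illustration of sampling-based placement in general, and not of Tarcil's actual policy, the sketch below inspects only a random sample of servers and picks the tightest fit among them. The server names, the single "free capacity" number per server, and the function name are assumptions made for this example.

```python
import random


def sample_and_place(job_demand, free_capacity, sample_size, rng):
    """Place a job by examining only a random sample of servers.

    free_capacity: dict server name -> free resource units (hypothetical model).
    Returns the sampled server that fits the job with the least leftover
    capacity, or None if no sampled server can hold the job.
    """
    servers = sorted(free_capacity)  # fixed order so a seeded rng is reproducible
    candidates = rng.sample(servers, min(sample_size, len(servers)))
    fitting = [s for s in candidates if free_capacity[s] >= job_demand]
    if not fitting:
        return None
    return min(fitting, key=lambda s: free_capacity[s] - job_demand)


free = {"a": 8, "b": 3, "c": 5}
rng = random.Random(0)

# Sampling every server degenerates to an exhaustive best-fit search.
full = sample_and_place(4, free, sample_size=3, rng=rng)

# A smaller sample costs less to inspect but may miss the best server.
quick = sample_and_place(4, free, sample_size=1, rng=rng)
```

The sample size is the knob that trades scheduling latency against placement quality: a larger sample approaches a centralized search, while a smaller one is cheaper but can return a worse server or none at all.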

HCloud---Hybrid Cloud Provisioning: Paragon and Quasar assume that the cluster manager has full control over the entire system. Unfortunately, real life can be more complicated, especially when the resources used are hosted on a public cloud provider. In this work, I designed a system that determines the most cost-efficient instance type (reserved vs. on-demand) and size a job needs to satisfy its QoS constraints. I evaluated this system on a cluster with a few hundred servers on Google Compute Engine.
[ASPLOS'16 paper]

iBench: Paragon and Quasar need to know the sensitivity of an incoming application to various types of interference. iBench is a benchmark suite that consists of a set of microbenchmarks, each of which puts pressure on a specific shared resource. iBench enables fast and practical characterization of both the interference an application tolerates in various resources and the interference it generates itself.
[IISWC'13 paper]

Datacenter Application Modeling: I also worked on characterizing and modeling the behavior of large-scale datacenter applications. I designed and implemented ECHO, a concise analytical model that captures and recreates the network traffic of distributed datacenter applications. I also developed a modeling framework for storage workloads, which generates synthetic load patterns similar to those of the original applications. Both modeling frameworks were validated against real datacenter applications from Microsoft and were used in a series of efficiency and cost optimization studies.
[IISWC'12 paper] [IISWC'11 paper] [CAL'12 paper] [TPCTC'11 paper]

Research

Current Projects

Hardware Specialization and Server Design for the Cloud

Machine-Learning-Driven Cloud Management

Machine-Learning-Driven Cloud Performance Debugging

Cloud Programming Framework Design

Representative Cloud (Micro)Services

Other Projects

Older Projects