I am an Associate Professor of Electrical Engineering and Computer Science in MIT's Computer Science and Artificial Intelligence Laboratory. Currently, my research focuses on building systems for machine learning , and using machine learning for systems. For example, with our work on Learned Indexes we started to explore how we can enhance or even replace core systems components using machine learning models and early results suggest, that we are able to achieve significant performance improvements over state-of-the-art techniques and sometimes are even able to change the complexity class of certain algorithms. On the other hand, with Vizdom and IDEA we are exploring new user interfaces and infrastructure to democratize data science by enabling visual, interactive, and assisted data exploration and model building. One particular focus of our work is to not only make the model building process faster, but also safer by automatically preventing the user from common pitfalls.
Before joining MIT, I was an Assistant Professor at Brown University, spent time at Google Research, and was a PostDoc in the AMPLab at UC Berkeley after I got my PhD from ETH Zurich. I am a 2017 Alfred P. Sloan Research Fellow in computer science and received the 2017 VMware Systems Research Award, an NSF CAREER Award, an Air Force Young Investigator award, two Very Large Data Bases (VLDB) conference best-demo awards, and a best-paper award from the IEEE International Conference on Data Engineering (ICDE).

Current Research Interests

  • ML-enhanced data structures and algorithms
  • Systems for interactive data exploration and model building
  • Infrastructure for rack-scale analytics and machine learning
  • Transaction processing over high-speed networks
  • Hybrid human-machine data management systems

Research Projects

In the following, a list of my current and past research projects:

  • Learned Systems Components - How to Enhance traditional data structures and algorithms through machine learning
  • VizDom - Data Exploration on Interactive Whiteboards
  • Tupleware - Redefining Modern Analytics on Modern Hardware
  • QUDE - Quantifying the Uncertainty in Data Exploration
  • MLBase - The Distributed Machine-Learning Management System
  • S-Store - A streaming OLTP system for big velocity applications
  • MDCC - The Fastest Strong Consistent Multi-Data Center Replication Protocol
  • CrowdDB - Answering Queries with Crowdsourcing
  • PIQL - Performance Insightful Query Language
  • Cloudy/Smoky - a distributed storage and streaming service in the cloud
  • Building a database on cloud infrastructure
  • CloudBench - a benchmark for the cloud
  • Zorba - a general purpose XQuery processor implementing in C++
  • MXQuery - A lightweight, full-featured Java XQuery Engine
  • Mapping Data to Queries (MDQ) - data integration with XQuery
  • XQIB - XQuery In the Browser

Room 32-G914
32 Vassar St.
Cambridge, MA 02139

Phone: +1 (510) 926-5856