I am a member of the Computer Vision Group at Microsoft Research in Redmond. I work in computer vision and machine learning. Most recently, my research is focused on learning from unlabeled and noisy data. I am particularly interested in unsupervised/self-supervised learning from video data, leveraging spatio-temporal and multimodal structures. Before joining Microsoft in 2018, I spent 4 years at Yahoo Research in NYC. I obtained Master's and PhD degrees in Computer Science from MIT in 2010 and 2014, respectively, where I was a member of Computer Science and Artificial Intelligence Laboratory (CSAIL). My dissertation investigated learning from structured data and its applications to video understanding. I was lucky to have my committee Randall Davis, Bill Freeman, John Fisher, and Louis-Philippe Morency.
I am looking for motivated PhD students who are passionate about visual and multimodal representation learning from unlabeled videos. Reach out if you are interested in doing research internship with me at MSR Redmond.