My research is centered around computer vision and machine learning, especially visual perception (classification, detection, segmentation) and visual content generation (summarization and synthesis). Most of my recent work is focused on self-supervised learning from videos, with a primary focus on learning to predict/synthesize future frames in videos. Besides conducting basic research, I am also interested in making real-world impact with computer vision: Some of my work have been deployed to production at Yahoo, including video thumbnail detection at Flickr and Tumblr, video summary generation at Video Guide, and live stream video highlighting at Yahoo eSports.
I obtained Master's and PhD degrees in Computer Science from Massachusetts Institute of Technology in 2010 and 2014, respectively. I was a member of the Computer Science and Artificial Intelligence Laboratory, and my advisor was Randall Davis. My dissertation investigated learning from structured data and its applications to video understanding. I was lucky to have my committee Randall Davis (chair), Bill Freeman, John Fisher, and Louis-Philippe Morency.