Bolei Zhou

Ph.D. Candidate at MIT
Office: 32-D475B
Email: bolei@mit.edu
[CV][Google Scholar][Github][Linkedin][Zhihu]

About Me

Updates

Selected Projects and Publications

Bolei Zhou, Alex Andonian, and Antonio Torralba
Temporal Relational Reasoning in Videos.
arXiv:1711.08496, 2017.
[arXiv][Webpage][Demo Video]
Mathew Monfort, Bolei Zhou, Sarah Adel Bargal, Tom Yan, Alex Andonian, Kandan Ramakrishnan, Lisa Brown, Quanfu Fan, Dan Gutfreund, Carl Vondrick, Aude Oliva.
Moments in Time Dataset: one million videos for event understanding.
arXiv, 2017.
[Tech Report][Website]
Bolei Zhou*, David Bau*, Aude Oliva, and Antonio Torralba.
Interpreting Deep Visual Representations via Network Dissection.
Under review for TPAMI, arXiv:1711.05611, 2017. * indicates equal contribution.
[arXiv][Webpage][Code]
Bolei Zhou, Agata Lapedriza, Aditya Khosla, Aude Oliva, and Antonio Torralba.
Places: A 10 Million Image Database for Scene Recognition.
IEEE Transactions on Pattern Analysis and Machine Intelligence, July 2017.
[PDF][Places2 Dataset][Challenge Page][Places365 CNN models][Demo]
Yikang Li, Nan Duan, Bolei Zhou, Xiao Chu, Wanli Ouyang, and Xiaogang Wang
Visual Question Generation as Dual Task of Visual Question Answering.
arXiv:1709.07192, 2017.
[arXiv]
Yikang Li, Wanli Ouyang, Bolei Zhou, Kun Wang, and Xiaogang Wang
Scene Graph Generation from Objects, Phrases and Region Captions.
International Conference on Computer Vision (ICCV), 2017.
[PDF][Code]
Hang Zhao, Xavier Puig, Bolei Zhou, Sanja Fidler, and Antonio Torralba
Open Vocabulary Scene Parsing.
International Conference on Computer Vision (ICCV), 2017.
(arXiv:1703.08769).
[PDF][arXiv][Webpage]
Bolei Zhou, Hang Zhao, Xavier Puig, Sanja Fidler, Adela Barriuso and Antonio Torralba.
Scene Parsing through ADE20K Dataset.
Computer Vision and Pattern Recognition (CVPR), 2017.
[PDF][Dataset][Benchmark Page][Challenge Page][Toolkit&Code][Demo]
David Bau*, Bolei Zhou*, Aditya Khosla, Aude Oliva, and Antonio Torralba.
Network Dissection: Quantifying Interpretability of Deep Visual Representations.
Computer Vision and Pattern Recognition (CVPR), 2017, oral. * indicates equal contribution.
[PDF][arXiv][webpage][code][Talk Video]
Shuang Li, Tong Xiao, Hongsheng Li, Bolei Zhou, Dayu Yue, and Xiaogang Wang.
Person Search with Natural Language Description.
Computer Vision and Pattern Recognition (CVPR), 2017.
[PDF][Dataset]
J. Wong, V. Kee, T. Le, S. Wagner, G. Mariottini, A. Schneider, L. Hamilton, R. Chipalkatty, M. Hebert, D. Johnson, J. Wu, B. Zhou, and A. Torralba.
SegICP: Integrated Deep Semantic Segmentation and Pose Estimation.
IEEE International Conference on Intelligent Robots and Systems (IROS'17) as Oral (arXiv:1703.01661)
[PDF]
Bolei Zhou, Hang Zhao, Xavier Puig, Sanja Fidler, Adela Barriuso and Antonio Torralba.
Semantic Understanding of Scenes through ADE20K Dataset.
arXiv:1608.05442, 2016.
[PDF][Dataset][Benchmark Page][Challenge Page][Toolkit&Code][Demo]
Bolei Zhou, Aditya Khosla, Agata Lapedriza, Aude Oliva, and Antonio Torralba
Learning Deep Features for Discriminative Localization.
Computer Vision and Pattern Recognition (CVPR), 2016 (arXiv:1512.04150)
[PDF] [arXiv][Project Page][Video of CNN shifting its attention]
Donglai Wei, Bolei Zhou, Antonio Torralba, William Freeman
Understanding Intra-Class Knowledge inside CNN.
arXiv:1507.02379, 2015.
[PDF][Page][Code]
Bolei Zhou, Yuandong Tian, Sainbayar Sukhbaatar, Arthur Szlam, Rob Fergus
Simple Baseline for Visual Question Answering.
arXiv:1512.02167, 2015.
[PDF][Demo][Code]
Zi Wang, Bolei Zhou, Stephanie Jegelka
Optimization as Estimation with Gaussian Processes in Bandit Settings.
Artificial Intelligence and Statistics (AISTATS'16) as oral, 2016. (arXiv:1510.06423)
[PDF][Project][Code]
Bolei Zhou, Aditya Khosla, Agata Lapedriza, Aude Oliva, and Antonio Torralba
Object Detectors Emerge in Deep Scene CNNs.
International Conference on Learning Representations (ICLR) as oral, 2015. (arXiv:1412.6856)
[PDF][Project Page][More Visualization][Code]
Bolei Zhou, Vignesh Jagadeesh, and Robinson Piramuthu
ConceptLearner: Discovering Visual Concepts from Weakly Labeled Image Collections.
Computer Vision and Pattern Recognition (CVPR), 2015. (arXiv:1411.5319)
[PDF][Project Page & Demo]
Bolei Zhou, Agata Lapedriza, Jianxiong Xiao, Antonio Torralba, and Aude Oliva
Learning Deep Features for Scene Recognition using Places Database.
Advances in Neural Information Processing Systems 27 (NIPS) spotlight, 2014.
[PDF][Project Page][Demo]
Bolei Zhou, Liu Liu, Aude Oliva and Antonio Torralba
Recognizing City Identity via Attribute Analysis of Geo-tagged Images.
Proceedings of 13th European Conference on Computer Vision (ECCV), 2014.
[PDF][Project Page]
Liu Liu, Bolei Zhou, Jinhua Zhao, Brent D. Ryan
C-IMAGE: City Cognitive Mapping through Geo-tagged Photos
GeoJournal, Springer, 2016.
[PDF]
Bolei Zhou, Xiaoou Tang, Hepeng Zhang and Xiaogang Wang
Measuring Crowd Collectiveness.
IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 2014.
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) oral, 2013.
[PDF(CVPR)][PDF(TPAMI)][Project Page]
Bolei Zhou, Xiaoou Tang and Xiaogang Wang.
Learning Collective Crowd Behaviors with Dynamic Pedestrian-Agents.
International Journal of Computer Vision (IJCV), 2014.
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) oral, 2012.
[PDF(CVPR)] [PDF(IJCV)][Project Page]
Bolei Zhou, Xiaoou Tang and Xiaogang Wang.
Coherent Filtering: Detecting Coherent Motions from Crowd Clutters.
In Proceedings of 12th European Conference on Computer Vision (ECCV), 2012.
[PDF] [Project Page]
Bolei Zhou, Xiaogang Wang and Xiaoou Tang.
Random Field Topic Model for Semantic Region Analysis in Crowded Scenes from Tracklets.
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2011.
[PDF][Project Page]
Go to Google Scholar for full publication list

Honors

Media coverage

Datasets & Benchmarks

Open-source software

Professional activities

Talks

Collaborators

Personal interests