Bolei Zhou

Ph.D. Candidate at MIT
Office: 32-D475B
Email: bolei@mit.edu
Google Scholar | GitHub | LinkedIn | Zhihu

About Me

• I am a fifth-year Ph.D. candidate in the Computer Science and Artificial Intelligence Laboratory (CSAIL) at MIT, advised by Prof. Antonio Torralba. I received an M.Eng. in Information Engineering from CUHK in July 2012 and a B.Eng. in Biomedical Engineering from SJTU in June 2010.
• My research is on computer vision and machine learning. I am particularly interested in visual recognition and in interpretable AI systems, that is, in understanding what is going on and what has been learned inside complex 'black-box' models such as deep neural networks.

Updates

[2017/09/04] The Places365-CNN demo has been updated to predict scene categories, attributes, and the class activation map together. Source code in PyTorch is available.
[2017/08/27] Videos and slides of the CVPR'17 Tutorial on Deep Learning for Objects and Scenes are available.
[2017/07/06] I gave an invited talk at the ICML'17 Workshop on Visualization for Deep Learning on interpreting deep visual representations. Here are the slides.
[2017/07/01] MIT News and TechCrunch cover our Network Dissection work.
[2017/07/01] The journal extension of the Places Database has been accepted to IEEE PAMI.
[2017/06/20] You are welcome to participate in the Places Challenge 2017. This year the challenge is held in conjunction with COCO at ICCV 2017.
[2017/06/20] I am organizing the Joint Workshop for COCO and Places Challenge at ICCV'17.
[2017/06/20] The 5th Scene Understanding Workshop (SUNw) at CVPR'17 will be held on July 26, 2017. The schedule and the list of accepted extended abstracts are online.

Selected Publications

Bolei Zhou*, David Bau*, Aude Oliva, and Antonio Torralba.
Interpreting Deep Visual Representations via Network Dissection.
Under review for TPAMI; arXiv, Sept 2017. * indicates equal contribution.
[PDF][Webpage][Code]
Bolei Zhou, Agata Lapedriza, Aditya Khosla, Aude Oliva, and Antonio Torralba.
Places: A 10 Million Image Database for Scene Recognition.
IEEE Transactions on Pattern Analysis and Machine Intelligence, July 2017.
[PDF][Places2 Dataset][Challenge Page][Places365 CNN models][Demo]
Yikang Li, Nan Duan, Bolei Zhou, Xiao Chu, Wanli Ouyang, and Xiaogang Wang
Visual Question Generation as Dual Task of Visual Question Answering.
arXiv:1709.07192, 2017.
[arXiv]
Yikang Li, Wanli Ouyang, Bolei Zhou, Kun Wang, and Xiaogang Wang
Scene Graph Generation from Objects, Phrases and Region Captions.
International Conference on Computer Vision (ICCV), 2017.
[PDF][Code]
Hang Zhao, Xavier Puig, Bolei Zhou, Sanja Fidler, and Antonio Torralba
Open Vocabulary Scene Parsing.
International Conference on Computer Vision (ICCV), 2017 (arXiv:1703.08769).
[PDF][arXiv]
Bolei Zhou, Hang Zhao, Xavier Puig, Sanja Fidler, Adela Barriuso and Antonio Torralba.
Scene Parsing through ADE20K Dataset.
Computer Vision and Pattern Recognition (CVPR), 2017.
[PDF][Dataset][Benchmark Page][Challenge Page][Toolkit&Code][Demo]
David Bau*, Bolei Zhou*, Aditya Khosla, Aude Oliva, and Antonio Torralba.
Network Dissection: Quantifying Interpretability of Deep Visual Representations.
Computer Vision and Pattern Recognition (CVPR), 2017, oral. * indicates equal contribution.
[PDF][arXiv][webpage][code][Talk Video]
Shuang Li, Tong Xiao, Hongsheng Li, Bolei Zhou, Dayu Yue, and Xiaogang Wang.
Person Search with Natural Language Description.
Computer Vision and Pattern Recognition (CVPR), 2017.
[PDF][Dataset]
J. Wong, V. Kee, T. Le, S. Wagner, G. Mariottini, A. Schneider, L. Hamilton, R. Chipalkatty, M. Hebert, D. Johnson, J. Wu, B. Zhou, and A. Torralba.
SegICP: Integrated Deep Semantic Segmentation and Pose Estimation.
IEEE International Conference on Intelligent Robots and Systems (IROS), 2017, oral (arXiv:1703.01661).
[PDF]
Bolei Zhou, Hang Zhao, Xavier Puig, Sanja Fidler, Adela Barriuso and Antonio Torralba.
Semantic Understanding of Scenes through ADE20K Dataset.
arXiv:1608.05442, 2016.
[PDF][Dataset][Benchmark Page][Challenge Page][Toolkit&Code][Demo]
Bolei Zhou, Aditya Khosla, Agata Lapedriza, Aude Oliva, and Antonio Torralba
Learning Deep Features for Discriminative Localization.
Computer Vision and Pattern Recognition (CVPR), 2016 (arXiv:1512.04150)
[PDF][arXiv][Project Page][Video of CNN shifting its attention]
Donglai Wei, Bolei Zhou, Antonio Torralba, William Freeman
Understanding Intra-Class Knowledge inside CNN.
arXiv:1507.02379, 2015.
[PDF][Page][Code]
Bolei Zhou, Yuandong Tian, Sainbayar Sukhbaatar, Arthur Szlam, Rob Fergus
Simple Baseline for Visual Question Answering.
arXiv:1512.02167, 2015.
[PDF][Demo][Code]
Zi Wang, Bolei Zhou, Stefanie Jegelka
Optimization as Estimation with Gaussian Processes in Bandit Settings.
Artificial Intelligence and Statistics (AISTATS), 2016, oral (arXiv:1510.06423).
[PDF][Project][Code]
Bolei Zhou, Aditya Khosla, Agata Lapedriza, Aude Oliva, and Antonio Torralba
Object Detectors Emerge in Deep Scene CNNs.
International Conference on Learning Representations (ICLR), 2015, oral (arXiv:1412.6856).
[PDF][Project Page][More Visualization][Code]
Bolei Zhou, Vignesh Jagadeesh, and Robinson Piramuthu
ConceptLearner: Discovering Visual Concepts from Weakly Labeled Image Collections.
Computer Vision and Pattern Recognition (CVPR), 2015 (arXiv:1411.5319).
[PDF][Project Page & Demo]
Bolei Zhou, Agata Lapedriza, Jianxiong Xiao, Antonio Torralba, and Aude Oliva
Learning Deep Features for Scene Recognition using Places Database.
Advances in Neural Information Processing Systems 27 (NIPS) spotlight, 2014.
[PDF][Project Page][Demo]
Bolei Zhou, Liu Liu, Aude Oliva and Antonio Torralba
Recognizing City Identity via Attribute Analysis of Geo-tagged Images.
Proceedings of the 13th European Conference on Computer Vision (ECCV), 2014.
[PDF][Project Page]
Liu Liu, Bolei Zhou, Jinhua Zhao, Brent D. Ryan
C-IMAGE: City Cognitive Mapping through Geo-tagged Photos.
GeoJournal, Springer, 2016.
[PDF]
Bolei Zhou, Xiaoou Tang, Hepeng Zhang and Xiaogang Wang
Measuring Crowd Collectiveness.
IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 2014.
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) oral, 2013.
[PDF(CVPR)][PDF(TPAMI)][Project Page]
Bolei Zhou, Xiaoou Tang and Xiaogang Wang.
Learning Collective Crowd Behaviors with Dynamic Pedestrian-Agents.
International Journal of Computer Vision (IJCV), 2014.
IEEE Conference on Computer Vision and Pattern Recognition (CVPR) oral, 2012.
[PDF(CVPR)][PDF(IJCV)][Project Page]
Bolei Zhou, Xiaoou Tang and Xiaogang Wang.
Coherent Filtering: Detecting Coherent Motions from Crowd Clutters.
Proceedings of the 12th European Conference on Computer Vision (ECCV), 2012.
[PDF][Project Page]
Bolei Zhou, Xiaogang Wang and Xiaoou Tang.
Random Field Topic Model for Semantic Region Analysis in Crowded Scenes from Tracklets.
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2011.
[PDF][Project Page]
Click here for the full publication list

Datasets & Benchmarks

Open-source software

Honors

Professional activities

Talks

Media coverage

Collaborators

Personal interests