panbowen0607 [at] gmail [dot] com
Bowen Pan
I am a research scientist on the Apple Foundation Model (AFM) team, where I work on omni-modal model training.
I completed my Ph.D. and M.Sc. at MIT CSAIL. My Ph.D. thesis focuses on efficient algorithms for the training and inference of multimodal agents. Prior to that, I obtained my B.E. from Shanghai Jiao Tong University.
Publications
[Full list]
*: equal contribution
HoloAssist: an Egocentric Human Interaction Dataset for Interactive AI Assistants in the Real World
ICCV 2023
Egocentric Vision 2022/2023 Distinguished Paper Award
Multi-Moments in Time: Learning and Interpreting Models for Multi-Action Video Understanding
TPAMI 2021
Argoverse 2.0: Next Generation Datasets for Self-Driving Perception and Forecasting
NeurIPS 2021 (Track on Datasets and Benchmarks)
IA-RED2: Interpretability-Aware Redundancy Reduction for Vision Transformer
NeurIPS 2021
Misc
In my spare time, I play soccer and go to the gym.