|
Yichen Li
I am a final year Ph.D. student at MIT CSAIL, advised by Prof. Antonio Torralba.
My currently focus on multimodal model, video world model, and reinforcement learning, while I also have research background in physics simulation, optimization, and 3D vision.
- World Model: multimdoal model & video generative model.
- Post-training: Scalable post-training method, RL, ES.
Before coming to MIT, I have been fortunate to worked with Prof. Leonidas Guibas and Prof. Gordon Wetzstein at Stanford. I also worked with Prof.Emily Whiting and Prof. Kate Saenko.
I am on the industry job market. Reach out if I can contribute.
Email: yichenl [at] mit [dot] edu
Google Scholar  / 
Twitter / 
GitHub
|
Photo credit: Jiayuan Mao
|
|
ESES: Efficient and Stable Evolutionary RL for LLM Post-training
Nov 23, 2025
Evolutionary algorithm for language model post-training is sample inefficient compared to gradient based approaches, (e.g. traditional RL and SFT). We explore ways to more efficient evolutionary RL, focusing on Low Rank parameter updates and inherent Quantization.
|
|
VARL: Reinforcing Video Autoregressive Generation
Nov 28, 2025
Autoregressive block diffusion based video generation is prohibitively expensive to rollout efficiently for post-training. We explore ways to densifying reward backpropagation over the sequence length by generating diverse shorter block sequences, compute rewards, then randomly extend a subset for reward recomputation.
|
|
|
MultiModal Action Conditioned Video Generation
Yichen Li,
Antonio Torralba,
ICCV,
2025
[paper]  
[project page]  
[code]
|
|
|
Generalized Dynamics Generation towards Physical World Model
Yichen Li,
Zhiyi Li,
Brandon Feng,
Antonio Torralba,
Preprint,
2025
[paper]  
[project page]  
|
|
|
Learning to Jointly Understand Visual and Tactile Signals
Yichen Li,
Yilun Du,
Chao Liu,
Chao Liu,
Mike Foshey,
Francis Williams,
Joshua B. Tenenbaum,
Wojciech Matusik,
Antonio Torralba,
ICLR ,
2024
[paper]  
[project page]  
[dataset]  
|
|
|
Category-Level Multi-Part Multi-Joint 3D Shape Assembly
Yichen Li,
Kaichun Mo,
Yueqi Duan,
He Wang,
Jiequan Zhuang,
Lin Shao,
Wojciech Matusik,
Leonidas Guibas,
CVPR ,
2024
[paper]  
[project page]  
[data]  
[code]  
|
|
|
Learning Preconditioners for Conjugate Gradient PDE Solver
Yichen Li,
Peter Yichen Chen,
Tao Du,
Wojciech Matusik
ICML,
2023
[paper]  
[video]  
[project page]  
[code]  
|
|
|
ASAP: Automated Sequence Planning for Complex Assembly with Physical Feasibility
Yunsheng Tian,
Karl D.D. Willis,
Bassel Al Omari,
Jieliang Luo,
Pingchuan Ma
Yichen Li,
Farhad Javid
Edward Gu
Joshua Jacob
Shinjiro Sueda,
Hui Li,
Sachin Chitta,
Wojciech Matusik
ICRA,
2024
[paper]  
[project page]  
[dataset]
[code]  
|
|
|
Assemble Them All: Physics-Based Planning for Generalizable Assembly by Disassembly
Yunsheng Tian,
Jie Xu
Yichen Li,
Jieliang Luo,
Shinjiro Sueda,
Hui Li,
Karl D.D. Willis,
Wojciech Matusik
Siggraph Asia, 2022
[paper]  
[project page]  
[code]  
|
|
|
3D Part Assembly from A Single Image
Yichen Li*,
Kaichun Mo*,
Lin Shao,
Minhyuk Sung,
Leonidas Guibas,
ECCV, 2020
[paper]  
[project page]  
[code]  
|
|
|
Domain2Vec: Domain Embedding for Unsupervised Domain Adaptation
Xingchao Peng*,
Yichen Li*,
Kate Saenko,
ECCV, 2020
[paper]  
[project page]  
[code]  
|
|
|
Revisiting Image-Language for Open-ended Phrase Detection
Bryan Plummer,
Kevin Shih,
Yichen Li,
Ke Xu,
Svetlana Lazebnik,
Stan Sclaroff,
Kate Saenko
TPAMI, 2019
[paper]  
|
Professional Experience
- Research Intern at NVIDIA Summer 2023
- Research Intern at Adobe Summer 2021
- Research Intern at NVIDIA Summer 2020
|
Awards
- Robert J. Shillman Fellowship
- College Prize For Excellence in Computer Science (GPA Rank: 1st)
|
|