Yichen Li

I am a final year Ph.D. student at MIT CSAIL, advised by Prof. Antonio Torralba. My currently focus on multimodal model, video world model, and reinforcement learning, while I also have research background in physics simulation, optimization, and 3D vision.

World Model: multimdoal model & video generative model.
Post-training: Scalable post-training method, RL, ES.

Before coming to MIT, I have been fortunate to worked with Prof. Leonidas Guibas and Prof. Gordon Wetzstein at Stanford. I also worked with Prof.Emily Whiting and Prof. Kate Saenko.

Email: yichenl [at] mit [dot] edu

Google Scholar / Twitter / GitHub

Photo credit: Jiayuan Mao

News

[new] [June, 2025] We are hosting the first Multimodal MultiSensory Robotics workshop at RSS 2025.
[new] [May, 2025] Our paper MultiModal Action Conditioned Video Generation is accepted to ICCV2025.

Blog

ESES: Efficient and Stable Evolutionary RL for LLM Post-training
Nov 23, 2025

Evolutionary algorithm for language model post-training is sample inefficient compared to gradient based approaches, (e.g. traditional RL and SFT). We explore ways to more efficient evolutionary RL, focusing on Low Rank parameter updates and inherent Quantization.

VARL: Reinforcing Video Autoregressive Generation
Nov 28, 2025

Autoregressive block diffusion based video generation is prohibitively expensive to rollout efficiently for post-training. We explore ways to densifying reward backpropagation over the sequence length by generating diverse shorter block sequences, compute rewards, then randomly extend a subset for reward recomputation.

Publication

	MultiModal Action Conditioned Video Generation Yichen Li, Antonio Torralba, ICCV, 2025 [paper] [project page] [code]
	Generalized Dynamics Generation towards Physical World Model Yichen Li, Zhiyi Li, Brandon Feng, Antonio Torralba, Preprint, 2025 [paper] [project page]
	Learning to Jointly Understand Visual and Tactile Signals Yichen Li, Yilun Du, Chao Liu, Chao Liu, Mike Foshey, Francis Williams, Joshua B. Tenenbaum, Wojciech Matusik, Antonio Torralba, ICLR , 2024 [paper] [project page] [dataset]
	Category-Level Multi-Part Multi-Joint 3D Shape Assembly Yichen Li, Kaichun Mo, Yueqi Duan, He Wang, Jiequan Zhuang, Lin Shao, Wojciech Matusik, Leonidas Guibas, CVPR , 2024 [paper] [project page] [data] [code]
	Learning Preconditioners for Conjugate Gradient PDE Solver Yichen Li, Peter Yichen Chen, Tao Du, Wojciech Matusik ICML, 2023 [paper] [video] [project page] [code]
	ASAP: Automated Sequence Planning for Complex Assembly with Physical Feasibility Yunsheng Tian, Karl D.D. Willis, Bassel Al Omari, Jieliang Luo, Pingchuan Ma Yichen Li, Farhad Javid Edward Gu Joshua Jacob Shinjiro Sueda, Hui Li, Sachin Chitta, Wojciech Matusik ICRA, 2024 [paper] [project page] [dataset] [code]
	Assemble Them All: Physics-Based Planning for Generalizable Assembly by Disassembly Yunsheng Tian, Jie Xu Yichen Li, Jieliang Luo, Shinjiro Sueda, Hui Li, Karl D.D. Willis, Wojciech Matusik Siggraph Asia, 2022 [paper] [project page] [code]
	3D Part Assembly from A Single Image Yichen Li, Kaichun Mo, Lin Shao, Minhyuk Sung, Leonidas Guibas, ECCV, 2020 [paper] [project page] [code]
	Domain2Vec: Domain Embedding for Unsupervised Domain Adaptation Xingchao Peng, Yichen Li, Kate Saenko, ECCV, 2020 [paper] [project page] [code]
	Revisiting Image-Language for Open-ended Phrase Detection Bryan Plummer, Kevin Shih, Yichen Li, Ke Xu, Svetlana Lazebnik, Stan Sclaroff, Kate Saenko TPAMI, 2019 [paper]

Academic Services

Conference Reviewer: CVPR, ICCV, ECCV, ICML, ICLR,NeurIPS, ACM SIGGRAPH, Journal Reviewer: ACM TOG, IEEE-TPAMI,

Teaching
	CS231N:Convolutional Neural Networks for Visual Recognition (Spring 2021) Course Assistant (CA)
	CS468: Geometric Algorithms: Non-Euclidean Methods (Fall 2020) Course Assistant (CA)
	6.S898: Deep Learning (Fall 2023) Course Assistant (CA)

Professional Experience

Research Intern at NVIDIA Summer 2023
Research Intern at Adobe Summer 2021
Research Intern at NVIDIA Summer 2020

Awards

Robert J. Shillman Fellowship
College Prize For Excellence in Computer Science (GPA Rank: 1st)

This guy makes a nice webpage.