All Publications
The Event Horizon Telescope Image of the Quasar NRAO 530
The Astrophysical Journal, 2023
Can Shadows Reveal Biometric Information?
Winter Conference on Applications of Computer Vision (WACV) 2023
Structure and motion from casual videos
European Conference on Computer Vision (ECCV) 2022
Disentangling architecture and training for optical flow
European Conference on Computer Vision (ECCV) 2022
Resolving the Inner Parsec of the Blazar J1924–2914 with the Event Horizon Telescope
The Astrophysical Journal, 2022
Maskgit: Masked generative image transformer
Conference on Computer Vision and Pattern Recognition (CVPR) 2022
A universal power-law prescription for variability from synthetic images of black hole accretion flows
The Astrophysical Journal Letters, 2022
Characterizing and mitigating intraday variability: reconstructing source structure in accreting black holes with mm-VLBI
The Astrophysical Journal Letters, 2022
First Sagittarius A* Event Horizon Telescope results. I. The shadow of the supermassive black hole in the center of the Milky Way
The Astrophysical Journal Letters, 2022
First Sagittarius A* Event Horizon Telescope results. I. The shadow of the supermassive black hole in the center of the Milky Way
The Astrophysical Journal Letters, 2022
First Sagittarius A* Event Horizon Telescope results. II. EHT and multiwavelength observations, data processing, and calibration
The Astrophysical Journal Letters, 2022
First Sagittarius A* event horizon telescope results. III. Imaging of the galactic center supermassive black hole
The Astrophysical Journal Letters, 2022
First Sagittarius A* Event Horizon Telescope results. IV. Variability, morphology, and black hole mass
The Astrophysical Journal Letters, 2022
First Sagittarius A* event horizon telescope results. VI. Testing the black hole metric
The Astrophysical Journal Letters, 2022
Millimeter Light Curves of Sagittarius A* Observed during the 2017 Event Horizon Telescope Campaign
The Astrophysical Journal Letters, 2022
Unsupervised semantic segmentation by distilling feature correspondences
International Conference on Learning Representations (ICLR) 2022
Nerfactor: Neural factorization of shape and reflectance under an unknown illumination
ACM Transactions on Graphics (TOG), 2021
Light field networks: Neural scene representations with single-evaluation rendering
Advances in Neural Information Processing Systems (NeurIPS) 2021
Explaining in style: Training a gan to explain a classifier in stylespace
International Conference on Computer Vision (ICCV) 2021
Slide: Single image 3d photography with soft layering and depth-aware inpainting
International Conference on Computer Vision (ICCV) 2021
Thundr: Transformer-based 3d human reconstruction with markers
International Conference on Computer Vision (ICCV) 2021
What you can learn by staring at a blank wall
International Conference on Computer Vision (ICCV) 2021
Toward Automatic Interpretation of 3D Plots
Document Analysis and Recognition–ICDAR 2021: 16th International Conference
MosAIc: Finding Artistic Connections across Culture with Conditional Image Retrieval
NeurIPS 2020 Competition and Demonstration Track
Quantaichi: a compiler for quantized simulations
ACM Transactions on Graphics (TOG), 2021
Autoflow: Learning a better training set for optical flow
Conference on Computer Vision and Pattern Recognition (CVPR) 2021
Lasr: Learning articulated shape reconstruction from a monocular video
Conference on Computer Vision and Pattern Recognition (CVPR) 2021
Neural Descent for Visual 3D Human Pose and Shape
Conference on Computer Vision and Pattern Recognition (CVPR) 2021
Omnimatte: associating objects and their effects in video
Conference on Computer Vision and Pattern Recognition (CVPR) 2021
First M87 event horizon telescope results. VII. Polarization of the ring
The Astrophysical Journal Letters, 2021
Polarimetric properties of event horizon telescope targets from ALMA
The Astrophysical Journal Letters, 2021
Axiomatic Explanations for Visual Search, Retrieval, and Similarity Learning
International Conference on Learning Representations (ICLR) 2022
Neural light transport for relighting and view synthesis
ACM Transactions on Graphics (TOG), 2021
Large-scale intelligent microservices
IEEE International Conference on Big Data (Big Data) 2020
Multi-plane program induction with 3d box priors
Advances in Neural Information Processing Systems (NeurIPS) 2020
Two-Dimensional Non-Line-of-Sight Scene Estimation From a Single Edge Occluder
IEEE Transactions on Computational Imaging, 2020
Weakly Supervised 3D Human Pose and Shape Reconstruction with Normalizing Flows
European Conference on Computer Vision (ECCV), 2020
GHUM & GHUML: Generative 3D Human Shape and Articulated Pose Models
Conference on Computer Vision and Pattern Recognition (CVPR), 2020
SpeedNet: Learning the Speediness in Videos
Conference on Computer Vision and Pattern Recognition (CVPR), 2020
Perspective Plane Program Induction From a Single Image
Computer Vision and Pattern Recognition(CVPR), 2020
Semantic Pyramid for Image Generation
Computer Vision and Pattern Recognition (CVPR), 2020
Layered Neural Rendering for Retiming People in Video
ACM Transactions on Graphics, 2020
Deep audio priors emerge from harmonic convolutional networks
International Conference on Learning Representations (ICLR) 2020
MannequinChallenge: Learning the Depths of Moving People by Watching Frozen People
IEEE Transactions on Pattern Analysis and Machine Intelligence, 2020
Visual Deprojection: Probabilistic Recovery of Collapsed Dimensions
International Conference on Computer Vision (ICCV) 2019
Boundless: Generative adversarial networks for image extension
IEEE International Conference on Computer Vision(ICCV), 2019
Learning shape templates with structured implicit functions
IEEE International Conference on Computer Vision(ICCV), 2019
Program-Guided Image Manipulators
IEEE International Conference on Computer Vision (ICCV), 2019
Using unknown occluders to recover hidden scenes
IEEE Conference on Computer Vision and Pattern Recognition(CVPR), 2019
Reasoning about physical interactions with object-centric models
International Conference on Learning Representations (ICLR), 2019
Deep Audio Priors Emerge From Harmonic Convolutional Networks
International Conference on Learning Representations (ICLR), 2019
Learning the Depths of Moving People by Watching Frozen People
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019
Best Paper Honorable Mention.
Speech2Face: Learning the Face Behind a Voice
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019
Using Unknown Occluders to Recover Hidden Scenes
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019
Visual Dynamics: Stochastic Future Generation via Layered Cross Convolutional Networks
IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 2019
Corner Occluder Computational Periscopy: Estimating a Hidden Scene from a Single Photograph
IEEE International Conference on Computational Photography (ICCP), 2019
First M87 event horizon telescope results. IV. Imaging the central supermassive black hole
The Astrophysical Journal Letters, 2019
Learning to Infer and Execute 3D Shape Programs
International Conference on Learning Representations (ICLR), 2019
Reasoning About Physical Interactions with Object-Centric Models
International Conference on Learning Representations (ICLR), 2019
GAN Dissection: Visualizing and Understanding Generative Adversarial Networks
International Conference on Learning Representations (ICLR), 2019
Unsupervised Discovery of Parts, Structure, and Dynamics
International Conference on Learning Representations (ICLR), 2019
Learning to Describe Scenes with Programs
International Conference on Learning Representations (ICLR), 2019
ChainQueen: A Real-Time Differentiable Physical Simulator for Soft Robotics
IEEE Conference on Robotics and Automation (ICRA), 2019
Video Enhancement with Task-Oriented Flow
International Journal of Computer Vision (IJCV), 2019
Learning to Reconstruct Shapes from Unseen Classes
Neural Information Processing Systems (NeurIPS), 2018. Oral presentation.
3D-Aware Scene Manipulation via Inverse Graphics
Neural Information Processing Systems (NeurIPS), 2018
Learning to Exploit Stability for 3D Scene Parsing
Neural Information Processing Systems (NeurIPS), 2018
Visual Object Networks: Image Generation with Disentangled 3D Representations
Neural Information Processing Systems (NeurIPS), 2018
ShadowCam: Real-Time Detection of Moving Obstacles Behind A Corner For Autonomous Vehicles
International Conference on Intelligent Transportation Systems (ITSC), 2018
MoSculp: Interactive Visualization of Shape and Time
ACM Symposium on User Interface Software and Technology (UIST), 2018
3D Shape Perception from Monocular Vision, Touch, and Shape Priors
IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), 2018
Learning Shape Priors for Single-View 3D Completion and Reconstruction
European Conference on Computer Vision (ECCV), 2018
Physical Primitive Decomposition
European Conference on Computer Vision (ECCV), 2018
Learning-based Video Motion Magnification
European Conference on Computer Vision (ECCV), 2018. Oral presentation.
Seeing Tree Structure from Vibration
European Conference on Computer Vision (ECCV), 2018
Best-buddies similarity—robust template matching using mutual nearest neighbors
IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), 2018
Unsupervised Training for 3D Morphable Model Regression
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018
Inferring Light Fields from Shadows
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018. Spotlight presentation.
Learning and Using the Arrow of Time
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018
Pix3D: Dataset and Methods for Single-Image 3D Shape Modeling
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018
Smart, Sparse Contours to Represent and Edit Images
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2018
Exploiting Occlusion in Non-Line-of-Sight Active Imaging
IEEE Transactions on Computational Imaging, 2018
Looking to Listen at the Cocktail Party: A Speaker-Independent Audio-Visual Model for Speech Separation
SIGGRAPH, 2018
Cognitive Load Estimation in the Wild
CHI Conference on Human Factors in Computing Systems, 2018
3D Interpreter Networks for Viewer-Centered Wireframe Modeling
International Journal of Computer Vision (IJCV), 2018
Learning Sight from Sound: Ambient Sound Provides Supervision for Visual Learning
Interactional Journal of Computer Vision (IJCV), 2018
Reconstructing Video of Time-Varying Sources from Radio Interferometric Measurements
IEEE Transactions on Computational Imaging, 2018
Learning to See Physics via Visual De-animation
Neural Information Processing Systems (NIPS), 2017. Spotlight presentation.
Shape and Material from Sound
Neural Information Processing Systems (NIPS), 2017. Spotlight presentation.
Marrnet: 3D shape reconstruction via 2.5D sketches
Neural Information Processing Systems (NIPS), 2017
Turning Corners into Cameras: Principles and Methods
International Conference on Computer Vision (ICCV), 2017
Generative modeling of audible shapes for object perception
International Conference on Computer Vision (ICCV), 2017
Motion microscopy for visualizing and quantifying small motions
Proceedings of the National Academy of Sciences (PNAS), 2017
Guest Editorial Special Issue on Extreme Imaging
IEEE Transactions on Computational Imaging 3 (3), 382-383
3DTV at home: eulerian-lagrangian stereo-to-multiview conversion
ACM Transactions on Graphics (TOG) 36 (4), 146 (SIGGRAPH) July, 2017
Synthesizing normalized faces from facial identity features
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017
On the Effectiveness of Visible Watermarks
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2017
Population based image imputation
International Conference on Information Processing in Medical Imaging (IPMI), 2017. Best Poster Award.
Eulerian Video Magnification and Analysis
Communications of the ACM, Vol. 60 No. 1, Pages 87-95, January 2017
Learning a Probabilistic Latent Space of Object Shapes via 3D Generative-Adversarial Modeling
Neural Information Processing Systems (NIPS), 2016
Visual Dynamics: Probabilistic Future Frame Synthesis via Cross Convolutional Networks
Neural Information Processing Systems (NIPS), 2016. Oral presentation.
Video Camera–Based Vibration Measurement for Civil Infrastructure Applications
Journal of Infrastructure Systems, Vol 3 (2)
Interactive Visualization of Spatially Amplified GNSS Time‐Series Position Fields
Seismological Research Letters, Vol 88(1), pp 126-130, 2016
Observing—and imaging—active galactic nuclei with the Event Horizon Telescope
Galaxies 4(4) p. 54, 2016
Ambient Sound Provides Supervision for Visual Learning
European Conference on Computer Vision (ECCV), 2016. Oral presentation.
Single Image 3D Interpreter Network
European Conference on Computer Vision (ECCV), 2016. Oral presentation.
Physics 101: Learning Physical Object Properties from Unlabeled Videos
British Machine Vision Conference (BMVC), 2016.
A Comparative Evaluation of Approximate Probabilistic Simulation and Deep Neural Networks as Accounts of Human Physical Scene Understanding
Annual Meeting of the Cognitive Science Society (CogSci), 2016. Oral presentation.
Visually Indicated Sounds
Computer Vision and Pattern Recognition (CVPR) 2016
Computational Imaging for VLBI Image Reconstruction
Computer Vision and Pattern Recognition (CVPR) 2016
Galileo: Perceiving Physical Object Properties by Integrating a Physics Engine with Deep Learning
Neural Information Processing Systems, 127-135 (NIPS), 2015
Deviation magnification: revealing departures from ideal geometries
ACM Transactions on Graphics (TOG) 34 (6), 226 (SIGGRAPH Asia), 2015
Revealing and modifying non-local variations in a single image
ACM Transactions on Graphics (TOG) 34 (6), 227 (SIGGRAPH Asia), 2015
A computational approach for obstruction-free photography
ACM Transactions on Graphics (TOG) 34 (4), (SIGGRAPH), 2015
Modal identification of simple structures with high-speed video using motion magnification
Journal of Sound and Vibration vol 345, pages 58-71, 2015
Best-Buddies Similarity for Robust Template Matching
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015
Reflection Removal using Ghosting Cues
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015
The Aperture Problem for Refractive Motion
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015
Video Magnification in Presence of Large Motions
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015
Visual Vibrometry: Estimating Material Properties from Small Motions in Video
Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015
Developments with Motion Magnification for Structural Modal Identification Through Camera Video
Dynamics of Civil Structures, Volume 2, pages 49-57, 2015
Refraction Wiggles for Measuring Fluid Depth and Velocity from Video
European Conference on Computer Vision (ECCV), 2014
Style Transfer for Headshot Portraits
ACM Transactions on Graphics (SIGGRAPH), 2014
The Visual Microphone: Passive Recovery of Sound from Video
ACM Transactions on Graphics, Volume 33, Number 4 (Proc. SIGGRAPH), 2014.
Camouflaging an Object from Many Viewpoints
IEEE Computer Vision and Pattern Recognition (CVPR), 2014
Seeing the Arrow of Time
IEEE Computer Vision and Pattern Recognition (CVPR), 2014
A Compositional Model for Low-Dimensional Image Set Representation
IEEE Computer Vision and Pattern Recognition (CVPR), 2014
Rethinking color cameras
IEEE International Conference on Computational Photography (ICCP), 2014.
Accidental Pinhole and Pinspeck Cameras
International Journal of Computer Vision
110 (2), 92-112
Riesz Pyramids for Fast Phase-Based Video Magnification
International Conference on Computational Photography (ICCP), 2014.
Structural modal identification through high speed camera video
Topics in Modal Analysis I, Volume 7, pages 191-197, Springer International Publishing, 2014.
Fabricating BRDFs at High Spatial Resolution Using Wave Optics
ACM Transactions on Graphics, Volume 32, Number 4 (Proc. SIGGRAPH) 2013
Phase-based Video Motion Processing
ACM Transactions on Graphics, Volume 32, Number 4 (Proc. SIGGRAPH) 2013
Estimating the Material Properties of Fabric from Video
2013 IEEE International Conference on Computer Vision (ICCV)
Group Norm for Learning Structured SVMs with Unstructured Latent Variables
2013 IEEE International Conference on Computer Vision (ICCV)
Shape Anchors for Data-Driven Multi-view Reconstruction
International Conference on Computer Vision (ICCV), 2013
Data-driven Hallucination of Different Times of Day from a Single Outdoor Photo
ACM Transactions on Graphics (SIGGRAPH Asia) 2013
Eulerian Video Magnification for Revealing Subtle Changes in the World
ACM Transactions on Graphics, Volume 31, Number 4 (Proc. SIGGRAPH) 2012
Towards Longer Long-Range Motion Trajectories
British Machine Vision Conference (BMVC) 2012
Annotation Propagation in Large Image Databases via Dense Image Correspondence
European Conference on Computer Vision (ECCV), October 2012
Patch Complexity, Finite Pixel Correlations and Optimal Denoising
European Conference on Computer Vision (ECCV), October 2012
Shapecollage: Occlusion-Aware, Example-Based Shape Interpretation
European Conference on Computer Vision (ECCV), October 2012
Exploiting compositionality to explore a large space of model structures
Conf. on Uncertainty in Artificial Intelligence (UAI), August 2012
Best Student Paper Prize
Accidental pinhole and pinspeck cameras: revealing the scene outside the picture
IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), June 2012
Laser Speckle Photography for Surface Tampering Detection
IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), June 2012
CG2Real: Improving the Realism of Computer Generated Images using a Large Collection of Photographs
IEEE Transactions on Visualization and Computer Graphics (IEEE TVCG) 2011
Diffuse Reflectance Imaging with Astronomical Applications
IEEE Intl. Conf. on Computer Vision (ICCV), 2011
Evaluation of Image Features Using a Photorealistic Virtual World
IEEE Intl. Conf. on Computer Vision (ICCV), 2011
A Perfect Match (technical perspective)
Communications of the ACM, November, 2011, vol. 54, no. 11
Blur Kernel Estimation Using the Radon Transform
Proc. 23rd IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2011
Efficient Marginal Likelihood Optimization in Blind Deconvolution
IEEE Conf. on Computer Vision and Pattern Recognition, June 2011
Motion Denoising with Application to Time-lapse Photography
Proc. 23rd IEEE Conference on Computer Vision and Pattern Recognition, CVPR 2011
Where computer vision needs help from computer science
ACM-SIAM Symposium on Discrete Algorithms, January, 2011
Infinite Images: Creating and Exploring a Large Photorealistic Virtual Space
Proceedings of the IEEE, volume 98, issue 8, pages 1391 - 1407, 2010
Matching and Predicting Street Level Images
Workshop for Vision on Cognitive Tasks, European Conf. on Computer Vision (ECCV) 2010
Motion blur removal with orthogonal parabolic exposures
IEEE Intl. Conf. on Computational Photography (ICCP), 2010
Search-and-Replace Editing for Personal Photo Collections
IEEE Intl. Conf. on Computational Photography (ICCP), 2010
The Patch Transform
IEEE Transactions on Pattern Analysis and Machine Intelligence (PAMI), vol. 32, issue 8, pages 1489 - 1501, August, 2010
A Content-Aware Image Prior
IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), June 2010
A Probabilistic Image Jigsaw Puzzle Solver
IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), June 2010
Analyzing Spatially-varying Blur
IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), June 2010
Latent Hierarchical Structural Learning for Object Detection
IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), June 2010
Noise-Optimal Capture for High Dynamic Range Photography
IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), June 2010
Part and Appearance Sharing: Recursive Compositional Models for Multi-View Multi-Object Detection
IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), June 2010
Using the Forest to See the Trees: Exploiting Context for Visual Object Detection and Localization
Communications of the ACM, March 2010, Vol. 53, No. 3
Ground-truth dataset and baseline evaluations for intrinsic image algorithms
International Conference on Computer Vision, 2009
Nonparametric Bayesian Texture Learning and Synthesis
Neural Information Processing Systems (NIPS) 2009
Segmenting Scenes by Matching Image Composites
Neural Information Processing Systems (NIPS) 2009
Time-constrained Photography
Proc. 12th IEEE International Conference on Computer Vision (ICCV 2009, oral presentation)
Informative Sensing of Natural Images
IEEE Int. Conf. Image Processing, Egypt, Nov. 2009
4D Frequency Analysis of Computational Cameras for Depth of Field Extension
SIGGRAPH, ACM Transactions on Graphics, Aug 2009
Understanding and evaluating blind deconvolution algorithms
IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), June 2009
Best paper award runner up
LabelMe: a Database and Web-based Tool for Image Annotation
International Journal of Computer Vision, 77(1-3):157-173, 2008
SIFT Flow: Dense Correspondence across Different Scenes
European Conference on Computer Vision, ECCV 2008
Understanding camera trade-offs through a Bayesian analysis of light field projections
European Conference on Computer Vision, ECCV 2008
80 million tiny images: a large dataset for non-parametric object and scene recognition
IEEE Transactions on Pattern Analysis and Machine Intelligence., Volume 30 , Issue 11 (November 2008), Pages: 1958-1970
Motion-Invariant Photography
ACM Transactions on Graphics, 27(3), (Proc. SIGGRAPH), August, 2008
Creating and exploring a large photorealistic virtual space
First IEEE Workshop on Internet Vision, associated with CVPR 2008
Human-assisted motion annotation
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2008
The patch transform and its applications to image editing
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2008
Best Poster Award, CVPR 2008
Unsupervised Discovery of Visual Object Class Hierarchies
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2008
Describing visual scenes using transformed objects and parts
International Journal of Computer Vision, 77, May 2008
Signal and Image Processing with Belief Propagation
DSP Application Column, IEEE Signal Processing Magazine, Mar. 2008
Automatic estimation and removal of noise from a single image
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), Vol 30, No. 2, pp. 299-314, Feb., 2008
A reliable skin mole localization scheme
2007 IEEE Workshop on Mathematical Methods in Biomedical Image Analysis (MMBIA), in conjunction with 2007 ICCV
Image and depth from a conventional camera with a coded aperture
ACM Trans. On Graphics (Proc. SIGGRAPH) 2007
Learning Compressed Sensing
45th Allerton Conference on Communication, Control, and Computing, 2007
Object Recognition by Scene Alignment
Advances in Neural Information Processing Systems (NIPS), 2007
Face Hallucination: theory and practice
International Journal of Computer Vision, Vol. 75, no. 1, pp. 115-134, October, 2007
Learning Gaussian Conditional Random Fields for Low-Level Vision
IEEE Computer Vision and Pattern Recognition (CVPR) 2007
What makes a good model of natural images?
IEEE Computer Vision and Pattern Recognition (CVPR) 2007
Sharing visual features for multiclass and multiview object detection
IEEE Transactions on Pattern Analysis and Machine Intelligence , vol. 29, no. 5, pp. 854-869, May, 2007
Exploring defocus matting: non-parametric acceleration, super-resolution, and off-center matting
IEEE Computer Graphics and Applications, special issue on Computational Photography, March, 2007
Analysis of contour motions
Advances in Neural Information Processing Systems (NIPS 2006)
Received Outstanding Student Paper Award
Bayesian model of human color constancy
Journal of Vision, 6, 1267-1281, doi:10.1167/6.11.10. 2006
Depth from familiar objects: a hierarchical model for 3d scenes
IEEE Conf. on Computer Vision and Pattern Recognition (CVPR) New York, NY, June, 2006
Estimating Intrinsic Component Images using Non-Linear Regression
IEEE Conf. on Computer Vision and Pattern Recognition (CVPR) New York, NY, June, 2006
Noise estimation from a single image
IEEE Conf. on Computer Vision and Pattern Recognition (CVPR) New York, NY, June, 2006
Using multiple segmentations to discover objects and their extent in image collections
IEEE Conf. on Computer Vision and Pattern Recognition (CVPR) New York, NY, June, 2006
Object detection and localization using local and global features
Lecture Notes in Computer Science (unrefeered). Sicily workshop on object recognition, 2005
Shared features for multiclass object detection
Towards Category-Level Object Recognition. Springer Lecture Notes in Computer Science (invited submission). 2005
Describing Visual Scenes using Transformed Dirichlet
Neural Information Processing Systems (NIPS), Vancouver, B.C., Dec. 2005
An Ensemble Prior of Image Structure for Cross-modal Inference
International Conference on Computer Vision (ICCV), Beijing, China, vol. 1, pp. 871-876, Oct. 2005
Discovering Objects and their Location in Images
International Conference on Computer Vision (ICCV), Beijing, China, Oct. 2005
Received 2017 Helmholtz prize, test-of-time award.
Learning Hierarchical Models of Scenes, Objects, and Parts
International Conference on Computer Vision (ICCV), Beijing, China, Oct. 2005
LabelMe: a database and web-based tool for image annotation
MIT AI Lab Memo AIM-2005-025, September, 2005
Recovering Intrinsic Images from a Single Image
IEEE Transactions on Pattern Analysis and Machine Intelligence, Volume 27, Issue 9, September 2005, Pages 1459 – 1472
Constructing Free Energy Approximations and Generalized Belief Propagation Algorithms
IEEE Transactions on Information Theory, ISSN; 0018-9448, Vol. 51, Issue 7, pp. 2282-2312, July 2005
Distributed Occlusion Reasoning for Tracking with Nonparametric Belief Propagation
Neural Information Processing Systems (NIPS) 2004
Efficient multiscale sampling from products of Gaussian mixtures
Advances in Neural Information Processing Systems 16 (NIPS), Vancouver, BC, MIT Press, 2004
Using the forest to see the trees: a graphical model relating features, objects, and scenes
Advances in Neural Information Processing Systems 16 (NIPS), Vancouver, BC, MIT Press, 2004
Contextual Models for Object Detection Using Boosted Random Fields
Neural Information Processing Systems (NIPS), Vancouver, B.C., Dec. 2004
Single-frame Text Super-resolution: A Bayesian Approach
International Conference on Image Processing (ICIP), Oct. 2004
Efficient graphical models for processing images
IEEE Conf. on Computer Vision and Pattern Recognition (CVPR) Washington, DC, 2004
Sharing visual features for multiclass and multiview object detection
IEEE Conf. on Computer Vision and Pattern Recognition (CVPR) Washington, DC, 2004; MIT CSAIL technical report
Visual Hand Tracking Using Nonparametric Belief Propagation
Workshop on Generative Model Based Vision, CVPR, June 2004
Comparison of graph cuts with belief propagation for stereo, using identical MRF parameters
IEEE Intl. Conference on Computer Vision (ICCV), Nice, France, October, 2003
Context-based vision system for place and object recognition
IEEE Intl. Conference on Computer Vision (ICCV), Nice, France, October, 2003
Exploiting spatial and spectral image regularities for color constancy
3rd Intl. Workshop on Statistical and Computational Theories of Vision (associated with Intl. Conf. on Computer Vision), Nice, France, October, 2003
Exploiting the sparse derivative prior for super-resolution and image demosaicing
3rd Intl. Workshop on Statistical and Computational Theories of Vision (associated with Intl. Conf. on Computer Vision), Nice, France, October, 2003
Nonparametric Belief Propagation and Facial Appearance Estimation
IEEE Computer Vision and Pattern Recognition (CVPR), Madison, WI, June, 2003
Properties and Applications of Shape Recipes
IEEE Computer Vision and Pattern Recognition (CVPR), Madison, WI, June, 2003
Shape-Time Photography
IEEE Computer Vision and Pattern Recognition (CVPR), Madison, WI, June, 2003
Learning style translation for the lines of a drawing
ACM Transactions on Graphics, January, 2003
Shape Recipes: Scene Representations that Refer to the Image
Neural Information Processing Systems (NIPS) 2002
Example-based super-resolution
IEEE Computer Graphics and Applications, March/April, 2002.
Test-of-time award given in 2023 from IEEE CG&A.
Generalized Belief Propagation
Neural Information Processing Systems 13, edited by T. K. Leen, T. G. dietterich, and V. Tresp, pp. 689-695, 2001
Learning Joint Statistical Models for Audio-Visual Fusion and Segregation
Advances in Neural Information Processing Systems 13, edited by T. K. Leen, T. G. dietterich, and V. Tresp, pp. 772-778, 2001
Learning local evidence for shading and reflectance
International Conference on Computer Vision, Vancouver, BC, Canada, 2001
Learning Motion Analysis
Statistical Theories of the Brain, edited by R. Rao, B. Olshausen, and M. Lewicki, MIT Press, 2001
On the optimality of solutions of the max-product belief propagation algorithm in arbitrary graphs
IEEE Trans. Information Theory, Special Issue on Codes on Graphs and Iterative Algorithms, 47(2), pp. 723-735, 2001
Teaching applied computing without programming: a case-based introductory course for general education
Proceedings of the thirty-second SIGCSE technical symposium on Computer Science Education, Charlotte, North Carolina, 2001
Understanding belief propagation and its generalizations
International Joint Conference on Artificial Intelligence (IJCAI 2001), Distinguished Papers Track
Bayesian Reconstruction of 3D Human Motion from Single-Camera Video
Advances in Neural Information Processing Systems 12, edited by S. A. Solla, T. K. Leen, and K-R Muller, 2000
Correctness of Belief Propagation in Gaussian Graphical Models of Arbitrary Topology
Advances in Neural Information Processing Systems 12, edited by S. A. Solla, T. K. Leen, and K-R Muller, 2000
Learning Low-Level Vision
International Journal of Computer Vision, 40(1), pp. 25-47, 2000
Separating style and content with bilinear models
Neural Computation 12(6), pp. 1247-1283, 2000
Markov networks for super-resolution
Proceedings of 34th Annual Conference on Information Sciences and Systems (CISS 2000), Dept. Electrical Engineering, Princeton University, Princeton, NJ 08544-5263, March, 2000
Review of "Biometrics: personal identification in a networked society"
Pattern Analysis and Applications, March, 2000
Learning low-level vision
Appeared in IEEE International Conference on Computer Vision, Corfu, Greece, 1999
Artificial retina chips as on-chip image processors and gesture-oriented interfaces
Optical Engineering, Vol. 38, No. 12, December, 1999
Computer vision for computer interaction
SIGGRAPH Computer Graphics magazine, November, 1999
An Inexpensive, All Solid-state Video and Data Recorder for Accident Reconstruction
Presented at the 1999 SAE International Congress and Exposition in Detroit, Michigan on March 3, 1999; published as SAE Technical Paper number 1999-10-1299
An example-based approach to style translation for line drawings
Tech. Rep. TR99-11, Mitsubishi Electric Research Laboratories, Cambridge, MA, February 1999
Markov networks for low-level vision
Presented at Workshop on Statistical and Computational Theories of Vision
Learning to estimate scenes from images
Neural Information Processing Systems, volume 11, 1999
A factorization approach to grouping
Proceedings, European Conference on Computer Vision, 1998
Bayesian model of surface perception
Neural Information Processing Systems, volume 10, pp. 787-793, 1998
Bayesian Estimation of 3-D Human Motion
Tech. Rep. TR98-06, Mitsubishi Electric Research Laboratories, Cambridge, MA, July 1998
Computer vision for interactive computer graphics
IEEE Computer Graphics and Applications, volume 18, number 3, May-June, pp. 42-53, 1998
Separating Style and Content
Neural Information Processing Systems 9, M. C. Mozer, M. I. Jordan and T. Petsche, Eds., Morgan Kaufmann, San Mateo, CA., 1997
Design Galleries: A General Approach to Setting Parameters for Computer Graphics and Animation
ACM Computer Graphics, vol. 31, no. 4, (SIGGRAPH '97) August, 1997
Bayesian Color Constancy
Journal of the Optical Society of America, A, 14(7), pp. 1393-1411, July, 1997
Learning bilinear models for two-factor problems in vision
IEEE Conference on Computer Vision and Pattern Recognition (CVPR '97), Puerto Rico, U. S. A., June, 1997
Received Outstanding Paper prize, CVPR '97
Exploiting the generic viewpoint assumption
International Journal Computer Vision, 20 (3), 243-261, 1996
The generic viewpoint assumption in a Bayesian framework
Perception as Bayesian Inference, D. Knill and W. Richards, eds., Cambridge University Press, 365 - 390, 1996
Computer vision for computer games
, 2nd International Conference on Automatic Face and Gesture Recognition, Killington, VT, USA, pp. 100-105
Example-based head tracking
2nd International Conference on Automatic Face and Gesture Recognition, Killington, VT, USA.
A gesture controlled human interface using an artificial retina chip
IEEE Lasers and Electro-Optics (LEOS '96), July, 1996
Artificial retina chips as image input interfaces for multimedia systems
Optoelectronics and Communications Conference, OECC'96, Chiba, Japan, July, 1996
The steerable pyramid: a flexible architecture for multi-scale derivative computation
2nd Annual IEEE International Conference on Image Processing, Washington, DC. October, 1995
Bayesian decision theory, the maximum local mass estimate, and color constancy
Fifth International Conference on Computer Vision, IEEE Computer Society, Cambridge, MA, U.S.A, June, 1995, pp. 210 - 217
Orientation histograms for hand gesture recognition
International Workshop on Automatic Face- and Gesture- Recognition, IEEE Computer Society, Zurich, Switzerland, June, 1995, pp. 296-301
Winner, 2013 Test-of-time award from Face and Gesture Recognition conference. Here is a video prepared to accept the test-of-time award, describing the work in its context, in .mov format, or in .mpeg format.
Television control by hand gestures
International Workshop on Automatic Face- and Gesture- Recognition, IEEE Computer Society, Zurich, Switzerland, June, 1995, pp. 179-183
Bayesian method for recovering surface and illuminant properties from photosensor responses
Human Vision, Visual Processing and Digital Display V, SPIE Proceedings Series, vol. 2179, 1994
Computer vision for computer graphics
SIGGRAPH '94 and '95 course notes
Demonstration of an interactive environment for collaboration and learning
IEEE Computer, Vol. 27, No. 12, Dec. 1994
The generic viewpoint assumption in a framework for visual perception
Nature, vol. 368, p. 542 - 545, April 7, 1994
Exploiting the generic view assumption to estimate scene parameters
IEEE International Conference on Computer Vision, Berlin, Germany, 1993
Building and using catalogs of grey-level junctions
Proc. 15th European Conference on Visual Perception, Edinburgh, Scotland. August, 1993
Steerable Filters and Local Analysis of Image Structure
Ph.D. Thesis, Massachusetts Institute of Technology, 1992
Shiftable Multi-Scale Transforms
IEEE Trans. Information Theory, Special Issue on Wavelets. Vol. 38, No. 2, pp. 587-607, March 1992
The design and use of steerable filters
IEEE Trans. on Pattern Analysis and Machine Intelligence, vol. 13, no. 9, pp. 891 - 906, September, 1991
Motion without movement
ACM Computer Graphics, vol. 25, no. 4, (SIGGRAPH '91), pp. 27 - 30, July, 1991
A neural network for image noise removal
1st National Conference on Neural Networks and their Applications, Beijing, 1990
(in Chinese)
Pyramids and multiscale representations
Proc. 13th European Conference on Visual Perception, Paris, 1990
Steerable filters for early vision, image analysis, and wavelet decomposition
IEEE International Conference on Computer Vision, Osaka, Japan, 1990
Helmholtz Prize--test-of-time award winner.
Applications of neural networks in image processing
Automation Soc. of China Symp. on Neural Networks, pp. 46 - 55, Beijing, 1989
(in Chinese)
Steerable filters
OSA Topical Meeting on Image Understanding and Machine Vision, Technical Digest Series Volume 14, June, 1989
Image processing to remove grain from photographs
Society of Photographic Scientists and Engineers 42nd Annual Conference, pp. 457 - 460, May, 1989
Computer Image Processing of STEM Images of Tobacco Mosaic Virus
Ultramicroscopy 6, 367-76 (1981)
Deep Radio Occultations and 'Evolute Flashes': Their Characteristics and Utility for Planetary Studies
Icarus 37, 612-26 (1979)