Michael (Miki) Rubinstein

Home PublicationsCodeTeachingMisc

 

 
+1 617-253-3497

MIT Computer Science and Artificial Intelligence Lab
32 Vassar Street
Cambridge, MA 02139


 

I'm a PhD student in the EECS department at MIT, working in computer vision. My advisor is Prof. Bill Freeman.
I've completed my Masters in computer science at Tel-Aviv University and the Interdisciplinary Center (IDC) in May 2009, under the supervision of Prof. Ariel Shamir.
My main research interests are in the areas of image/video analysis, motion analysis and computational photography and video.

CV [pdf] [LinkedIn]


News

May 20, 2012   "Eulerian Video Magnification for Revealing Subtle Changes in the World" accpeted to SIGGRAPH 2012
Mar 05, 2012   I am supported by the Microsoft Research PhD Fellowship (2012-2013)
May 23 2011   Spending the summer at Microsoft Research New England
May 03 2011   "Motion Denoising with Application to Time-lapse Photography" accepted to CVPR 2011
May 03 2011   I am a recipient of the 2011 NVIDIA Graduate Fellowship
Sep 12 2010   RetargetMe dataset is now online
Aug 15 2010   "A Comparative Study of Image Retargeting" conditionally accepted to SIGGRAPH Asia 2010

 


Links

 


Publications

List of publications and patents @ Google Scholar

  Hao-Yu Wu, Michael Rubinstein, Eugene Shih, John Guttag, Fredo Durand, William T. Freeman
Eulerian Video Magnification for Revealing Subtle Changes in the World
ACM Transactions on Graphics, Volume 31, Number 4 (Proc. SIGGRAPH) 2012
[abstract] [pdf] [www] [BibTeX]
Our goal is to reveal temporal variations in videos that are difficult or impossible to see with the naked eye and display them in an indicative manner. Our method, which we call Eulerian Video Magnification, takes a standard video sequence as input, and applies spatial decomposition, followed by temporal filtering to the frames. The resulting signal is then amplified to reveal hidden information. Using our method, we are able to visualize the flow of blood as it fills the face and to amplify and reveal small motions. Our technique can be run in real time to instantly show phenomena occurring at the temporal frequencies selected by the user.
  Michael Rubinstein, Ce Liu, Peter Sand, Fredo Durand, William T. Freeman
Motion Denoising with Application to Time-lapse Photography

IEEE Computer Vision and Pattern Recognition (CVPR) 2011
[abstract] [pdf] [www] [BibTeX]
Motions can occur over both short and long time scales. We introduce motion denoising, which treats short-term changes as noise, long-term changes as signal, and rerenders a video to reveal the underlying long-term events. We demonstrate motion denoising for time-lapse videos. One of the characteristics of traditional time-lapse imagery is stylized jerkiness, where short-term changes in the scene appear as small and annoying jitters in the video, often obfuscating the underlying temporal events of interest. We apply motion denoising for resynthesizing time-lapse videos showing the long-term evolution of a scene with jerky short-term changes removed. We show that existing filtering approaches are often incapable of achieving this task, and present a novel computational approach to denoise motion without explicit motion analysis. We demonstrate promising experimental results on a set of challenging time-lapse sequences.
  Michael Rubinstein, Diego Gutierrez, Olga Sorkine, Ariel Shamir
A Comparative Study of Image Retargeting
ACM Transactions on Graphics, Volume 29, Number 5 (Proc. SIGGRAPH Asia) 2010
[abstract] [pdf] [www] [BibTeX]
The numerous works on media retargeting call for a methodological approach for evaluating retargeting results. We present the first comprehensive perceptual study and analysis of image retargeting. First, we create a benchmark of images and conduct a large scale user study to compare a representative number of state-of-the-art retargeting methods. Second, we present analysis of the users’ responses, where we find that humans in general agree on the evaluation of the results and show that some retargeting methods are consistently more favorable than others. Third, we examine whether computational image distance metrics can predict human retargeting perception. We show that current measures used in this context are not necessarily consistent with human rankings, and demonstrate that better results can be achieved using image features that were not previously considered for this task. We also reveal specific qualities in retargeted media that are more important for viewers. The importance of our work lies in promoting better measures to assess and guide retargeting algorithms in the future. The full benchmark we collected, including all images, retargeted results, and the collected user data, are available to the research community for further investigation.
  Michael Rubinstein
Discrete Approaches to Content-aware Image and Video Retargeting

MSc Thesis, May 2009
[pdf] [high-res pdf (70mb)] [BibTeX]
  Michael Rubinstein, Ariel Shamir, Shai Avidan
Multi-operator Media Retargeting
ACM Transactions on Graphics, Volume 28, Number 3 (Proc. SIGGRAPH) 2009
[abstract] [pdf] [www] [BibTeX]
Content aware resizing gained popularity lately and users can now choose from a battery of methods to retarget their media. However, no single retargeting operator performs well on all images and all target sizes. In a user study we conducted, we found that users prefer to combine seam carving with cropping and scaling to produce results they are satisfied with. This inspires us to propose an algorithm that combines different operators in an optimal manner. We define a resizing space as a conceptual multi-dimensional space combining several resizing operators, and show how a path in this space defines a sequence of operations to retarget media. We define a new image similarity measure, which we term Bi-Directional Warping (BDW), and use it with a dynamic programming algorithm to find an optimal path in the resizing space. In addition, we show a simple and intuitive user interface allowing users to explore the resizing space of various image sizes interactively. Using key-frames and interpolation we also extend our technique to retarget video, providing the flexibility to use the best combination of operators at different times in the sequence.
  Michael Rubinstein, Ariel Shamir, Shai Avidan
Improved Seam Carving for Video Retargeting
ACM Transactions on Graphics, Volume 27, Number 3 (Proc. SIGGRAPH) 2008
[abstract] [pdf] [www] [code] [BibTex]
This algorithm appears in Photoshop ≥CS4 as Content-aware scaling
Video, like images, should support content aware resizing. We present video retargeting using an improved seam carving operator. Instead of removing 1D seams from 2D images we remove 2D seam manifolds from 3D space-time volumes. To achieve this we replace the dynamic programming method of seam carving with graph cuts that are suitable for 3D volumes. In the new formulation, a seam is given by a minimal cut in the graph and we show how to construct a graph such that the resulting cut is a valid seam. That is, the cut is monotonic and connected. In addition, we present a novel energy criterion that improves the visual quality of the retargeted images and videos. The original seam carving operator is focused on removing seams with the least amount of energy, ignoring energy that is introduced into the images and video by applying the operator. To counter this, the new criterion is looking forward in time - removing seams that introduce the least amount of energy into the retargeted result. We show how to encode the improved criterion into graph cuts (for images and video) as well as dynamic programming (for images). We apply our technique to images and videos and present results of various applications.
  Ariel Shamir, Michael Rubinstein, Tomer Levinboim
Inverse Computer Graphics: Parametric Comics Creation from 3D Interaction
IEEE Computer Graphics & Applications, Volume 26, Number 3, 30-38, 2006
[abstract] [pdf] [www] [BibTeX]
There are times when Computer Graphics is required to be succinct and simple. Carefully chosen simplified and static images can portray a narration of a story as effectively as 3D photo-realistic continuous graphics. In this paper we present an automatic system which transforms continuous graphics originating from real 3D virtualworld interactions into a sequence of comics images. The system traces events during the interaction and then analyzes and breaks them into scenes. Based on user defined parameters of point-ofview and story granularity it chooses specific time-frames to create static images, renders them, and applies post-processing to reduce their cluttering. The system utilizes the same principal of intelligent reduction of details in both temporal and spatial domains for choosing important events and depicting them visually. The end result is a sequence of comics images which summarize the main happenings and present them in a coherent, concise and visually pleasing manner.

 


Code and Data


 
A dataset of 80 images and retargeted results, ranked by human viewers. The project website contains all the data we collected and also provides a nice synopsis for the current state of image retargeting research.
  Image Retargeting Survey (v2.3, 110mb, 2010-08-30)
This is the system I've developed for collecting user feedback on image retargeting results. It is based on the linked-paired comparison design to collect and analyze data when the number of stimuli is very large.
The code is written in HTML, PHP and javascript. It supports different experiment designs, and can be easily used with Amazon Mechanical Turk. See my paper and the project website for further details.
A live demo of the system is available here.
  Seam Carving (v1.0, 2009-04-10)
A MATLAB reimplementation of the seam carving method I worked on at MERL. It is provided for research/educational purposes only. This algorithm is patented and owned by Mitsubishi Electric Research Labs, Cambridge MA.
The code supports backward and forward energy using both the dynamic programming and graph cut formulations (using the maxflow library below). See demo.m for usage example.
Please cite my Masters thesis if you use this code.
maxflow (v1.1, 2008-09-15)
A MATLAB wrapper for the Boykov-Kolmogorov max-flow algorithm. Also available at MATLAB Central.

Algorithms I've implemented

By popular demand: Chuang et al.'s Bayesian matting. See implmenetation details here.

 


Teaching

  Spring 2011 6.869 Advances in Computer Vision


Some non-conference talks I gave here and there:

Seam Carving and Content-driven Retargeting of Images and Video
Presented at: 6.865 Computational Photography, 2010 2011, MIT.
[ppt] [pdf]
Introduction to Recursive Bayesian Filtering
Presented at: Seminar on Advanced Topics in Computer Graphics, 2009, Tel Aviv University.
[ppt] [pdf]
Tracking with focus on Particle Filters
Presented at: Seminar on Vision-based Security, 2009, IDC.
Part I: [ppt] [pdf]
Part II: [ppt] [pdf]
Dimensionality Reduction by Random Mapping
Presented at: Seminar on Advanced Topics in Computer Graphics, 2006, Tel Aviv University.
[ppt] [pdf]

 


Miscellaneous


  SpaceCam (with Adrian Dalca)
Cool videos of earth taken with a DIY high-altitude balloon (March 2012)

 

 

 

 

Last updated: May 2012