Lavanya Sharan

I was a research scientist in Ruth Rosenholtz's group in the Dept. of Brain & Cognitive Sciences at MIT from 2012 to 2015. I received my PhD in Computer Science with Edward Adelson at MIT in 2009, and from 2009 to 2012, I was a postdoctoral researcher in Jessica Hodgins's group at Disney Research, Pittsburgh.

As an academic, I studied visual perception from behavioral and computational perspectives. In my work, I used psychophysical methods to measure specific visual abilities, and I built computational models to understand how the human visual system might support those abilities. My research focused on explaining visual perception in real-world conditions rather than simplified, abstract settings, and utilized techniques from computer science, specifically, computer vision and computer graphics, to handle the complexity of real-world visual inputs.

Material recognition

JOV'14

IJCV'13

CVPR'10

Our world consists not only of objects and scenes but also of materials of various kinds. Being able to recognize the materials that surround us (e.g., fabric, glass, metal) is important for humans as well as for computer vision systems. As far as we are aware, we were the first to systematically study how humans recognize material categories as well as the first to design computer vision systems to recognize material categories.

We gathered a diverse set of real-world photographs and presented them to human observers under a variety of conditions to establish the accuracy and speed of material category recognition. We found that observers could identify material categories (e.g., leather, plastic) reliably and quickly. Simple strategies based on color, texture, or surface shape could not account for observers' performance. Nor could the results be explained by observers merely performing shape-based object recognition. Rather, fast and accurate material categorization is a distinct, basic ability of the human visual system.

Inspired by these findings, we designed computer vision systems for recognizing high-level material categories. We proposed a set of low and mid-level image features and combined them in LDA and SVM-based frameworks. Our systems outperformed state-of-the-art recognition systems of their time on our challenging dataset of material categories, achieving categorization accuracies in the range, 42-57% (chance: 10%).

Our work was published in JoV, IJCV, and CVPR. In the human vision community, our findings have motivated a number of studies on the relationship of material categorization to object recognition, material quality estimation, visual search, etc. In the computer vision community, our dataset of material categories, the Flickr Material Database (FMD), has become a benchmark for evaluating material recognition systems. In their classic textbook on computer vision, Forsyth & Ponce have praised FMD for being 'an alternative and very difficult material dataset'.

L. Sharan, R. Rosenholtz & E. H. Adelson, Accuracy and speed of material categorization in real-world images, Journal of Vision (JoV), 2014

L. Sharan, C. Liu, R. Rosenholtz & E. H. Adelson, Recognizing materials using perceptually inspired features, Intl. Journal of Computer Vision (IJCV), 2013

C. Liu, L. Sharan, E. H. Adelson & R. Rosenholtz, Exploring features in a Bayesian framework for material recognition, in Proc. of IEEE Conf. on Computer Vision and Pattern Recognition (CVPR), 2010

L. Sharan, The perception of material qualities in real-world images, Ph.D. thesis, MIT, 2009