Kate Saenko

PhD Candidate
Computer Science and Artificial Intelligence Laboratory
Massachusetts Institute of Technology
Office D510, 32 Vassar Street, Cambridge, Massachusetts 02139 USA
email: saenko at mit.edu

Update: I have graduated from MIT. Currently I am a postdoctoral researcher at the International Computer Science Institute in Berkeley, CA.

 

 

Publications

 

2009

 

K. Saenko, “Image Sense Disambiguation: A Multimodal Approach”. Doctoral Thesis, Massachusetts Institute of Technology. August 2009. [pdf] [slides]

 

K, Saenko and T. Darrell, “Filtering Abstract Senses From Image Search Results” In Proc. NIPS, December 2009, Vancouver, Canada, to appear.

K.Saenko, K. Livescu, J. Glass, and T. Darrell, " Multistream Articulatory Feature-Based Models for Visual Speech Recognition". In IEEE Transactions on Pattern Analysis and Machine Intelligence, 2009.

 

2008

 

K, Saenko and T. Darrell, "Unsupervised Learning of Visual Sense Models for Polysemous Words". Proc. NIPS, December 2008, Vancouver, Canada.

 

2007

 

M. Hasegawa, K. Livescu, P. Lal, and K. Saenko, “Audiovisual Speech Recognition with Articulator Positions as Hidden Variables.” Proc. International Congress of Phonetic Sciences, August 2007, Saarbruecken, Germany.

 

K. Saenko and T. Darrell, “Object Category Recognition Using Probabilistic Fusion of Speech and Image Classifiers”. Proc. MLMI, June 2007, Brno, Czech Republic.

 

Karen Livescu, Ozgur Cetin, Mark Hasegawa-Johnson, Simon King, Chris Bartels, Nash Borges, Arthur Kantor, Partha Lal, Lisa Yung, Ari Bezman, Stephen Dawson-Haggerty, Bronwyn Woods, Joe Frankel, Matthew Magimai-Doss, and Kate Saenko, "Articulatory Feature-Based Methods for Acoustic and Audio-Visual Speech Recognition: Summary from the 2006 JHU Summer Workshop". ICASSP, May 2007.

 

2006

 

K. Saenko and K. Livescu, “An Asynchronous DBN for Audio-Visual Speech Recognition”. In Proc. IEEE 2006 Workshop on Spoken Language Technology (SLT), December 2006, Palm Beach, Aruba.

 

C. Christoudias, K. Saenko, L.-P. Morency and T. Darrell, “Co-Adaptation of Audio-Visual Speech and Gesture Classifiers”. Proc. ICMI, November 2006, Banff, Canada.

 

2005


K. Saenko, K. Livescu, M. Siracusa, K. Wilson, J. Glass, and T. Darrell, "Visual Speech Recognition with Loosely Synchronized Feature Streams". Proc. ICCV, October 2005, Beijing.

K. Saenko, K. Livescu, J. Glass, and T. Darrell, "Production Domain Modeling of Pronunciation for Visual Speech Recognition". Proc. ICASSP, March 2005, Philadelphia.

2004


K. Saenko, T. Darrell, and J. Glass, "Articulatory Features for Robust Visual Speech Recognition". Proc. ICMI, pp. 152-158, October 2004, State College, PA.

T. Hazen, K, Saenko, C. La, and J. Glass, "A Segment-based Audio-Visual Speech Recognizer: Data Collection, Development, and Initial Experiments" . Proc. ICMI, pp. 235-242, October 2004, State College, PA.