Stephen H. Shum
sshum (at) csail (dot) mit (dot) edu
In June 2016, I completed my Ph.D. in Electrical Engineering and Computer Science (EECS) at MIT.
For the last seven years, I have been lucky to be a part of the Spoken Language Systems (SLS) group in the MIT Computer Science and Artificial Intelligence Laboratory (CSAIL). My advisors are Jim Glass and Najim Dehak.
The general scope of my research is in signal processing and machine learning as applied to speech, as well as other forms of audio. More specifically, I am interested in the use of both unsupervised and semi-supervised methods to perform statistical inference on an audio signal. For the last few years, I have been working primarily in the realm of language identification, speaker recognition, and speaker diarization, but I continue to be very interested in techniques from computational auditory scene analysis (CASA) that can be applied to more general, non-speech audio.
In May 2009, I graduated from UC Berkeley. Go Bears!