Hao Tang 唐顥

Papers

Phonetic analysis of self-supervised representations of english speech
Dan Wells, Hao Tang, Korin Richmond
Interspeech, 2022
Speech audio corrector: using speech from non-target speakers for one-off correction of mispronunciations in grapheme-input text-to-speech
Jason Fong, Daniel Lyth, Gustav Eje Henter, Hao Tang, Simon King
Interspeech, 2022
Autoregressive Co-Training for Learning Discrete Speech Representation
Sung-Ling Yeh, Hao Tang
Interspeech, 2022
Hierarchical sketch induction for paraphrase generation
Tom Hosking, Hao Tang, Mirella Lapata
ACL, 2022
Supervised attention in sequence-to-sequence models for speech recognition
Gene-Ping Yang, Hao Tang
ICASSP, 2022
On the difficulty of segmenting words with attention
Ramon Sanabria, Hao Tang, Sharon Goldwater
Workshop of Insights from Negative Results in NLP, 2021
Vector-quantized autoregressive predictive coding
Yu-An Chung, Hao Tang, James Glass
Interspeech, 2020
(best student paper award)
Audio-visual calibration with polynomial regression for 2-D projection using SVD-PHAT
Francois Grondin, Hao Tang, James Glass
ICASSP, 2020
A deep residual network for large-scale acoustic scene analysis
Logan Ford, Hao Tang, Francois Grondin, James Glass
Interspeech, 2019
An unsupervised autoregressive model for speech representation learning
Yu-An Chung, Wei-Ning Hsu, Hao Tang, James Glass
Interspeech, 2019
VoiceID loss: speech enhancement for speaker verification
Suwon Shon, Hao Tang, James Glass
Interspeech, 2019
Time-contrastive learning based deep bottleneck features for text-dependent speaker verification
Achintya Kr. Sarkar, Zheng-Hua Tan, Hao Tang, Suwon Shon, James Glass
IEEE Transactions on Audio, Speech and Language Processing, 2019
On the inductive bias of words in acoustics-to-word models
Hao Tang, James Glass
arXiv:1810.13407
On training recurrent networks with truncated backpropagation through time in speech recognition
Hao Tang, James Glass
SLT, 2018
Frame-level speaker embeddings for text-independent speaker recognition and analysis of end-to-end model
Suwon Shon, Hao Tang, James Glass
SLT, 2018
A study of enhancement, augmentation, and autoencoder methods for domain adaptation in distant speech recognition
Hao Tang, Wei-Ning Hsu, Francois Grondin, James Glass
Interspeech, 2018
Unsupervised adaptation with interpretable disentangled representations for distant conversational speech recognition
Wei-Ning Hsu, Hao Tang, James Glass
Interspeech, 2018
End-to-end neural segmental models for speech recognition
Hao Tang, Liang Lu, Lingpeng Kong, Kevin Gimpel, Karen Livescu, Chris Dyer, Noah A. Smith, Steve Renals
IEEE Journal of Selected Topics in Signal Processing, 2017
Lexicon-free fingerspelling recognition from video: data, models, and signer adaptation
Taehwan Kim, Jonathan Keane, Weiran Wang, Hao Tang, Jason Riggle, Gregory Shakhnarovich, Diane Brentari, Karen Livescu
Computer Speech and Language, 2017
Multitask learning with low-level auxiliary tasks for encoder-decoder based speech recognition
Shubham Toshniwal, Hao Tang, Liang Lu, Karen Livescu
Interspeech, 2017
ASR for under-resourced languages from probabilistic transcription
Mark Hasegawa-Johnson, Preethi Jyothi, Daniel McCloy, Majid Mirbagheri, Giovanni di Liberto, Amit Das, Bradley Ekin, Chunxi Liu, Vimal Manohar, Hao Tang, Edmund C. Lalor, Nancy Chen, Paul Hager, Tyler Kekona, Rose Sloan, Adrian KC Lee
IEEE Transactions on Audio, Speech and Language Processing, 2017
End-to-end training approaches for discriminative segmental models
Hao Tang, Weiran Wang, Kevin Gimpel, Karen Livescu
SLT, 2016
Efficient segmental cascades for speech recognition
Hao Tang, Weiran Wang, Kevin Gimpel, Karen Livescu
Interspeech, 2016
Triphone state-tying via deep canonical correlation analysis
Weiran Wang, Hao Tang, Karen Livescu
Interspeech, 2016
Adapting ASR for under-resourced languages using mismatched transcriptions
Chunxi Liu, Preethi Jyothi, Hao Tang, Vimal Manohar, Rose Sloan, Tyler Kekona, Mark Hasegawa-Johnson, Sanjeev Khudanpur
ICASSP, 2016
(speech and language processing student paper award)
Signer-independent fingerspelling recognition with deep neural network adaptation
Taehwan Kim, Weiran Wang, Hao Tang, Karen Livescu
ICASSP, 2016
(best student paper of speech and language processing)
Discriminative segmental cascades for feature-rich phone recognition
Hao Tang, Weiran Wang, Kevin Gimpel, Karen Livescu
ASRU, 2015
(best paper nominee)
A comparison of training approaches for discriminative segmental models
Hao Tang, Kevin Gimpel, Karen Livescu
Interspeech, 2014
Log-linear dialog manager
Hao Tang, Shinji Watanabe, Tim K. Marks, John Hershey
ICASSP, 2014
Discriminative pronunciation modeling: a large-margin feature-rich approach
Hao Tang, Joseph Keshet, Karen Livescu
ACL, 2012
An initial attempt for phoneme recognition using structured SVM
Hao Tang, Chao-Hong Meng, Lin-Shan Lee
ICASSP, 2010
Spoken term detection from bilingual spontaneous speech using code-switched lattice-based structure for words and subword units
Hung-Yi Lee, Yueh-Lien Tang, Hao Tang, Lin-Shan Lee
ASRU, 2009
Query term selection strategies for web-based Chinese factoid question answering
Hao Tang, Cheng-Wei Lee, Tian-Jian Jiang, Wen-Lian Hsu
TAAI, 2006