MIT EECS PhD Student @ CSAIL Office: 32-G436
Hi! I'm a fourth-year PhD student in Electrical Engineering and Computer Science at the Massachusetts Institute of Technology, where I work with Jim Glass at CSAIL.
My research primarily focuses on natural language processing and large language models (LLMs), with a particular interest in improving their factuality and reliability. In Lookback Lens, we introduced a method that detects and mitigates contextual hallucinations in LLMs using attention maps. In DoLa, we proposed a decoding strategy that enhances LLM factuality by contrasting the knowledge across different transformer layers.
I also explore retrieval-based approaches that strengthen LLMs by grounding answers in real documents. For instance, in Expand, Rerank, and Retrieve, we proposed query reranking to achieve more accurate retrieval results for open-domain QA. In DiffCSE, we built a contrastive learning method based on the differences between similar sentences to further boost the quality of sentence embeddings.
I was fortunate to intern at FAIR Meta, Microsoft, and MIT-IBM Watson AI Lab. Before joining MIT, I was an undergraduate student in Electrical Engineering at National Taiwan University, where I worked with Hung-Yi Lee, Yun-Nung (Vivian) Chen, and Lin-shan Lee. Here is my Curriculum Vitae.
Reduced word error rate (WER) from 80% to 20% on impaired-voice speech via personalized adaptation (in Mandarin). Final Project in Introduction to Biomedical Engineering 2020 Spring@NTU.
A decentralized publishing platform built on blockchain with Ethereum smart contracts. Final Project in Networking and Multimedia Lab 2020 Spring@NTU.
Ranked 2nd out of 44 groups by A*T value (Area * Clock Time). Final Project in Computer Architecture 2019 Fall@NTU.
ICCAD 2019 CAD Contest - Problem E. Final Project in Algorithms 2019 Spring@NTU.
Conducted experiments on unsupervised domain adaptation (UDA) on the multi-source dataset from an ICCV 2019 Workshop Challenge. Final Project in Deep Learning for Computer Vision 2019 Spring@NTU.
Developed an open-source, state-of-the-art Chinese word segmentation system with BiLSTM and ELMo to support downstream Chinese NLP tasks. Final Project in Digital Speech Processing 2018 Fall@NTU.
Developed a human-versus-computer program for the card game Big Two. Final Project in Computer Programming 2017 Fall@NTU.