MIT EECS PhD Student @ CSAIL Office: 32-G436
Hi! I'm a fourth-year PhD student in Electrical Engineering and Computer Science at Massachusetts Institute of Technology, where I work with Jim Glass at CSAIL.
My research primarily focuses on natural language processing and large language models (LLMs), with a particular interest in improving their factuality and reliability. In DoLa, we proposed a decoding strategy that enhances LLM factuality by contrasting the knowledge across different transformer layers. In Lookback Lens, we introduced a method that detects and mitigates contextual hallucinations in LLMs using attention maps. Most recently, in SelfCite, we developed a self-supervised framework that enables LLMs to generate fine-grained, sentence-level citations by leveraging context ablation as a reward signal.
I also explore retrieval-based approaches to strengthen LLM by grounding answers in real documents. For instance, in Expand, Rerank, and Retrieve, we proposed query reranking to achieve more accurate retrieval results for open-domain QA. In DiffCSE, we built a contrastive learning method based on the differences between similar sentences to further boost the quality of sentence embeddings.
I was fortunate to intern at FAIR Meta, Microsoft, and MIT-IBM Watson AI Lab. Before joining MIT, I was an undergraduate student in Electrical Engineering at National Taiwan University, where I worked with Hung-Yi Lee, Yun-Nung (Vivian) Chen, and Lin-shan Lee. Here is my Curriculum Vitae.
For a full list of papers, see my Google Scholar.