Wenxuan Ding

I am a second year PhD student in Computer Science at New York University (Courant Institute), working with Greg Durrett.

I spent the first year of my PhD at TAUR Lab @UT Austin before transferring out.

Previously, I received my Bachelor's degree at The Hong Kong University of Science and Technology with a major in CS and a minor in Math, where I worked with Yangqiu Song. I also worked with Yulia Tsvetkov at University of Washington.

Email / CV / Google Scholar / Twitter / Github

profile photo

Research

I am generally interested in Natural Language Processing, with recent focus on:

Publications

ChartMuseum: Testing Visual Reasoning Capabilities of Large Vision-Language Models
Liyan Tang, Grace Kim, Xinyu Zhao, Thom Lake, Wenxuan Ding, Fangcong Yin, Prasann Singhal, Manya Wadhwa, Zeyu Leo Liu, Zayne Sprague, Ramya Namuduri, Bodun Hu, Juan Diego Rodriguez, Puyuan Peng, Greg Durrett
NeurIPS D&B Track, 2025 paper

Sparta Alignment: Collectively Aligning Multiple Language Models through Combat
Yuru Jiang*, Wenxuan Ding*, Shangbin Feng*, Greg Durrett, Yulia Tsvetkov
NeurIPS, 2025 paper code

RankAlign: A Ranking View of the Generator-Validator Gap in Large Language Models
Juan Diego Rodriguez*, Wenxuan Ding*, Katrin Erk, Greg Durrett
COLM, 2025 paper code

When One LLM Drools, Multi-LLM Collaboration Rules
Shangbin Feng, Wenxuan Ding, Alisa Liu, Zifeng Wang, Weijia Shi, Yike Wang, Zejiang Shen, Xiaochuang Han, Hunter Lang, Chen-Yu Lee, Tomas Pfister, Yejin Choi, Yulia Tsvetkov
arXiv, 2025 paper

On the Role of Entity and Event Level Conceptualization in Generalizable Reasoning: A Survey of Tasks, Methods, Applications, and Future Directions
Weiqi Wang, Tianqing Fang, Haochen Shi, Baixuan Xu, Wenxuan Ding*, Liyu Zhang, Wei Fan, Jiaxin Bai, Haoran Li, Xin Liu, Yangqiu Song
Findings of EMNLP, 2025 paper

Varying Shades of Wrong: Aligning LLMs with Wrong Answers Only
Jihan Yao*, Wenxuan Ding*, Shangbin Feng*, Lucy Lu Wang, Yulia Tsvetkov
ICLR, 2025 paper code

Teaching LLMs to Abstain across Languages via Multilingual Feedback
Shangbin Feng, Weijia Shi, Yike Wang, Wenxuan Ding, Orevaoghene Ahia, Shuyue Stella Li, Vidhisha Balachandran, Sunayana Sitaram, Yulia Tsvetkov
EMNLP, 2024 paper code

IntentionQA: A Benchmark for Evaluating Purchase Intention Comprehension Abilities of Language Models in E-commerce
Wenxuan Ding*, Weiqi Wang*, Sze Heng Douglas Kwok, Minghao Liu, Tianqing Fang, Jiaxin Bai, Xin Liu, Changlong Yu, Zheng Li, Chen Luo, Qingyu Yin, Bing Yin, Junxian He, Yangqiu Song
Findings of EMNLP, 2024 paper code

MIND: Multimodal Shopping Intention Distillation from Large Vision-language Models for E-commerce Purchase Understanding
Baixuan Xu*, Weiqi Wang*, Haochen Shi, Wenxuan Ding, Huihao Jing, Tianqing Fang, Jiaxin Bai, Xin Liu, Changlong Yu, Zheng Li, Chen Luo, Qingyu Yin, Bing Yin, Long Chen, Yangqiu Song
EMNLP, 2024 paper code

Don't Hallucinate, Abstain: Identifying LLM Knowledge Gaps via Multi-LLM Collaboration
Shangbin Feng, Weijia Shi, Yike Wang, Wenxuan Ding, Vidhisha Balachandran, Yulia Tsvetkov
ACL, 2024 🌟 Area Chair Award; 🌟 Outstanding Paper Award paper code

CANDLE: Iterative Conceptualization and Instantiation Distillation from Large Language Models for Commonsense Reasoning
Weiqi Wang, Tianqing Fang, Chunyang Li, Haochen Shi, Wenxuan Ding, Baixuan Xu, Zhaowei Wang, Jiaxin Bai, Xin Liu, Jiayang Cheng, Chunkit Chan, Yangqiu Song
ACL, 2024 paper code

Knowledge Crosswords: Geometric Knowledge Reasoning with Large Language Models
Wenxuan Ding*, Shangbin Feng*, Yuhan Liu, Zhaoxuan Tan, Vidhisha Balachandran, Tianxing He, Yulia Tsvetkov
Findings of ACL, 2024 paper code

CAR: Conceptualization-Augmented Reasoner for Zero-Shot Commonsense Question Answering
Weiqi Wang, Tianqing Fang, Wenxuan Ding, Baixuan Xu, Xin Liu, Yangqiu Song, Antoine Bosselut
Findings of EMNLP, 2023 paper code

QADYNAMICS: Training Dynamics-Driven Synthetic QA Diagnostic for Zero-Shot Commonsense Question Answering
Haochen Shi, Weiqi Wang, Tianqing Fang, Baixuan Xu, Wenxuan Ding, Xin Liu, Yangqiu Song
Findings of EMNLP, 2023 paper code

Teaching

Misc