Hi, I am Sitao Cheng. I am a visiting research scholar at UCSB NLP Group, advised by Professor William Wang. I also closely work with Professor Liangming Pan. I obtained my Master degree from Nanjing University, advised by Professor Yuzhong Qu. I also worked as a research intern at Microsoft DKI Group.
My research interests lie in advancing the knowledge-intensive reasoning capabilities of language models (LMs). I have experience on Language Agents, RAG and Neural-Symbolic Reasoning. Currently, I am doing research on the following topics:
- Understanding and improving reasoning capabilities (e.g., how LLMs adopts parametric and contextual knowledge for reasoning).
- Language Agents (e.g., reasoning on real-world environments by information retrieval and semantic parsing).
- Retrieval-Augmented Generation (e.g., reasoning over knowledge graphs, efficient retrieval and better data organization paradigm).
I’m seeking for 25 Fall Ph.D. opportunities! Please feel free to reach out if you’re interested in my research! Please check out my CV.
Preprints
Understanding the Interplay between Parametric and Contextual Knowledge for Large Language Models
Sitao Cheng, Liangming Pan, Xunjian Yin, Xinyi Wang, William Yang Wang
[paper] [homepage]Disentangling Memory and Reasoning Ability in Large Language Models
Mingyu Jin, Weidi Luo, Sitao Cheng, Xinyi Wang, Wenyue Hua, Ruixiang Tang, William Yang Wang, Yongfeng Zhang
[paper]RuleArena: A Benchmark for Rule-Guided Reasoning with LLMs in Real-World Scenarios
Ruiwen Zhou, Wenyue Hua, Liangming Pan, Sitao Cheng, Xiaobao Wu, En Yu, William Yang Wang
[paper]Thread: A Logic-Based Data Organization Paradigm for How-To Question Answering with Retrieval Augmented Generation
Kaikai An, Fangkai Yang, Liqun Li, Junting Lu, Sitao Cheng, Shuzheng Si, Lu Wang, Pu Zhao, Lele Cao, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang, Qi Zhang, Baobao Chang
[paper]TARGA: Targeted Synthetic Data Generation for Practical Reasoning over Structured Data
Xiang Huang, Jiayu Shen, Shanshan Huang, Sitao Cheng, Xiaxia Wang, Yuzhong Qu
[paper]
Publications
[ACL’24 Findings] Call Me When Necessary: LLMs can Efficiently and Faithfully Reason over Structured Environments
Sitao Cheng, Ziyuan Zhuang, Yong Xu, Fangkai Yang, Chaoyun Zhang, Xiaoting Qin, Xiang Huang, Ling Chen, Qingwei Lin, Dongmei Zhang, Saravan Rajmohan, Qi Zhang
[paper] [code][ACL’24 Oral] QueryAgent: A Reliable and Efficient Reasoning Framework with Environmental Feedback based Self-Correction
Xiang Huang*, Sitao Cheng*, Shanshan Huang, Jiayu Shen, Yong Xu, Chaoyun Zhang, Yuzhong Qu
[paper] [code][EMNLP’24] EfficientRAG: Efficient Retriever for Multi-Hop Question Answering
Ziyuan Zhuang, Zhiyang Zhang, Sitao Cheng, Fangkai Yang, Jia Liu, Shujian Huang, Qingwei Lin, Saravan Rajmohan, Dongmei Zhang, Qi Zhang
[paper][EMNLP’23] MarkQA: A Large Scale KBQA Dataset with Numerical Reasoning
Xiang Huang, Sitao Cheng, Yuheng Bao, Shanshan Huang, Yuzhong Qu
[paper] [code] [homepage][AAAI’23 Oral] Question Decomposition Tree for Answering Complex Questions over Knowledge Bases
Xiang Huang, Sitao Cheng, Yiheng Shu, Yuheng Bao, Yuzhong Qu
[paper] [code]
Recent News
- 2024-11: Attending SoCal NLP 2024, San Diego.
- 2024-11: Attending EMNLP 2024, Miami.
- 2024-09: One paper accepted by EMNLP 2024.
- 2024-08: Attending and volunteering at ACL 2024, Bangkok.
- 2024-08: Attending EMNLP23 2024, Singapore.
- 2024-07: Joining UC Santa Barbara, NLP Group.
- 2024-05: Two papers accepted by ACL 2024 (one Main + one Findings).
- 2023-10: One paper accepted by EMNLP 2023.
- 2022-11: One paper accepted by AAAI 2023.
Services
- Reviewer: ARR, ICLR 2024
- ACL 2024 Volunteer