I am currrently 2 Grade Master’s student at Beijing University of Posts and Telecommunications (BUPT), advised by Prof. Haihong E.
My research interest includes multi-modal large language models (MLLMs) reasoning. I have published several papers at the top international AI conferences.
📖 Educations
- M.S. in School of Computer Science, Beijing University of Posts and Telecommunications, Computer Technology, 2024.09-2027.07(expected)
- B.S. in School of Computer Science, Beijing University of Posts and Telecommunications, Data Science And Big Data Technology, 2020.09-2024.07
🔥 News
- 2026.04:  ,🎉🎉 AEGIS: A Holistic Benchmark for Evaluating Forensic Analysis of AI-Generated Academic Images submitted to ACL 2026(Main Conference).
- 2026.01: 🎉🎉 THEMIS: Towards Holistic Evaluation of MLLMs for Scientific Paper Fraud Forensic submitted to ICLR 2026(Poster).
📝 Publications

THEMIS: Towards Holistic Evaluation of MLLMs for Scientific Paper Fraud Forensic
Tzu-Yen Ma*, Bo Zhang*, Zichen Tang, Junpeng Ding, Haolin Tian, Yuanze Li, Zhuodi Hao, Zixin Ding, Zirui Wang, Xinyu Yu, Shiyao Peng, Yizhuo Zhao, Ruomeng Jiang, Yiling Huang, Peizhi Zhao, Jiayuan Chen, Weisheng Tan, Haocheng Gao, Yang Liu, Jiacheng Liu, Zhongjun Yang, Jiayu Huang, Haihong E
- We present THEMIS, a holistic multi-task benchmark of over 4000 questions derived from authentic retracted-paper cases and realistically simulated synthetic data, to systematically evaluate the fine-grained visual fraud reasoning abilities of MLLMs.

AEGIS: A Holistic Benchmark for Evaluating Forensic Analysis of AI-Generated Academic Images
Bo Zhang*, Tzu-Yen Ma*, Zichen Tang, Junpeng Ding, Zirui Wang, Yizhuo Zhao, Peilin Gao, Zijie Xi, Zixin Ding, HaiyangSun, Haocheng Gao, Yuan Liu, Liangjia Wang, Yiling Huang, Yujie Wang, Yuyue Zhang, Ronghui Xi, Yuanze Li, Jiacheng Liu, Zhongjun Yang, Haihong E
- We introduce AEGIS, a holistic benchmark for evaluating expert-level forensic reasoning on AI-generated academic images, advancing domain-specific complexity, diverse forgery simulations, and multi-dimensional forensic evaluation.
🎖 Honors and Awards
- 2025.4 National Grand Champion, The 4th “Wutong Cup” Big Data Innovation & Maker Marathon Competition, China Mobile, 2025.04
🛠 Skills
- Language: Python, SQL, C
- Deep Learning: PyTorch, HuggingFace Transformers, vLLM
- LLM: Data Cleaning and Pre-processing for Pre-training and SFT、LoRA fine-tuning、prompt engineering
- 多模态工具: OpenCV, CLIP, Segment Anything (SAM)
💬 Invited Talks
- 2026.4, AITIME Forum / AITIME 论道, ICLR 2026 预讲会 北京邮电大学 BUPT ReasoningLab专场. | [video]