I am a Master’s student at University of Michigan, Ann Arbor (Class of 2026), majoring in Electrical and Computer Engineering. My academic journey began at Zhejiang University, where I earned a B.Eng. in Automation.

My research interests broadly span multimodal learning, vision-language understanding, reasoning and generation, and reinforcement learning for foundation models. I am drawn to problems where different modalities (vision, language, structured knowledge) must be jointly understood, aligned, and reasoned over. My recent work explores latent visual reasoning in VLMs and sub-dimensional cross-modal retrieval.

Beyond research, I am an adrenaline enthusiast — a licensed skydiver (USPA A License), CASI Level 2 snowboard instructor, PADI Advanced Open Water freediver, and IKO certified kitesurfer. I also share my life with an Australian Shepherd (八戒) and a Maine Coon (狗蛋儿).

🔔 I am actively seeking PhD positions and research assistant opportunities starting 2026–2027. Feel free to reach out via email or check my CV.

📝 Publications

ACL ARR 2026
DLR

Decompose, Look, and Reason: Reinforced Latent Reasoning for VLMs

Mengdan Zhu*, Senhao Cheng*, Liang Zhao (*Equal Contribution)

Under Review at ACL ARR 2026

  • Premise-conditioned latent reasoning with dynamic multi-step visual grounding
  • Spherical Gaussian Latent Policy for RL exploration on hyperspherical manifold
  • Three-stage pipeline: contrastive pretraining → SFT → reinforcement finetuning
  • V* Bench 83.8%, MathVista 67.5%, MMMU-Pro 56.1%, MMStar 65.2%, surpassing GPT-4o
ACL ARR 2026
Cross-modal RAG

Cross-modal RAG: Sub-dimensional Text-to-Image Retrieval-Augmented Generation

Mengdan Zhu*, Senhao Cheng*, Guangji Bai, Yifei Zhang, Liang Zhao (*Equal Contribution)

Under Review at ACL ARR 2026 | arXiv | Code

  • Sub-dimensional dense retriever with lightweight adaptor (0.01× CLIP’s GPU memory)
  • Multi-objective Pareto-optimal image selection with theoretical guarantees
  • MS-COCO R@1 81.82% (prev. best 59.10%), Flickr30K R@1 97.50%
Preprint 2024
ChemSafetyBench

ChemSafetyBench: Benchmarking LLM Safety on Chemistry Domain

Haochen Zhao*, Xiangru Tang*, …, Senhao Cheng, …, Mark Gerstein

arXiv | Code

  • Comprehensive benchmark with 30,000+ samples for evaluating LLM safety in chemistry
  • Covers chemical properties, usage legality, and synthesis methods
  • Incorporates handcrafted templates and advanced jailbreaking scenarios
AIBDF 2023, ACM
Breast Cancer Detection

A Breast Cancer Detection Model Based on Modified ConvNeXt v2

Senhao Cheng, Esther Sun, Wangzi Qian, Yang Han

Published | DOI

  • Modified ConvNeXt v2 with Generalized-Mean Pooling and AdaBelief optimizer
  • pF1 improvements of 0.031–0.043 over ResNet50, GoogLeNet, and EfficientNet-B2

📖 Education

  • 2024 - 2026, M.S. in Electrical and Computer Engineering, University of Michigan, Ann Arbor
    • Focus: Multimodal Reasoning, Vision-Language Models, RL for VLMs
  • 2020 - 2024, B.Eng. in Automation, Zhejiang University, Hangzhou, China

💼 Internships

  • 2023.09 - 2024.04, AI & Data Analysis Intern, MindRank Ltd., Hangzhou, China
    • Knowledge Graph construction, Biomedical Data analysis, Drug Discovery pipeline, Predictive Modeling