I am a Master’s student at University of Michigan, Ann Arbor (Class of 2026), majoring in Electrical and Computer Engineering. My academic journey began at Zhejiang University, where I earned a B.Eng. in Automation.

My research interests broadly span multimodal learning, vision-language understanding, reasoning and generation, and reinforcement learning for foundation models. I am drawn to problems where different modalities (vision, language, structured knowledge) must be jointly understood, aligned, and reasoned over. My recent work explores latent visual reasoning in VLMs and sub-dimensional cross-modal retrieval.

Beyond research, I am an adrenaline enthusiast — a licensed skydiver (USPA A License), CASI Level 2 snowboard instructor, PADI Advanced Open Water freediver, and IKO certified kitesurfer. I also share my life with an Australian Shepherd (八戒) and a Maine Coon (狗蛋儿).

🔔 I am actively seeking PhD positions and research assistant opportunities starting 2026–2027. Feel free to reach out via email or check my CV.

📝 Publications

ACL ARR 2026

Decompose, Look, and Reason: Reinforced Latent Reasoning for VLMs

Mengdan Zhu*, Senhao Cheng*, Liang Zhao (*Equal Contribution)

Under Review at ACL ARR 2026

Premise-conditioned latent reasoning with dynamic multi-step visual grounding
Spherical Gaussian Latent Policy for RL exploration on hyperspherical manifold
Three-stage pipeline: contrastive pretraining → SFT → reinforcement finetuning
V* Bench 83.8%, MathVista 67.5%, MMMU-Pro 56.1%, MMStar 65.2%, surpassing GPT-4o

ACL ARR 2026

Cross-modal RAG: Sub-dimensional Text-to-Image Retrieval-Augmented Generation

Mengdan Zhu*, Senhao Cheng*, Guangji Bai, Yifei Zhang, Liang Zhao (*Equal Contribution)

Under Review at ACL ARR 2026 | arXiv | Code

Sub-dimensional dense retriever with lightweight adaptor (0.01× CLIP’s GPU memory)
Multi-objective Pareto-optimal image selection with theoretical guarantees
MS-COCO R@1 81.82% (prev. best 59.10%), Flickr30K R@1 97.50%

Preprint 2024

ChemSafetyBench: Benchmarking LLM Safety on Chemistry Domain

Haochen Zhao*, Xiangru Tang*, …, Senhao Cheng, …, Mark Gerstein

arXiv | Code

Comprehensive benchmark with 30,000+ samples for evaluating LLM safety in chemistry
Covers chemical properties, usage legality, and synthesis methods
Incorporates handcrafted templates and advanced jailbreaking scenarios

AIBDF 2023, ACM

A Breast Cancer Detection Model Based on Modified ConvNeXt v2

Senhao Cheng, Esther Sun, Wangzi Qian, Yang Han

Published | DOI

Modified ConvNeXt v2 with Generalized-Mean Pooling and AdaBelief optimizer
pF1 improvements of 0.031–0.043 over ResNet50, GoogLeNet, and EfficientNet-B2

📖 Education

2024 - 2026, M.S. in Electrical and Computer Engineering, University of Michigan, Ann Arbor
- Focus: Multimodal Reasoning, Vision-Language Models, RL for VLMs
2020 - 2024, B.Eng. in Automation, Zhejiang University, Hangzhou, China

💼 Internships

2023.09 - 2024.04, AI & Data Analysis Intern, MindRank Ltd., Hangzhou, China
- Knowledge Graph construction, Biomedical Data analysis, Drug Discovery pipeline, Predictive Modeling

Senhao (Simon) Cheng

📝 Publications

📖 Education

💼 Internships