Xiaomeng Yang 杨晓萌

Ph.D. Candidate in Computer Engineering

Northeastern University · Boston, MA

I am a Ph.D. candidate in Computer Engineering at Northeastern University, co-advised by Prof. Yanzhi Wang and Prof. Xuan Zhang. My research centers on generative models and multimodal AI — with a focus on video generation, diffusion models, and building efficient, deployable generative systems for real-world scientific and engineering applications.

Previously, I earned my master's degree at the University of Chinese Academy of Sciences, where I worked with Prof. Yu Zhou on visual text analysis and understanding. I hold dual bachelor's degrees in Computer Engineering from the University of Illinois Urbana-Champaign and Zhejiang University.

I'm always glad to connect about research ideas or potential collaborations — feel free to reach out by email.

News

2026.06Our work Prompt2Effect was accepted to ECCV 2026.
2025.11Passed the Ph.D. qualifying exam.
2025.09Our work ALTER was accepted to NeurIPS 2025.
2025.09Started as a Research Intern at Snap Inc.

Selected Publications

Prompt2Effect: Training-Free Image-to-Video Model Specialization via LoRA Generation

Xiaomeng Yang, Yanyu Li, Gordon Guocheng Qian, Ivan Skorokhodov, Viacheslav Ivanov, Avalon Vinella, Xuan Zhang, Yanzhi Wang, Sergey Tulyakov, Anil Kag

European Conference on Computer Vision (ECCV), 2026

A training-free framework that specializes image-to-video diffusion models via generated LoRAs for controllable video effects.

Paper Project

ALTER: All-in-one Layer Pruning and Temporal Expert Routing for Efficient Diffusion Generation

Xiaomeng Yang*, Lei Lu*, Qihui Fan, Changdi Yang, Juyi Lin, Yanzhi Wang, Xuan Zhang, Shangqian Gao (* equal contribution)

Conference on Neural Information Processing Systems (NeurIPS), 2025

Combines all-in-one layer pruning with temporal expert routing to accelerate diffusion-based generation.

Paper Code

IPAD: Iterative, Parallel, and Diffusion-based Network for Scene Text Recognition

Xiaomeng Yang, Zhi Qiao, Yu Zhou

International Journal of Computer Vision (IJCV), 2025

An iterative, parallel, and diffusion-based network for accurate and efficient scene text recognition.

Paper Code

ZeroSim: Zero-Shot Analog Circuit Evaluation with Unified Transformer Embeddings

Xiaomeng Yang, Jian Gao, Yanzhi Wang, Xuan Zhang

International Conference on Computer-Aided Design (ICCAD), 2025

Zero-shot analog circuit performance evaluation using unified transformer embeddings.

Paper Code

Masked and Permuted Implicit Context Learning for Scene Text Recognition

Xiaomeng Yang, Zhi Qiao, Jin Wei, Dongbao Yang, Yu Zhou

IEEE Signal Processing Letters (SPL), 2024

Unifies permuted and masked language modeling within a single decoder for robust scene text recognition.

Paper Code

Accurate and Robust Scene Text Recognition via Adversarial Training

Xiaomeng Yang, Dongbao Yang, Zhi Qiao, Yu Zhou

IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2024

A regularization-based adversarial training method that improves both the robustness and accuracy of scene text recognition.

Paper Code

Experience

Research Intern, Creative Vision, Snap Inc.

Sep 2025 – May 2026 · Santa Monica, CA

Worked on efficient image-to-video effects generation.

Research Intern, Tomorrow Advancing Life (TAL)

Nov 2022 – Jun 2024 · Beijing, China

Worked on scene text recognition.

Education

Northeastern University
Ph.D. in Computer Engineering · 2024 – Present

University of Chinese Academy of Sciences
M.S., advised by Prof. Yu Zhou · 2021 – 2024

University of Illinois Urbana-Champaign
B.S. in Computer Engineering (dual-degree) · 2017 – 2021

Zhejiang University
B.S. in Computer Engineering (dual-degree) · 2017 – 2021