I am a postdoctoral fellow at the Department of Computer Science and Engineering (CSE), Hong Kong University of Science and Technology (HKUST), advised by Prof. Long Chen. Prior to this, I obtained my PhD degree in Computer Science and Technology from the DCD Lab, Zhejiang University (ZJU), under the supervision of Prof. Jun Xiao.

💬 Research Interests

Computer Vision, Machine Learning
Vision and Language, Multimodal Generation and Editing

📝 Publications

(Selected works are shown. Full publication list in Google Scholar)

CVPR 2026

FlowMotion: Training-Free Flow Guidance for Video Motion Transfer

Zhen Wang, Youcan Xu, Jun Xiao, Long Chen

Paper/Code

Efficient video motion transfer with one single RTX 3090.

CVPR 2026 Highlight

FlowDC: Flow-Based Decoupling-Decay for Complex Image Editing

Yilei Jiang, Zhen Wang, Yanghao Wang, Jun Yu, Yueting Zhuang, Jun Xiao, Long Chen

Paper/Code

Efficient and flexible flow-based complex image editing.

ACM TOMM 2026

Freetuner: Any subject in any style with training-free diffusion

Youcan Xu, Zhen Wang, Kexin Li, Jun Xiao, Long Chen

Paper

A flexible and training-free method for compositional personalization.

ICML 2025

Event-Customized Image Generation

Zhen Wang, Yilei Jiang, Dong Zheng, Jun Xiao, Long Chen

Paper/Code

Customize your event (actions, poses, relations and interactions) with only one single image!
Training-free, Plug-and-Play and Effective.

CVPR 2025

IterIS: Iterative Inference-Solving Alignment for LoRA Merging

Hongxu Chen, Zhen Wang, Runshi Li, Bowei Zhu, Long Chen

Paper/Code

A sample-efficient and broadly applicable LoRA merging method based on an iterative inference-solving framework with enhanced performance.

ICLR 2025

CLIPDrag: Combining Text-based and Drag-based Instructions for Image Editing

Ziqi Jiang, Zhen Wang, Long Chen

Paper/Code

Incorporating text signals into drag-based methods for precise and flexible image editing.

IJCV 2025

Learning combinatorial prompts for universal controllable image captioning

Zhen Wang, Jun Xiao, Yueting Zhuang, Fei Gao, Jian Shao, Long Chen

Paper

A lightweight prompt-based framework for controllable image captioning.
Effective and efficient for both single and combined controls.

ECCV 2024

Decap: Towards generalized explicit caption editing via diffusion mechanism

Zhen Wang, Xinyun Jiang, Jun Xiao, Tao Chen, Long Chen

Paper

A diffusion-based explicit caption editing framework with strong generalization ability across various editing and generation scenarios.

ECCV 2022

Explicit image caption editing

Zhen Wang, Long Chen, Wenbo Ma, Guangxing Han, Yulei Niu, Jian Shao, Jun Xiao

Paper/Code/Benchmark

An interesting new task: explicit caption editing (ECE), and new benchmarks.
An effective and efficient ECE model.