I am a postdoctoral fellow at the Department of Computer Science and Engineering (CSE), Hong Kong University of Science and Technology (HKUST), advised by Prof. Long Chen. Prior to this, I obtained my PhD degree in Computer Science and Technology from the DCD Lab, Zhejiang University (ZJU), under the supervision of Prof. Jun Xiao.

💬 Research Interests

  • Computer Vision, Machine Learning
  • Vision and Language, Multimodal Generation and Editing

📝 Publications

(Selected works are shown. Full publication list in Google Scholar)

ICML 2025
sym

Event-Customized Image Generation

Zhen Wang, Yilei Jiang, Dong Zheng, Jun Xiao, Long Chen

Paper/Code

  • Customize your event (actions, poses, relations and interactions) with only one single image!
  • Training-free, Plug-and-Play and Effective.
CVPR 2025
sym

IterIS: Iterative Inference-Solving Alignment for LoRA Merging

Hongxu Chen, Zhen Wang, Runshi Li, Bowei Zhu, Long Chen

Paper/Code

  • A sample-efficient and broadly applicable LoRA merging method based on an iterative inference-solving framework with enhanced performance.
ICLR 2025
sym

CLIPDrag: Combining Text-based and Drag-based Instructions for Image Editing

Ziqi Jiang, Zhen Wang, Long Chen

Paper/Code

  • Incorporating text signals into drag-based methods for precise and flexible image editing.
IJCV 2025
sym

Learning combinatorial prompts for universal controllable image captioning

Zhen Wang, Jun Xiao, Yueting Zhuang, Fei Gao, Jian Shao, Long Chen

Paper

  • A lightweight prompt-based framework for controllable image captioning.
  • Effective and efficient for both single and combined controls.
ECCV 2024
sym

Decap: Towards generalized explicit caption editing via diffusion mechanism

Zhen Wang, Xinyun Jiang, Jun Xiao, Tao Chen, Long Chen

Paper

  • A diffusion-based explicit caption editing framework with strong generalization ability across various editing and generation scenarios.
arXiv
sym

Freetuner: Any subject in any style with training-free diffusion

Youcan Xu, Zhen Wang, Jun Xiao, Wei Liu, Long Chen

Paper

  • A flexible and training-free method for compositional personalization.
ECCV 2022
sym

Explicit image caption editing

Zhen Wang, Long Chen, Wenbo Ma, Guangxing Han, Yulei Niu, Jian Shao, Jun Xiao

Paper/Code/Benchmark

  • An interesting new task: explicit caption editing (ECE), and new benchmarks.
  • An effective and efficient ECE model.
- [Lorem ipsum dolor sit amet, consectetur adipiscing elit. Vivamus ornare aliquet ipsum, ac tempus justo dapibus sit amet](https://github.com), A, B, C, **CVPR 2020** # 🔥 News - *2022.02*:  🎉🎉 Lorem ipsum dolor sit amet, consectetur adipiscing elit. Vivamus ornare aliquet ipsum, ac tempus justo dapibus sit amet. - *2022.02*:  🎉🎉 Lorem ipsum dolor sit amet, consectetur adipiscing elit. Vivamus ornare aliquet ipsum, ac tempus justo dapibus sit amet. # 🎖 Honors and Awards - *2021.10* Lorem ipsum dolor sit amet, consectetur adipiscing elit. Vivamus ornare aliquet ipsum, ac tempus justo dapibus sit amet. - *2021.09* Lorem ipsum dolor sit amet, consectetur adipiscing elit. Vivamus ornare aliquet ipsum, ac tempus justo dapibus sit amet. # 📖 Educations - *2019.06 - 2022.04 (now)*, Lorem ipsum dolor sit amet, consectetur adipiscing elit. Vivamus ornare aliquet ipsum, ac tempus justo dapibus sit amet. - *2015.09 - 2019.06*, Lorem ipsum dolor sit amet, consectetur adipiscing elit. Vivamus ornare aliquet ipsum, ac tempus justo dapibus sit amet. # 💬 Invited Talks - *2021.06*, Lorem ipsum dolor sit amet, consectetur adipiscing elit. Vivamus ornare aliquet ipsum, ac tempus justo dapibus sit amet. - *2021.03*, Lorem ipsum dolor sit amet, consectetur adipiscing elit. Vivamus ornare aliquet ipsum, ac tempus justo dapibus sit amet. \| [\[video\]](https://github.com/) # 💻 Internships - *2019.05 - 2020.02*, [Lorem](https://github.com/), China.