About
I am a final-year Ph.D. student in the ICL group at the School of Artificial Intelligence, Jilin University, advised by Associate Professor Rui Ma. I am also grateful for mentorship from Yilin Wang (Adobe Research) and Zili Yi (Nanjing University).
I received my B.E. from Dalian Maritime University and my M.S. from the School of Computer Science, Jilin University. I am currently interning at Tencent Youtu through the Qingyun Program; previously I interned at Alibaba Amap, Kunlun Tech, Baidu Maps, and JD Digits.
I founded and co-host PaperABC on Bilibili, where we explain recent papers; the channel has grown to over 12,000 subscribers.
Research
My research focuses on building models that manipulate and synthesize visual content in a controllable, personalized way. Core directions include the following.
- Image editing
- Style transfer
- Unified understanding & generation
News
I joined Tencent Youtu as a research intern through the Qingyun Program, working on unified understanding & generation and post-training.
I joined Alibaba Amap as an intern, focusing on physical-aware image generation.
Our paper OmniStyle was accepted to CVPR 2025.
Our paper SigStyle was accepted to AAAI 2025.
Internships
Industry experience in reverse chronological order.
-
Tencent Youtu (腾讯优图) Mar 2026 — PresentQingyun Program (青云计划) · Intern
-
-
Kunlun Tech (昆仑万维) Mar 2024 — Jun 2024AI Story · Research algorithm intern
Multi-subject person image generation.
-
Baidu Maps (百度地图) Mar 2021 — Jun 2021Data mining · Algorithm intern
Real-time bus ETA estimation and optimization; bus departure interval prediction.
-
JD Digits (京东数科) Nov 2020 — Jan 2021Data mining · Algorithm intern
Grey-user information mining and identification systems.
Publications
First / corresponding author
-
Learning to Stylize by Learning to Destylize: A Scalable Paradigm for Supervised Style Transfer
Destylization-driven stylization: we learn to stylize by learning to destylize, enabling scalable supervised style transfer.
-
A Training-Free Framework for High-Fidelity Appearance Transfer via Diffusion Transformers
Tech lead
ICASSP 2026 (CCF-B)
Paper link TBA
-
MXM-CLR: A Unified Framework for Contrastive Learning of Multifold Cross-Modal Representations
IJCV under review (CCF-A)
-
SingleDream: Attribute-Driven T2I Customization from a Single Reference Image
CVM 2025 (CCF-C)
Parameter-efficient fine-tuning with a single reference image for attribute-driven text-to-image customization (style, appearance, shape) via a hypernetwork-enhanced approach.