Repository logo
 

Diffusion Model for A Virtual Try-On System

aut.relation.conferenceConference on Image and Vision Computing New Zealand
dc.contributor.authorZhang, Yuchao
dc.contributor.authorT. P. Tran, Kien
dc.contributor.authorNguyen, Minh
dc.contributor.authorYan, Wei Qi
dc.date.accessioned2025-12-01T22:56:51Z
dc.date.available2025-12-01T22:56:51Z
dc.date.issued2025-12-21
dc.description.abstractWe present a modular virtual try-on (VTON) system that integrates natural language control, efficient diffusion-based image synthesis, and lightweight garment classification. User intent is parsed by a large language model (LLM) into structured visual prompts. A LoRA-tuned diffusion model generates tryon images conditioned on pose and segmentation maps, while a compact classifier, LightClothNet, handles five-category clothing recognition and pre-filtering. The pipeline is built using ComfyUI nodes and orchestrated via Dify. Compared to the existing methods, the proposed system offers improved realism, garment-pose alignment, and controllability. Our evaluations on the DressCode and VITON-HD datasets show that LoRA fine-tuning enhances fidelity under limited data, while LightClothNet achieves up to 91.76% precision and 0.91 F1-score with low latency. This result demonstrates how multimodal control, lightweight classification, and diffusion generation are unified for fast, flexible, and userdriven VTON applications.
dc.identifier.citationWu, J., Nguyen, M., & Yan, W. Q. (2025). A diffusion model for virtual try-on systems. In 2024 39th International Conference on Image and Vision Computing New Zealand (IVCNZ) (pp. 1–6). IEEE. https://doi.org/10.1109/IVCNZ63833.2024.11281834
dc.identifier.doi10.1109/IVCNZ67716.2025.11281834
dc.identifier.urihttp://hdl.handle.net/10292/20247
dc.publisherIEEE
dc.relation.urihttps://ieeexplore.ieee.org/document/11281834
dc.rightsCopyright © 2025 IEEE. Personal use of this material is permitted. Permission from IEEE must be obtained for all other uses, in any current or future media, including reprinting/republishing this material for advertising or promotional purposes, creating new collective works, for resale or redistribution to servers or lists, or reuse of any copyrighted component of this work in other works.
dc.rights.accessrightsOpenAccess
dc.titleDiffusion Model for A Virtual Try-On System
dc.typeConference Contribution
pubs.elements-id746494

Files

Original bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
Zhang et al_2025_Diffusion model for a virtual try on system.pdf
Size:
1.12 MB
Format:
Adobe Portable Document Format
Description:
Conference contribution

License bundle

Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
license.txt
Size:
1.37 KB
Format:
Plain Text
Description: