Learned Universal Interoperable VIrtual Try-ON
ACM Transactions on Graphics (SIGGRAPH 2026)
†Co-corresponding authors
Our system is a fully automated virtual try-on system that dresses arbitrary 3D garments onto arbitrarily posed humanoid characters without requiring a dressed source avatar, skeleton rigs, or predefined correspondences. Given a rest-posed garment mesh and a target humanoid body, we use SMPL as an intermediate proxy and decompose the dressing task into correspondence prediction, registration, and simulation-based fitting.
First, we estimate dense correspondences in two complementary branches. For clothing-to-body alignment, we predict clothing-to-SMPL correspondences using a DiffusionNet-based surface network that learns partial-to-complete mappings in SMPL UV space, enabling robust handling of complex, multilayer, and non-manifold garments. In parallel, we predict body-to-SMPL correspondences using a diffusion-based correspondence predictor that leverages multi-view consistent diffusion features and semantic cues from a pretrained vision foundation model, allowing robust alignment across large pose variation and stylized body shapes.
Next, we perform two registration steps guided by these correspondences. We first register SMPL inside the garment to obtain a clothing-aligned proxy, and then register an extended SMPL+D model to the target body using additional per-vertex displacements and anisotropic scaling to accommodate extreme proportions.
Finally, we interpolate shape and pose between the registered SMPL and SMPL+D representations to construct a smooth transition sequence, which drives a neural cloth simulator to transfer the garment onto the target body. This staged transition avoids unstable collisions and produces physically plausible draping while preserving garment structure. The resulting pipeline generalizes to unseen garments and bodies and supports automatic, high-quality 3D virtual try-on without human intervention.
Drag to rotate • Scroll to zoom
Drag to rotate • Scroll to zoom