DressCode: Autoregressively Sewing and Generating Garments from Text Guidance

Read original: arXiv:2401.16465 - Published 6/18/2024 by Kai He, Kaixin Yao, Qixuan Zhang, Jingyi Yu, Lingjie Liu, Lan Xu

Overview

• The paper "DressCode: Autoregressively Sewing and Generating Garments from Text Guidance" presents a novel deep learning model for generating detailed 3D garments from textual descriptions.

• The model, called DressCode, can autoregressively sew and generate complex garments, such as shirts, pants, and dresses, from input text that describes the desired style and attributes.

• This research advances the field of text-to-image and text-to-3D generation, enabling more detailed and controllable synthesis of clothing.

Plain English Explanation

• The researchers developed an AI system that can create 3D models of clothes based on written descriptions. For example, if you give the system a text description like "a long-sleeved blue denim jacket with front pockets," it can generate a 3D digital model of that jacket.

• This is a challenging task because clothing has complex shapes, textures, and folds that are difficult for computers to capture. The key innovation in this paper is the "autoregressive" approach, where the system builds the garment piece-by-piece, similar to how a human might sew a garment.

• By generating the garment in this step-by-step way, rather than all at once, the system is able to capture the intricate details and realistic draping of the final 3D model. This could be useful for applications like virtual fashion design, online clothing visualization, and even 3D printing of custom garments.

Technical Explanation

• DressCode is built around SewingGPT, a GPT-based architecture that integrates cross-attention with text-conditioned embeddings to autoregressively generate garment sewing patterns from the input text description.

• The text is first encoded into an embedding, which then conditions the autoregressive generation of a sequence of 2D "sewing patterns." The generated pattern pieces are stitched together to form the final 3D garment.

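• For appearance, a pre-trained Stable Diffusion model is tailored to generate tile-based physically based rendering (PBR) textures for the garments. A large language model lets users drive the framework through natural language interaction, including pattern completion and texture editing.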

• Experiments show DressCode can generate a diverse range of clothing types, from simple t-shirts to more complex dresses and coats, with high fidelity to the input text prompts.
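
The autoregressive idea described in the bullets above can be sketched in a few lines of Python. This is a toy illustration under stated assumptions, not the paper's implementation: `toy_next_token_logits` is a hypothetical stand-in for a trained text-conditioned decoder, and the 8-token vocabulary, the `END_TOKEN` id, and the list-of-floats "text embedding" are all invented for the example. It shows only the control flow: each pattern token is chosen given the text and the tokens already emitted, and generation stops at an end token.

```python
import math
from typing import Callable, List, Sequence

END_TOKEN = 0   # hypothetical end-of-sequence id (assumption)
VOCAB_SIZE = 8  # toy vocabulary: 0 = END, 1..7 = quantized pattern values (assumption)


def softmax(logits: Sequence[float]) -> List[float]:
    """Convert raw scores into a probability distribution."""
    peak = max(logits)
    exps = [math.exp(x - peak) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def toy_next_token_logits(text_embedding: Sequence[float],
                          prefix: Sequence[int]) -> List[float]:
    """Hypothetical stand-in for a trained decoder.

    Scores every candidate token given the text condition and the tokens
    generated so far. Toy rule: emit one token per text feature, then END.
    """
    logits = [0.0] * VOCAB_SIZE
    if len(prefix) >= len(text_embedding):
        logits[END_TOKEN] = 5.0  # enough tokens emitted: favor stopping
        return logits
    feature = text_embedding[len(prefix)]  # expected to lie in [0, 1)
    wanted = min(VOCAB_SIZE - 1, 1 + int(feature * (VOCAB_SIZE - 1)))
    logits[wanted] = 5.0
    return logits


def generate_pattern_tokens(
    text_embedding: Sequence[float],
    next_token_logits: Callable[[Sequence[float], Sequence[int]], List[float]]
        = toy_next_token_logits,
    max_len: int = 32,
) -> List[int]:
    """Greedy autoregressive decoding: one token at a time, conditioned on
    the text and on every token produced so far."""
    tokens: List[int] = []
    for _ in range(max_len):
        probs = softmax(next_token_logits(text_embedding, tokens))
        token = max(range(len(probs)), key=probs.__getitem__)  # greedy pick
        if token == END_TOKEN:
            break
        tokens.append(token)
    return tokens


if __name__ == "__main__":
    print(generate_pattern_tokens([0.0, 0.5, 0.99]))
```

With the toy scorer, `generate_pattern_tokens([0.0, 0.5, 0.99])` returns `[1, 4, 7]`. A real system would replace the scorer with a trained model, could sample from the distribution instead of taking the greedy choice, and would map the resulting tokens back to pattern geometry.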

Critical Analysis

• A limitation of the current work is that the generated garments are not fully physically simulated, so the dynamics and motion of the clothing may not be perfectly accurate.

• Additionally, the system is trained on a relatively limited dataset of garment types and styles, so its ability to generalize to more diverse or custom clothing designs may be constrained.

• Future research could explore ways to integrate physical simulation or leverage large-scale fashion datasets to further improve the realism and versatility of the generated garments.

Conclusion

• The DressCode model represents an important step forward in text-to-3D garment generation, enabling more detailed and controllable synthesis of clothing from natural language descriptions.

• This technology could have significant implications for virtual fashion design, online shopping, and even custom clothing manufacturing, by allowing users to easily visualize and create desired garments.

• While the current system has some limitations, the core autoregressive approach and other technical innovations showcased in this paper point to promising directions for continued progress in this emerging field.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Related Papers

DressCode: Autoregressively Sewing and Generating Garments from Text Guidance

Kai He, Kaixin Yao, Qixuan Zhang, Jingyi Yu, Lingjie Liu, Lan Xu

Apparel's significant role in human appearance underscores the importance of garment digitalization for digital human creation. Recent advances in 3D content creation are pivotal for digital human creation. Nonetheless, garment generation from text guidance is still nascent. We introduce a text-driven 3D garment generation framework, DressCode, which aims to democratize design for novices and offer immense potential in fashion design, virtual try-on, and digital human creation. We first introduce SewingGPT, a GPT-based architecture integrating cross-attention with text-conditioned embedding to generate sewing patterns with text guidance. We then tailor a pre-trained Stable Diffusion to generate tile-based Physically-based Rendering (PBR) textures for the garments. By leveraging a large language model, our framework generates CG-friendly garments through natural language interaction. It also facilitates pattern completion and texture editing, streamlining the design process through user-friendly interaction. This framework fosters innovation by allowing creators to freely experiment with designs and incorporate unique elements into their work. With comprehensive evaluations and comparisons with other state-of-the-art methods, our method showcases superior quality and alignment with input prompts. User studies further validate our high-quality rendering results, highlighting its practical utility and potential in production settings. Our project page is https://IHe-KaiI.github.io/DressCode/.

6/18/2024

ClotheDreamer: Text-Guided Garment Generation with 3D Gaussians

Yufei Liu, Junshu Tang, Chu Zheng, Shijie Zhang, Jinkun Hao, Junwei Zhu, Dongjin Huang

High-fidelity 3D garment synthesis from text is desirable yet challenging for digital avatar creation. Recent diffusion-based approaches via Score Distillation Sampling (SDS) have enabled new possibilities but either intricately couple with human body or struggle to reuse. We introduce ClotheDreamer, a 3D Gaussian-based method for generating wearable, production-ready 3D garment assets from text prompts. We propose a novel representation Disentangled Clothe Gaussian Splatting (DCGS) to enable separate optimization. DCGS represents clothed avatar as one Gaussian model but freezes body Gaussian splats. To enhance quality and completeness, we incorporate bidirectional SDS to supervise clothed avatar and garment RGBD renderings respectively with pose conditions and propose a new pruning strategy for loose clothing. Our approach can also support custom clothing templates as input. Benefiting from our design, the synthetic 3D garment can be easily applied to virtual try-on and support physically accurate animation. Extensive experiments showcase our method's superior and competitive performance. Our project page is at https://ggxxii.github.io/clothedreamer.

6/26/2024

GarmentDreamer: 3DGS Guided Garment Synthesis with Diverse Geometry and Texture Details

Boqian Li, Xuan Li, Ying Jiang, Tianyi Xie, Feng Gao, Huamin Wang, Yin Yang, Chenfanfu Jiang

Traditional 3D garment creation is labor-intensive, involving sketching, modeling, UV mapping, and texturing, which are time-consuming and costly. Recent advances in diffusion-based generative models have enabled new possibilities for 3D garment generation from text prompts, images, and videos. However, existing methods either suffer from inconsistencies among multi-view images or require additional processes to separate cloth from the underlying human model. In this paper, we propose GarmentDreamer, a novel method that leverages 3D Gaussian Splatting (GS) as guidance to generate wearable, simulation-ready 3D garment meshes from text prompts. In contrast to using multi-view images directly predicted by generative models as guidance, our 3DGS guidance ensures consistent optimization in both garment deformation and texture synthesis. Our method introduces a novel garment augmentation module, guided by normal and RGBA information, and employs implicit Neural Texture Fields (NeTF) combined with Score Distillation Sampling (SDS) to generate diverse geometric and texture details. We validate the effectiveness of our approach through comprehensive qualitative and quantitative experiments, showcasing the superior performance of GarmentDreamer over state-of-the-art alternatives. Our project page is available at: https://xuan-li.github.io/GarmentDreamerDemo/.

5/22/2024

WordRobe: Text-Guided Generation of Textured 3D Garments

Astitva Srivastava, Pranav Manu, Amit Raj, Varun Jampani, Avinash Sharma

In this paper, we tackle a new and challenging problem of text-driven generation of 3D garments with high-quality textures. We propose WordRobe, a novel framework for the generation of unposed & textured 3D garment meshes from user-friendly text prompts. We achieve this by first learning a latent representation of 3D garments using a novel coarse-to-fine training strategy and a loss for latent disentanglement, promoting better latent interpolation. Subsequently, we align the garment latent space to the CLIP embedding space in a weakly supervised manner, enabling text-driven 3D garment generation and editing. For appearance modeling, we leverage the zero-shot generation capability of ControlNet to synthesize view-consistent texture maps in a single feed-forward inference step, thereby drastically decreasing the generation time as compared to existing methods. We demonstrate superior performance over current SOTAs for learning 3D garment latent space, garment interpolation, and text-driven texture synthesis, supported by quantitative evaluation and qualitative user study. The unposed 3D garment meshes generated using WordRobe can be directly fed to standard cloth simulation & animation pipelines without any post-processing.

7/16/2024