ColorwAI: Generative Colorways of Textiles through GAN and Diffusion Disentanglement

Read original: arXiv:2407.11514 - Published 7/17/2024 by Ludovica Schaerf, Andrea Alfarano, Eric Postma

ColorwAI: Generative Colorways of Textiles through GAN and Diffusion Disentanglement

Overview

This paper introduces ColorwAI, a system that can generate new textile colorways using generative adversarial networks (GANs) and diffusion models.
The researchers aim to disentangle the factors of color, texture, and pattern in textile designs to enable more flexible and controllable generative models.
They leverage techniques like CodeGAN and ColorPeel to achieve this disentanglement.

Plain English Explanation

The paper presents a new system called ColorwAI that can create original textile color patterns and designs. Current textile design software often forces designers to work within predefined constraints, limiting their creativity. ColorwAI aims to give designers more flexibility by separating the different components of a textile design - the colors, textures, and patterns - and allowing them to be manipulated independently.

The researchers use a combination of generative adversarial networks (GANs) and diffusion models, which are types of machine learning algorithms, to achieve this disentanglement. By training the models on a dataset of existing textile designs, ColorwAI can learn the underlying structure of textile patterns and generate new ones that blend colors, textures, and motifs in unique ways.

This allows designers to, for example, take an existing textile pattern and experiment with new color palettes, or start with a desired color scheme and have the system generate complementary textures and patterns. The goal is to empower textile designers to be more creative and explore a wider design space.

Technical Explanation

The ColorwAI system consists of several key components:

Texture and Pattern Disentanglement: The researchers use a CodeGAN-inspired architecture to disentangle the texture and pattern information in textile designs. This allows the model to generate new patterns independently of the underlying texture.
Color Disentanglement: To separate the color information, the team employs a ColorPeel diffusion model. This model can generate new color palettes for a given textile pattern, decoupling the color and pattern generation.
Multimodal Integration: The texture/pattern and color models are integrated into a single end-to-end system that can generate complete textile colorways from a latent representation. This allows for fine-grained control over the different design elements.

The researchers evaluate ColorwAI on a dataset of textile images, showing that it can generate diverse and high-quality colorways that blend colors, textures, and patterns in novel ways. They also demonstrate the system's ability to edit existing textile designs by manipulating the latent representations.

Critical Analysis

The ColorwAI system represents an interesting advancement in textile design generation, leveraging state-of-the-art techniques like FashionSD and Multimodal Semantic-Aware Colorization to disentangle and control the different design factors.

One potential limitation is the reliance on a fixed dataset of textile designs, which may not capture the full breadth of possible patterns and color schemes. Exploring ways to generate more diverse and innovative designs, perhaps by incorporating Emergent Interpretable Symbols or other techniques, could further expand the system's capabilities.

Additionally, while the paper demonstrates the system's ability to edit existing designs, it would be interesting to see how ColorwAI performs in a more interactive, user-driven design process. Integrating the system with existing textile design software and gathering feedback from professional designers could uncover new avenues for improvement and adoption.

Conclusion

The ColorwAI system represents an exciting step forward in generative textile design, leveraging state-of-the-art machine learning techniques to disentangle and control the different factors that make up a textile pattern. By empowering designers to explore a wider design space, ColorwAI has the potential to unlock new creative possibilities and accelerate innovation in the textile industry.

As the field of generative design continues to evolve, systems like ColorwAI will likely play an increasingly important role in shaping the future of textile and fashion design, allowing designers to focus on their creativity while automation handles the more tedious aspects of the design process.

This summary was produced with help from an AI and may contain inaccuracies - check out the links to read the original source documents!

Follow @aimodelsfyi on 𝕏 →

Related Papers

ColorwAI: Generative Colorways of Textiles through GAN and Diffusion Disentanglement

Ludovica Schaerf, Andrea Alfarano, Eric Postma

Colorway creation is the task of generating textile samples in alternate color variations maintaining an underlying pattern. The individuation of a suitable color palette for a colorway is a complex creative task, responding to client and market needs, stylistic and cultural specifications, and mood. We introduce a modification of this task, the generative colorway creation, that includes minimal shape modifications, and propose a framework, ColorwAI, to tackle this task using color disentanglement on StyleGAN and Diffusion. We introduce a variation of the InterfaceGAN method for supervised disentanglement, ShapleyVec. We use Shapley values to subselect a few dimensions of the detected latent direction. Moreover, we introduce a general framework to adopt common disentanglement methods on any architecture with a semantic latent space and test it on Diffusion and GANs. We interpret the color representations within the models' latent space. We find StyleGAN's W space to be the most aligned with human notions of color. Finally, we suggest that disentanglement can solicit a creative system for colorway creation, and evaluate it through expert questionnaires and creativity theory.

7/17/2024

Training-free Color-Style Disentanglement for Constrained Text-to-Image Synthesis

Aishwarya Agarwal, Srikrishna Karanam, Balaji Vasan Srinivasan

We consider the problem of independently, in a disentangled fashion, controlling the outputs of text-to-image diffusion models with color and style attributes of a user-supplied reference image. We present the first training-free, test-time-only method to disentangle and condition text-to-image models on color and style attributes from reference image. To realize this, we propose two key innovations. Our first contribution is to transform the latent codes at inference time using feature transformations that make the covariance matrix of current generation follow that of the reference image, helping meaningfully transfer color. Next, we observe that there exists a natural disentanglement between color and style in the LAB image space, which we exploit to transform the self-attention feature maps of the image being generated with respect to those of the reference computed from its L channel. Both these operations happen purely at test time and can be done independently or merged. This results in a flexible method where color and style information can come from the same reference image or two different sources, and a new generation can seamlessly fuse them in either scenario.

9/5/2024

ColorPeel: Color Prompt Learning with Diffusion Models via Color and Shape Disentanglement

Muhammad Atif Butt, Kai Wang, Javier Vazquez-Corral, Joost van de Weijer

Text-to-Image (T2I) generation has made significant advancements with the advent of diffusion models. These models exhibit remarkable abilities to produce images based on textual prompts. Current T2I models allow users to specify object colors using linguistic color names. However, these labels encompass broad color ranges, making it difficult to achieve precise color matching. To tackle this challenging task, named color prompt learning, we propose to learn specific color prompts tailored to user-selected colors. Existing T2I personalization methods tend to result in color-shape entanglement. To overcome this, we generate several basic geometric objects in the target color, allowing for color and shape disentanglement during the color prompt learning. Our method, denoted as ColorPeel, successfully assists the T2I models to peel off the novel color prompts from these colored shapes. In the experiments, we demonstrate the efficacy of ColorPeel in achieving precise color generation with T2I models. Furthermore, we generalize ColorPeel to effectively learn abstract attribute concepts, including textures, materials, etc. Our findings represent a significant step towards improving precision and versatility of T2I models, offering new opportunities for creative applications and design tasks. Our project is available at https://moatifbutt.github.io/colorpeel/.

7/11/2024

🌐

CoDeGAN: Contrastive Disentanglement for Generative Adversarial Network

Jiangwei Zhao, Zejia Liu, Xiaohan Guo, Lili Pan

Disentanglement, a critical concern in interpretable machine learning, has also garnered significant attention from the computer vision community. Many existing GAN-based class disentanglement (unsupervised) approaches, such as InfoGAN and its variants, primarily aim to maximize the mutual information (MI) between the generated image and its latent codes. However, this focus may lead to a tendency for the network to generate highly similar images when presented with the same latent class factor, potentially resulting in mode collapse or mode dropping. To alleviate this problem, we propose texttt{CoDeGAN} (Contrastive Disentanglement for Generative Adversarial Networks), where we relax similarity constraints for disentanglement from the image domain to the feature domain. This modification not only enhances the stability of GAN training but also improves their disentangling capabilities. Moreover, we integrate self-supervised pre-training into CoDeGAN to learn semantic representations, significantly facilitating unsupervised disentanglement. Extensive experimental results demonstrate the superiority of our method over state-of-the-art approaches across multiple benchmarks. The code is available at https://github.com/learninginvision/CoDeGAN.

6/3/2024